This paper shows, how we can extract the signal of interest from wikipedia public dump. The fundamental idea lean on the fact each wikipedia page which belongs to specific location included with geographical coordination. Thus at the first phase of our project, we checked for each and every title line in wikipedia dump, whether it is included with geographical coordination. In case of true assumption, we analyse the coordination to discover pages belong to Italy and collect them based on time series data-base. At the final phase of our project we categorize and count our extracted attributes based on different languages and visualize them by plot for days, months and years.

Analyse and visualize signal of interest for Italian zone wikipedia pages

Dashtban Kenari, Seyednima
2017/2018

Abstract

This paper shows, how we can extract the signal of interest from wikipedia public dump. The fundamental idea lean on the fact each wikipedia page which belongs to specific location included with geographical coordination. Thus at the first phase of our project, we checked for each and every title line in wikipedia dump, whether it is included with geographical coordination. In case of true assumption, we analyse the coordination to discover pages belong to Italy and collect them based on time series data-base. At the final phase of our project we categorize and count our extracted attributes based on different languages and visualize them by plot for days, months and years.
2017-03-23
File in questo prodotto:
File Dimensione Formato  
847677-1190358.pdf

accesso aperto

Tipologia: Altro materiale allegato
Dimensione 2.15 MB
Formato Adobe PDF
2.15 MB Adobe PDF Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14247/19085