
Wikipedia:WikiProject Wikidemia/Quant/Arch

From Wikipedia, the free encyclopedia

Parser

  • The parser converts the zipped XML database dumps into CSV files with file specification:....
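
A minimal sketch of such a parser is given here for illustration; it is not the project's actual implementation. It assumes the dump is bzip2-compressed MediaWiki export XML, and because the file specification above is not given, the CSV column names (`title`, `rev_id`, `timestamp`, `username`, `ip`) and the function names are hypothetical. The dump is streamed with `iterparse` so that it never has to fit in memory.

```python
import bz2
import csv
import xml.etree.ElementTree as ET

# Hypothetical column layout; the project's real file specification is not stated.
FIELDS = ["title", "rev_id", "timestamp", "username", "ip"]


def local(tag):
    """Strip any '{namespace}' prefix from an element tag."""
    return tag.rsplit("}", 1)[-1]


def revisions(xml_stream):
    """Yield one header row (a dict) per <revision> element in the stream."""
    title = None
    for _event, elem in ET.iterparse(xml_stream, events=("end",)):
        name = local(elem.tag)
        if name == "title":
            # <title> closes before any <revision> of the same page.
            title = elem.text
        elif name == "revision":
            row = dict.fromkeys(FIELDS, "")
            row["title"] = title or ""
            for child in elem:
                child_name = local(child.tag)
                if child_name == "id":
                    row["rev_id"] = child.text or ""
                elif child_name == "timestamp":
                    row["timestamp"] = child.text or ""
                elif child_name == "contributor":
                    # A contributor carries either <username> or <ip>.
                    for part in child:
                        key = local(part.tag)
                        if key in ("username", "ip"):
                            row[key] = part.text or ""
            yield row
            elem.clear()  # release the revision text once the header is extracted
        elif name == "page":
            elem.clear()


def dump_to_csv(dump_path, csv_path):
    """Stream a bzip2-compressed dump into a CSV of revision headers."""
    with bz2.open(dump_path, "rb") as src, \
            open(csv_path, "w", newline="", encoding="utf-8") as dst:
        writer = csv.DictWriter(dst, fieldnames=FIELDS)
        writer.writeheader()
        for row in revisions(src):
            writer.writerow(row)
```

Only header fields are written; the revision text is discarded, which keeps the CSV small enough to load into a statistics package.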

Stats

  • CSV files of header information can be read into the statistical software packages R and Stata.

Analysis


Figure Production


Table Production


Data Anomalies


In the Indonesian Wikipedia dump, usernames occasionally appear in the <ip> tag (e.g. user:Vyasa). These cases appear to be confined to 2003. It is not clear why this occurs.
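
One way to check how widespread this anomaly is would be to flag every <ip> value that does not parse as an IP address and tally the hits by year. The sketch below assumes each revision header is available as a dict with hypothetical `ip` and `timestamp` keys (the timestamp starting with a four-digit year); the function names are illustrative only. Any other non-address value, such as a partially masked IP, would be counted as well, so flagged rows should be inspected by hand.

```python
import ipaddress
from collections import Counter


def looks_like_ip(value):
    """Return True if value parses as an IPv4 or IPv6 address."""
    try:
        ipaddress.ip_address(value.strip())
        return True
    except ValueError:
        return False


def anomalies_by_year(rows):
    """Count rows whose non-empty 'ip' field is not a valid IP address, per year."""
    counts = Counter()
    for row in rows:
        value = row.get("ip", "")
        if value and not looks_like_ip(value):
            # Assumes ISO-style timestamps such as 2003-05-01T12:00:00Z.
            counts[row["timestamp"][:4]] += 1
    return counts
```

If the anomaly really is confined to 2003, the resulting counter should contain that year as its only key.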