XML troubles

(Difference between revisions)

Line 3: Line 3:
Our use of the WikiXML tool made us aware of the following problems
Our use of the WikiXML tool made us aware of the following problems
-
* it was developed with and for the Wikipedia 2006 versions, so the current SQL dumps had to be converted to an older 2008 format
+
* it was developed with and for the Wikipedia 2006 versions, so the current (2008) SQL dumps had to be converted to an older 2006 format
* some of the pages became void during the conversion. Examples are the pages of Augsburg, contrasted with Berlin:
* some of the pages became void during the conversion. Examples are the pages of Augsburg, contrasted with Berlin:

Revision as of 12:07, 20 March 2009

This page documents the problems found with the XML version of the GikiCLEF collection, both to fully inform GikiCLEF participants and to help possible future users of the collection.

Our use of the WikiXML tool made us aware of the following problems