In this page we provide information on the problems found with the XML version of the GikiCLEF collection, in order to both fully inform GikiCLEF participants and help other possible future users of it.
Our use of the WikiXML tool made us aware of the following problems
- it was developed with and for the Wikipedia 2006 versions, so the current SQL dumps had to be converted to an older 2008 format
- some of the pages became void during the conversion. Examples are the pages of Augsburg, contrasted with Berlin:
- finally, some pages, although apparently correct XML, do not get displayed in the browsers (tested with Firefox and IE), so before complaining that they are void, check the View source. Current suspect is a style with display:none
![[Main Page]](/GikiCLEF/images/logoGikiCLEF.png)