views:

85

answers:

3

Hi,

I want to get a list of all the wikipedia categories. I can find them here : http://en.wikipedia.org/wiki/Special:Categories Is there a way to download all of them in xml/csv format.

Thank you
Bala

+1  A: 

You might want to look at DBpedia or Freebase. They routinely extract semantic data from Wikipedia's organized segments, like categories, titles, and info boxes. The only downside is you'll probably need to download their entire extraction (several gigabytes), load it into a local Triplestore, and query it with Sparql. With DBpedia, you might be able to skip a few steps by using their online RDF endpoint.

Chris S
+3  A: 

You can download the SQL dump - enwiki-latest-category.sql.gz at http://download.wikimedia.org/enwiki/latest/

tszming
+1  A: 

Use the MediaWiki API: http://en.wikipedia.org/w/api.php?action=query&list=allcategories

Bryan