Dear Pythonistas,
I have the following link:
the reference part of the url has the following information:
A7 == The parliament (current is the seventh parliament, the former is A6 and so forth)
2010 == year
0001 == document number
For every year and parliament I would like to identify the number of documents on the website. The task is complicated by the fact that for 2010, for instance, numbers 186, 195,196 have empty pages, while the max number is 214. Ideally the output should be a vector with all the document numbers, excluding the missing ones.
Can anyone tell me if this is possible in python?
Best, Thomas