Dear Coding Experts,
Edit: Just for clarification, I am using Python and would like to do this within Python.
I am in the middle of collecting data for a research project at our university. Basically, I need to scrape a lot of information from a website that monitors the European Parliament. Here is an example of what the URL of one page looks like:
The parts after "reference" in the address mean: A7 = the parliament in session (previous parliaments are A6, etc.), 2010 = year, 0190 = number of the file.
What I want to do is create a variable that holds all the URLs for the different parliaments, so that I can loop over it and scrape the information from each page.
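A minimal sketch of how such a list of URLs could be built from the three parts of the reference. The helper name `build_url`, the parliament, year and file-number ranges, and the zero-padding of the file number to four digits (inferred from the `0190` in the example) are all assumptions made for illustration; not every combination will correspond to an existing report.

```python
from itertools import product

# URL pattern taken from the attempt in the P.S.; {ref} is the part that varies.
BASE = ("http://www.europarl.europa.eu/sides/getDoc.do"
        "?type=REPORT&mode=XML&reference={ref}&language=EN")


def build_url(parliament, year, file_number):
    """Hypothetical helper: build the URL for e.g. parliament 7, year 2010, file 190."""
    # The reference looks like A7-2010-0190; padding to 4 digits is an assumption.
    ref = "A{0}-{1}-{2:04d}".format(parliament, year, file_number)
    return BASE.format(ref=ref)


# Illustrative ranges only; replace them with the values you actually need.
parliaments = [6, 7]
years = [2009, 2010]
file_numbers = range(1, 4)

# One URL per (parliament, year, file number) combination.
urls = [build_url(p, y, n) for p, y, n in product(parliaments, years, file_numbers)]
```

Looping over `urls` then gives one address per request; combinations that do not exist on the site would have to be skipped when the request fails.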
Any help is much appreciated!
With kind regards,
Thomas Jensen
P.S: I have tried this:
number = range(1,190,1)
for i in number:
    search_url = "http://www.europarl.europa.eu/sides/getDoc.do?type=REPORT&mode=XML&reference=A7-2010-" + str(number[i]) + "&language=EN"
    results = search_url
    print results
but this gives me the following error:
Traceback (most recent call last):
  File "", line 7, in
IndexError: list index out of range
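For reference, the IndexError comes from using each value of the list as an index into that same list. `range(1,190,1)` holds the 189 values 1 to 189 at positions 0 to 188, so `number[i]` already returns the next value rather than `i`, and on the last pass `number[189]` does not exist. A possible corrected version, as a sketch only: it uses `i` directly, collects the URLs in a list, and pads the file number to four digits, which is an assumption based on the `0190` in the example.

```python
# The loop value i is already the file number, so no indexing is needed.
prefix = ("http://www.europarl.europa.eu/sides/getDoc.do"
          "?type=REPORT&mode=XML&reference=A7-2010-")
suffix = "&language=EN"

search_urls = []
for i in range(1, 190):  # same bounds as the original attempt: 1 to 189
    # "%04d" turns 1 into "0001" (assumed padding, to match 0190)
    search_urls.append(prefix + "%04d" % i + suffix)

for search_url in search_urls:
    print(search_url)
```

Note that the range stops at 189; the upper bound would need to be 191 if file 0190 itself should be included.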