views: 393
answers: 3
Hi, I'm building a scraper using Python 2.5 and BeautifulSoup, but I've stumbled upon a problem... part of the web page is generated after the user clicks a button, which starts an AJAX request by calling a specific JavaScript function with the proper parameters.

Is there a way to simulate the user interaction and get this result? I came across the mechanize module, but it seems to me that it is mostly used to work with forms...

I would appreciate any links or code samples. Thanks!

+1  A: 

No, you can't do that easily. AFAIK your options are, easiest first:

  1. Read the AJAX JavaScript code yourself, as a human programmer, understand it, and then write Python code to simulate the AJAX calls by hand. You can also use capture software to record requests/responses made live and try to reproduce them in code;
  2. Use Selenium or some other browser automation tool to fetch the page in a real web browser (see the sketch after this list);
  3. Use a Python JavaScript runner like SpiderMonkey or PyV8 to run the JavaScript code, and hook it up to your copy of the HTML DOM.
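
For option 2, a minimal sketch using the Selenium WebDriver Python bindings might look like the following; the URL and the button's CSS selector are placeholders, and note that WebDriver requires a newer Python than 2.5:

import time
from selenium import webdriver
from BeautifulSoup import BeautifulSoup as bs_parse

driver = webdriver.Firefox()           # a real browser, so the JavaScript actually runs
driver.get('http://example.com/page')  # placeholder URL
# click the button that fires the AJAX call (placeholder selector)
driver.find_element_by_css_selector('#load-more').click()
time.sleep(2)                          # crude wait for the AJAX response to render
page = bs_parse(driver.page_source)    # parse the fully rendered DOM
driver.quit()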
nosklo
Hi, the first option wouldn't be that easy because the JavaScript is packed. Thanks for the heads up, I'll look it over first thing tomorrow.
nabizan
@nabizan: that's why I also suggested using capture software in option 1
nosklo
@nosklo: hi, so after a little digging into the unpacked JavaScript I found out what I should call and how. Then it was quite easy (see my answer for more info).
nabizan
A: 

OK, so I have figured it out... it was quite simple once I realised that I could use a combination of urllib, urllib2 and BeautifulSoup:

import urllib, urllib2
from BeautifulSoup import BeautifulSoup as bs_parse

# url and values are the AJAX endpoint and the request parameters
# discovered by unpacking the page's JavaScript
data = urllib.urlencode(values)    # encode the parameters as a POST body
req  = urllib2.Request(url, data)  # a Request with a body is sent as POST
res  = urllib2.urlopen(req)
page = bs_parse(res.read())        # parse the AJAX response
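
(Passing a data argument to urllib2.Request turns the request into a POST, which matches what the page's JavaScript sends here; for a GET-style AJAX call you would append the encoded parameters to the URL instead.)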
nabizan