ansaurus

Question

Answer 1

+5 A:

Use BeautifulSoup or lxml.
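
As a rough idea of what parsing the page for links looks like, here is a minimal sketch. It uses the standard library's html.parser instead of BeautifulSoup or lxml, only so that it runs without extra installs. The LinkCollector class, the extract_links helper and the sample markup are made up for illustration and do not come from the real page.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect href/src attribute values from every start tag."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None
        for name, value in attrs:
            if name in ("href", "src") and value:
                self.links.append(value)


def extract_links(html):
    """Return all href/src values found in the given HTML string."""
    parser = LinkCollector()
    parser.feed(html)
    return parser.links


# Hypothetical sample markup, not taken from the real page
sample = '<a href="/videos/1.shtml">One</a><embed src="/player.swf">'
print(extract_links(sample))
```

This only sees what is in the static HTML. A link that the page builds at runtime will not show up in the result.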

Ignacio Vazquez-Abrams 2010-01-27 10:38:04

+1 for BeautifulSoup

The MYYN 2010-01-27 10:45:18

Thanks. The problem is that I can't get the video link. If I press the play button on the web page, I can download the video file with the Firefox extension "Download Helper", but I would like to do this automatically. Any help, please?

mmm286 2010-01-27 10:47:28

Then you've misrepresented the problem. If you need to decompile the SWF file then you'll have to look elsewhere.

Ignacio Vazquez-Abrams 2010-01-27 10:53:11

The videos are generated by JavaScript, so you can't really use an HTML parser for that.

ghostdog74 2010-01-27 11:17:58

Answer 2

A:

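import re
from urllib.request import urlopen  # Python 3

url = 'http://www.rtve.es/mediateca/videos/20100125/saber-comer---salsa-verde-judiones-25-01-10/676590.shtm'
# Download the page source and decode it to text
text = urlopen(url).read().decode('utf-8', errors='replace')
# Non-greedy match, so each .flv URL is captured separately
reg = re.compile(r'http://www\.rtv[^"\'\s]*?\.flv')
print(reg.findall(text))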

Normally you can use something like this, but your link is not in the page source.
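
One caveat with patterns of this kind: a greedy `.*` can swallow everything between the first URL and the last "flv" on a line. Here is a small offline sketch of the difference. The sample fragment and both URLs in it are made up for illustration.

```python
import re

# Hypothetical page fragment with two .flv URLs on a single line
sample = ('<a href="http://www.rtve.es/a/one.flv">1</a>'
          '<a href="http://www.rtve.es/b/two.flv">2</a>')

# Greedy: .* runs to the last "flv" on the line
greedy = re.compile(r'http://www\.rtv.*flv')
# Non-greedy and limited to characters that can appear in an unquoted URL
lazy = re.compile(r'http://www\.rtv[^"\'\s]*?\.flv')

print(greedy.findall(sample))  # one long match spanning both links
print(lazy.findall(sample))    # each URL as its own match
```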

bluszcz 2010-01-27 10:42:00

You can also use the BeautifulSoup library mentioned above, or mechanize.

bluszcz 2010-01-27 10:42:43

Thanks. The problem is that I can't get the video link. If I press the play button on the web page, I can download the video file with the Firefox extension "Download Helper", but I would like to do this automatically. Any help, please?

mmm286 2010-01-27 10:48:04

Answer 3

A:

@OP, those videos are generated by JavaScript. For this topic, see here, or search Google for references.

ghostdog74 2010-01-27 11:21:08

Many thanks. I tried to find an alternative, but I couldn't find anything. I'll have to do the video downloads manually :-(

mmm286 2010-01-27 14:58:09


Help parsing a page with python
