tags: (none)
views: 16
answers: 1

There is a website that my company uses that updates information about 3 specific things throughout the day. We use the information from 1 of them, and what we want to do is pull that information as it is added to their site and add it to a page of our own so it's easier to view. Is this even possible? Can anyone point me in the direction of setting this up? It is all text that we want to pull.

+1  A: 

Pick a language (e.g. Perl). Find an HTTP library for it (e.g. LWP). Fetch the page and run it through an HTML parser (e.g. HTML::TreeBuilder). Pull out the bits you want and shove them into a template (e.g. TT), then dump to a file. Stick the program in cron or Windows Scheduler.
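The same pipeline can be sketched in Python using only the standard library (urllib.request playing the role of LWP, html.parser the role of HTML::TreeBuilder). The URL and the `class="update"` selector below are placeholders, not details from the actual site:

```python
# Sketch of the scrape-and-extract steps from the answer above.
# Assumption: the data we want lives in <div class="update"> elements.
from html.parser import HTMLParser
# import urllib.request  # needed only for the real network fetch

class TextGrabber(HTMLParser):
    """Collect the text inside every <div class="update"> element."""
    def __init__(self):
        super().__init__()
        self.depth = 0       # > 0 while inside a target div
        self.updates = []

    def handle_starttag(self, tag, attrs):
        if self.depth:
            self.depth += 1 if tag == "div" else 0
        elif tag == "div" and ("class", "update") in attrs:
            self.depth = 1
            self.updates.append("")

    def handle_endtag(self, tag):
        if self.depth and tag == "div":
            self.depth -= 1

    def handle_data(self, data):
        if self.depth:
            self.updates[-1] += data

def scrape(html):
    parser = TextGrabber()
    parser.feed(html)
    return [u.strip() for u in parser.updates]

# A real run would fetch the page first (the HTTP step), e.g.:
#   html = urllib.request.urlopen("http://example.com/page").read().decode()
sample = '<div class="update">Price: 42</div><div class="other">x</div>'
print(scrape(sample))  # ['Price: 42']
```

The extracted strings can then be written into your own HTML template and the script dropped into cron or Windows Task Scheduler, exactly as the answer describes.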

David Dorward
I don't know Perl; what other languages can I use?
shinjuo
Whatever you like. The principles are still the same.
David Dorward
What do you think the easiest way to do it is?
shinjuo
Perl, but you don't know the language, so it probably isn't the easiest way for you. You might try paying someone to do it for you; that's usually pretty easy.
David Dorward
I would like to try to learn to do it myself. Is there any way to do it using PHP or JavaScript?
shinjuo
I also know C and C++
shinjuo
Yes, you can use any of those languages (although you couldn't use JS in a browser environment). Pick one. Then find an HTTP … I'm repeating my answer aren't I?
David Dorward
Okay, one last question, I think. Are an HTTP parser and an HTML parser the same thing?
shinjuo
No (why would I put them as separate steps if they were?). HTTP is a means of fetching data over a network. HTML is a language for describing the structure and semantics of text. Some libraries handle both functions, but they aren't the same thing.
David Dorward
The reason I asked is that I found a website with an HTML parser.
shinjuo