ansaurus

Question

how to search for specific file type with yahoo search API?

Answer 1

A:

Yes, there is:

http://developer.yahoo.com/search/boss/boss_guide/Web_Search.html#id356163

Tiago 2009-02-07 00:30:34

Answer 2

A:

Thank you. I found myself that something like this works OK (file type is the first argument, and query is the second):

format = sys.argv[1]

query = " ".join(sys.argv[2:])

srch = create_search("Web", app_id, query=query, format=format)

2009-02-08 21:54:07

Answer 3

A:

Here's what I do for this sort of thing. It exposes more of the parameters so you can tune it to your needs. This should print out the first ten PDFs URLs from the query "resume" [mine's not one of them ;) ]. You can download those URLs however you like.

The json dictionary that gets returned from the query is a little gross, but this should get you started. Be aware that in real code you will need to check whether some of the keys in the dictionary exist. When there are no results, this code will probably throw an exception.

The link that Tiago provided is good for knowing what values are supported for the "type" parameter.

from yos.crawl import rest
APPID="XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX"
base_url = "http://boss.yahooapis.com/ysearch/%s/v%d/%s?start=%d&amp;count=%d&amp;type=%s" + "&appid=" + APPID
querystr="resume"
start=0
count=10
type="pdf"
search_url = base_url % ("web", 1, querystr, start, count, type)
json_result = rest.load_json(search_url)
for url in [recs['url'] for recs in json_result['ysearchresponse']['resultset_web']]:
    print url

Owen 2009-07-24 14:52:02

ansaurus

tags:

views:

answers:

how to search for specific file type with yahoo search API?

related questions