Using servlets, I need to send requests to predefined URLs and parse the returned pages to find the image tags and the keywords associated with each image, so that I can either collect the image links or download the images themselves.

+1  A: 

Are the pages you need to fetch valid HTML/XML/XHTML? If so, you can just use a DOM parser from your language's library. If not, I recommend searching for the img tags manually and reading their src attributes (image tags use src, not href). The keywords part isn't easy. I suppose you could grab all the meta tags, text nodes, titles and alt attributes, then count word frequency to pick out keywords.
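
Here is a rough sketch of the manual route in Java, since you mentioned servlets. It assumes the page is already in a String and that attribute values are quoted. The class and method names (ImageScraper, findImages, topKeywords) are made up for illustration. It reads each img tag's src (the attribute that holds the image URL) and alt, then ranks words by how often they occur. A regex like this will trip over unusual markup, so treat it as a starting point rather than a real HTML parser.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper; not part of any library.
class ImageScraper {
    // Captures the attribute text of each <img ...> tag (case-insensitive).
    private static final Pattern IMG_TAG =
        Pattern.compile("<img\\b([^>]*)>", Pattern.CASE_INSENSITIVE);
    // Matches name="value" or name='value'; unquoted values are ignored (assumption).
    private static final Pattern ATTRIBUTE =
        Pattern.compile("([a-zA-Z-]+)\\s*=\\s*(?:\"([^\"]*)\"|'([^']*)')");

    // Returns src -> alt for every img tag that has a src, in document order.
    static Map<String, String> findImages(String html) {
        Map<String, String> images = new LinkedHashMap<>();
        Matcher tag = IMG_TAG.matcher(html);
        while (tag.find()) {
            String src = null;
            String alt = "";
            Matcher attr = ATTRIBUTE.matcher(tag.group(1));
            while (attr.find()) {
                String name = attr.group(1).toLowerCase(Locale.ROOT);
                String value = attr.group(2) != null ? attr.group(2) : attr.group(3);
                if (name.equals("src")) {
                    src = value;
                } else if (name.equals("alt")) {
                    alt = value;
                }
            }
            if (src != null) {
                images.put(src, alt);
            }
        }
        return images;
    }

    // Returns up to `limit` words, most frequent first (ties broken alphabetically).
    // Words shorter than 3 letters are skipped as a crude stop-word filter.
    static List<String> topKeywords(String text, int limit) {
        Map<String, Integer> counts = new HashMap<>();
        for (String word : text.toLowerCase(Locale.ROOT).split("[^a-z]+")) {
            if (word.length() >= 3) {
                counts.merge(word, 1, Integer::sum);
            }
        }
        List<String> words = new ArrayList<>(counts.keySet());
        words.sort((a, b) -> {
            int byCount = counts.get(b) - counts.get(a);
            return byCount != 0 ? byCount : a.compareTo(b);
        });
        return words.subList(0, Math.min(limit, words.size()));
    }
}
```

You would feed topKeywords the text you collected from the title, meta tags, alt attributes and so on. Relative src values still need to be resolved against the page URL before you can download anything.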

apphacker