Hi all, I'm playing around with a web page fetcher in Java right now, and I'm curious what the best way to resolve relative links in Java is.
Say I have a link, e.g. http://www.nytimes.com/2010/07/08/technology/personaltech/08pogue.html?ref=technology
and when I crawl that page I might find img src paths like
"../public/images/header.jpg"
"../../test/logo.gif"
where each path is relative to the directory the page is in, so it may point up into one of the parent directories.
Question is: is there a Java lib that can turn these into absolute URLs like
http://www.nytimes.com/2010/07/08/technology/public/images/header.jpg ?
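To make it concrete, here is roughly what I'm after. The JDK's java.net.URI has a resolve() method that looks like it might already do this. This is only a minimal sketch, assuming the img src values are valid URI strings (URI.create throws IllegalArgumentException on things like unescaped spaces). The class and method names are made up for illustration.

```java
import java.net.URI;

// Hypothetical helper; the class and method names are only for illustration.
class RelativeLinkResolver {

    // Resolves a link found on a page (relative or absolute) against the page's own URL.
    static String resolve(String pageUrl, String link) {
        URI base = URI.create(pageUrl);
        // resolve() drops the base's file name and query, applies the "../"
        // segments, and returns a normalized URI. Absolute links are returned as-is.
        return base.resolve(link).toString();
    }

    public static void main(String[] args) {
        String page = "http://www.nytimes.com/2010/07/08/technology/personaltech/08pogue.html?ref=technology";

        // prints http://www.nytimes.com/2010/07/08/technology/public/images/header.jpg
        System.out.println(resolve(page, "../public/images/header.jpg"));

        // prints http://www.nytimes.com/2010/07/08/test/logo.gif
        System.out.println(resolve(page, "../../test/logo.gif"));
    }
}
```

Note this only uses the page URL as the base, so a page that declares its own base URL in the HTML would need that value passed in instead.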
Thanks!