Are there any HTML parsers that parse HTML docs offline, i.e. stored on your computer? If so, can anyone name some good ones please?
UPDATE: Hah, NVM, found the answer, would anyone be able to provide an example of this in html Jericho?
UPDATE2: I thought I had found the answer but I am wrong, mistook InputStream for FileInputStream :(