I am not aware of any Java classes that will read the ASCII characters and ignore the rest, but the easiest thing I can come up with here is to use the strings
utility (assuming you are on a Unix-based system).
SYNOPSIS
strings [ - ] [ -a ] [ -o ] [ -t format ] [ -number ] [ -n number ]
[--] [file ...]
DESCRIPTION
Strings looks for ASCII strings in a binary file or standard
input.
Strings is useful for identifying random object files and
many other
things. A string is any sequence of 4 (the default) or more
printing
characters ending with a newline or a null. Unless the -
flag is
given, strings looks in all sections of the object files except
the
(_TEXT,_text) section. If no files are specified standard input
is
read.
You could then pipe the output to another file and do whatever you want with it.
Edit: with the additional information that all the ASCII comes at the beginning, it would be a little easier to extract the text programmatically; still, this is faster than writing code.