I've implemented several different "scanners" in java, from the Scanner class to simply using
String.split("\ss+")
but when there are several whitespaces in a row like "the_quick____brown___fox"
they all tokenize certain white spaces (Imagine the underscores are whitespaces). Any suggestions?