views:

98

answers:

8

I'm trying to use

"value1:value2::value3".split(":");

Problem is that I want it to include the blank results.

It returns: [value1, value2, value3]
It should be: [value1, value2, , value3]

Does anyone know the regexp to fix this?

Ok I found cause of problem. I'm actually reading a text file and it contains this line:

123:;~\&:ST02:M:test:M:4540145::type;12:51253:D:2.2:567766::AL:::::::2.2b

When I process this line reading the text file it produces the erroneous result mentioned above, which is it doesn't include any empty results in cases like this: :::::.

But when I use the above line in a test program it doesn't compile and I get a "invalid escape sequence". I think its because of the "\&".

Is there a workaround to this problem by using a regular expression?

+1  A: 

I don't honestly see the big draw of split. StringTokenizer works just as well for most things like this and will easily send back the tokens (so you can tell there was nothing in between :: ).

I just wish it worked a little better with the enhanced for loop, but that aside, it wouldn't hurt to give it a try.

I think there is a regexp trick to get your matched tokens to return as well but I've gone 20 years without learning regexp and it's still never been the best answer to any problem I've tackled (Not that I would actually know since I don't ever use it, but the non-regexp solutions are generally too easy to beat.)

Bill K
+1  A: 

I think that a StringTokenizer might work better for you, YMMV

mezmo
+4  A: 

Works for me.

class t {
    public static void main(String[] _) {
        String t1 = "value1:value2::value3";
        String[] t2 = t1.split(":");
        System.out.println("t2 has "+t2.length+" elements");
        for (String tt : t2) System.out.println("\""+tt+"\"");
    }
}

gives the output

$ java t
t2 has 4 elements
"value1"
"value2"
""
"value3"
crazyscot
+4  A: 

split does include empty matches in the result, have a look at the docs here. However, by default, trailing empty strings (those at the end of the array) are discarded. If you want to include these as well, try split(":", -1).

casablanca
Thanks... adding a -1 actually solved the special characters issue when reading from text file. It also included empty strings.
Marquinio
+1  A: 

Use a negative limit in your split statement:

String str = "val1:val2::val3";
String[] st = str.split(":", -1);
for (int i = 0; i< st.length; i++)
    System.out.println(st[i]);

Results:

val1
val2

val3
Matthew Flynn
+1  A: 
public static void main(String[] args){
  String[] arr = "value1:value2::value3".split(":");
  for(String elm:arr){
    System.out.println("'"+elm+"',");
  }
  System.out.println(arr.length);
}

prints

'value1',
'value2',
'',
'value3',
4

Which is exactly what you want. Your mistake is somewhere else...

f1sh
A: 

That should work but give StringTokenizer a go if you're still having issues.

AHungerArtist
+1  A: 

Using Guava's Splitter class:

Iterable<String> split = Splitter.on(':').split("value1:value2::value3");

Splitter does not omit empty results by default, though you can make one that does. Though it seems from what others are saying that what you're doing should work as well.

ColinD