ansaurus

Question

Most efficient way to parse this with the Java Scanner library?

Answer 1

+2 A:

The input looks complex enough to warrant a full-blown parser. I would recommend using a library such as ANTLR ( http://www.antlr.org/ ).

Arne 2010-08-17 13:11:55

May have to take that route, though I don't know if I want to have to "re-write" my code. :\ Already invested a lot of time in it, but thanks for the suggestion. :3

Unrealomega 2010-08-17 13:28:41

Answer 2

+1 A:

I'd first define an enum with the keywords, like:

 public enum Keyword {SECTOR, FLAGS, AMBIENT, EXTRA, COLORMAP, TINT, 
    BOUNDBOX, COLLIDEBOX, CENTER, RADIUS, VERTICES, SURFACES}

Parsing can be done line by line, splitting at whitespace chars. Then I'd convert the first element to an enum from the Keyword class and use a simple switch construct to handle the values:

public Model parse(List<String> lines) {

   Model model = new Model();

   Iterator<String> it = lines.iterator();
   while(it.hasNext()) {
      String[] elements = it.next().split("\\s+");

      switch(Keyword.valueOf(elements[0])) {
        case SECTOR: model.addSector(elements[1]); break;
        case FLAGS: model.addFlags(elements[1]); break;
        // ...
        case VERTICES:
          int numberOfVertices = Integer.parseInt(elements[1]);
          for (int i = 0; i < numberOfVertices; i++) {
             elements = it.next().split("\\s+");
             model.addVertice(i, elements[1]);
          }
          break;
        default:
          // handle malformed line (note: Keyword.valueOf throws
          // IllegalArgumentException for names that are not in the enum)

      }
   }
   return model;
}
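
One caveat with the switch above: Keyword.valueOf throws an IllegalArgumentException for any name that is not in the enum, so an unknown first token never reaches the default branch. Below is a minimal sketch of one way to deal with that. The class name KeywordDispatchSketch, the helper keywordOf, the shortened enum and the sample lines are all made up for illustration and are not part of the original code.

```java
import java.util.Arrays;
import java.util.List;

class KeywordDispatchSketch {

    // Shortened, illustrative subset of the Keyword enum from the answer.
    enum Keyword { SECTOR, FLAGS, VERTICES }

    /** Returns the keyword of a line, or null if the first token is not a known keyword. */
    static Keyword keywordOf(String line) {
        String[] elements = line.trim().split("\\s+");
        try {
            return Keyword.valueOf(elements[0]);
        } catch (IllegalArgumentException e) {
            // Enum.valueOf throws for unknown names, including the empty string.
            return null;
        }
    }

    public static void main(String[] args) {
        // Sample lines are invented; the real file format may differ.
        List<String> lines = Arrays.asList("SECTOR 0", "FLAGS 4", "BOGUS 1");
        for (String line : lines) {
            Keyword k = keywordOf(line);
            System.out.println(line + " -> " + (k == null ? "malformed" : k));
        }
    }
}
```

With a helper like this, the caller can treat null as "malformed line" and keep the switch for the known keywords only.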

Andreas_D 2010-08-17 14:04:04

I like the look of this one. Clean, easy, and already checks for malformed files. I may use this for now, for testing purposes.

Unrealomega 2010-08-17 17:49:38

Answer 3

+1 A:

How about this approach:

find next command (SECTOR, FLAGS, AMBIENT LIGHT, EXTRA LIGHT, etc)
no command found? -> output error and stop
map to command implementation 
execute command (pass it the scanner and your state holder)
command impl handles specific reading of arguments
rinse, repeat,...

You will have to create a Command interface:

public interface Command {
    String getName();
    void execute(Scanner in, ReadState state);
}

and a separate implementation of it for each type of command you can encounter:

public class SectorCommand implements Command {
    public String getName() {
        return "SECTOR";
    }
    public void execute(Scanner in, ReadState state) {
        state.setSector(in.nextInt());
    }
}

and some sort of factory to find commands:

import java.util.HashMap;
import java.util.Map;
import java.util.Scanner;

public class CommandFactory {

    private Map<String, Command> commands;
    public CommandFactory() {
        commands = new HashMap<String, Command>();
        addCommand(new SectorCommand());
        // add other commands
    }
    public Command findCommand(Scanner in) {
        for (Map.Entry<String, Command> entry : commands.entrySet()) {
            // findInLine returns the matched text, or null if there is no match
            if (in.findInLine(entry.getKey()) != null) {
                return entry.getValue();
            }
        }
        throw new IllegalArgumentException("No command found");
    }
    private void addCommand(Command command) {
        commands.put(command.getName(), command); 
    }
}

(this code may not compile)
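
For completeness, here is a rough, self-contained sketch of the "find, execute, repeat" loop from the steps at the top of this answer. It is only an illustration under some assumptions: ReadState is not defined in the answer, so a tiny stand-in is used; FlagsCommand and its single-integer argument are invented; and the command name is read with Scanner.next() and looked up in the map directly, which is a simpler swap for the findInLine search in the factory.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Scanner;

class CommandLoopSketch {

    // Hypothetical state holder; the answer does not show ReadState.
    static class ReadState {
        int sector = -1;
        int flags = -1;
    }

    interface Command {
        String getName();
        void execute(Scanner in, ReadState state);
    }

    static class SectorCommand implements Command {
        public String getName() { return "SECTOR"; }
        public void execute(Scanner in, ReadState state) { state.sector = in.nextInt(); }
    }

    // Assumption for illustration: FLAGS takes one decimal integer.
    static class FlagsCommand implements Command {
        public String getName() { return "FLAGS"; }
        public void execute(Scanner in, ReadState state) { state.flags = in.nextInt(); }
    }

    static ReadState read(String input) {
        Map<String, Command> commands = new HashMap<String, Command>();
        for (Command c : new Command[] { new SectorCommand(), new FlagsCommand() }) {
            commands.put(c.getName(), c);
        }
        ReadState state = new ReadState();
        Scanner in = new Scanner(input);
        while (in.hasNext()) {
            String name = in.next();              // find next command
            Command command = commands.get(name); // map to implementation
            if (command == null) {                // no command found -> error and stop
                throw new IllegalArgumentException("No command found for: " + name);
            }
            command.execute(in, state);           // the command reads its own arguments
        }
        return state;
    }

    public static void main(String[] args) {
        ReadState state = read("SECTOR 3\nFLAGS 4\n");
        System.out.println(state.sector + " " + state.flags);
    }
}
```

Looking the token up by key avoids iterating over the map, but it only works if every command name is a single whitespace-delimited token; multi-word names such as AMBIENT LIGHT would need extra handling.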

Adriaan Koster 2010-08-17 14:27:06

Answer 4

A:

If the file is very large, I suggest using java.io.RandomAccessFile: it lets you seek past any region you do not need to parse, and it is very fast. If you map the whole file into memory, it may slow down your application.
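
A small sketch of the seek idea, assuming you already know the byte offset you want to jump to. The class name SeekSketch, the helper names and the sample file contents are invented for the example. Keep in mind that RandomAccessFile.readLine is unbuffered and does not do full charset decoding, so this is only meant to show the skipping part.

```java
import java.io.File;
import java.io.IOException;
import java.io.RandomAccessFile;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;

class SeekSketch {

    /** Reads one line starting at the given byte offset, without reading what comes before it. */
    static String readLineAt(File file, long offset) {
        try (RandomAccessFile raf = new RandomAccessFile(file, "r")) {
            raf.seek(offset);
            return raf.readLine();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Writes a small made-up sample file and reads its second line by offset. */
    static String demo() {
        try {
            File tmp = File.createTempFile("sector", ".txt");
            tmp.deleteOnExit();
            Files.write(tmp.toPath(), "SECTOR 0\nFLAGS 4\n".getBytes(StandardCharsets.US_ASCII));
            // "SECTOR 0\n" is 9 bytes, so offset 9 is the start of the second line.
            return readLineAt(tmp, 9);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```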

Alternatively, you can use java.util.StringTokenizer to split on simple delimiters such as whitespace or commas. It is faster than splitting with a regular expression.
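
For example, something along these lines, as a sketch only. The class name TokenizerSketch, the choice of delimiters (space, tab, comma) and the sample line are assumptions, not taken from the actual file.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.StringTokenizer;

class TokenizerSketch {

    /** Splits a line on spaces, tabs and commas. */
    static List<String> tokens(String line) {
        // Each character in the second argument is a delimiter. Runs of
        // delimiters are skipped, so no empty tokens are returned.
        StringTokenizer st = new StringTokenizer(line, " \t,");
        List<String> out = new ArrayList<String>();
        while (st.hasMoreTokens()) {
            out.add(st.nextToken());
        }
        return out;
    }

    public static void main(String[] args) {
        // Made-up sample line.
        System.out.println(tokens("0:  1.5, 2.0,\t3.25"));
    }
}
```

Unlike String.split, StringTokenizer cannot match patterns, only single delimiter characters, which is where the speed difference comes from.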

Mercy 2010-08-17 14:29:50
