+2  Q: 

Using GA in GUI

Sorry if this isn't clear as I'm writing this on a mobile device and I'm trying to make it quick.

I've written a basic Genetic Algorithm with a binary encoding (genes) that builds a fitness value and evolves through several iterations using tournament selection, mutation and crossover. As a basic command-line example it seems to work.

The problem I've got is with applying a genetic algorithm within a GUI, as I am writing a maze-solving program that uses the GA to find a path through a maze. How do I turn my random binary-encoded genes and fitness function (adding all the binary values together) into a method of controlling a bot around a maze? I have built a basic GUI in Java consisting of a maze of labels (like a grid), with the available routes in blue and the walls in black.

To reiterate, my GA performs well and contains what any typical GA would (fitness method, get and set population, selection, crossover, etc.), but now I need to plug it into a GUI to get my maze running. What needs to go where in order to get a bot that can move in different directions depending on what the GA says? Rough pseudocode would be great if possible.

As requested, an Individual is built using a separate class (Indiv), with all the main work being done in a Pop class. When a new individual is instantiated, an array of ints represents the genes of said individual, with each gene being a 0 or 1 picked at random. The fitness function merely adds together the values of these genes, and the Pop class handles selection, mutation and crossover of two selected individuals. There's not much else to it; the command-line program just shows evolution over n generations, with the total fitness improving over each iteration.
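Roughly, the Indiv side of things looks like this (a simplified sketch of what I described above; the real class has a bit more to it):

import java.util.Random;

// Simplified sketch of the Indiv class described above: an array of
// random 0/1 genes, with fitness defined as the sum of the genes.
public class Indiv {
  private static final Random rand = new Random();
  private final int[] genes;

  public Indiv(int length) {
    genes = new int[length];
    for (int i = 0; i < length; i++) {
      genes[i] = rand.nextInt(2); // each gene is 0 or 1, picked at random
    }
  }

  public int getFitness() {
    int sum = 0;
    for (int gene : genes) {
      sum += gene; // fitness is simply the sum of the gene values
    }
    return sum;
  }
}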

EDIT: It's starting to make a bit more sense now, although there are a few things that are bugging me...

As Adamski has suggested, I want to create an "Agent" with the options shown below. The problem I have is where the random bit string comes into play. The agent knows where the walls are and has this laid out in a 4-bit string (e.g. 0111), but how does that affect the random 32-bit string (e.g. 10001011011001001010011011010101)? If I have the following maze (x is the start place, 2 is the goal, 1 is a wall):

x 1 1 1 1
0 0 1 0 0
1 0 0 0 2

If I turn left I'm facing the wrong way, and the agent will move completely off the maze if it moves forward. I assume that the first generation of the string will be completely random and that it will evolve as the fitness grows, but I don't get how the string will work within a maze.

So, to get this straight...

The fitness is awarded when the agent is able to make a move and ends up next to a wall.

The genes are a string of 32 bits, split into 16 sets of 2 bits representing the available actions. For the robot to move, those two bits need to be looked up using the four bits from the agent showing its position relative to the walls. If the move would go through a wall it isn't made and is deemed invalid, and if the move is made and a new wall is found then the fitness goes up.

Is that right?

A: 

If I understand correctly, you need to determine how the actions of your bot in the GUI are represented by the outcome of your genetic algorithm? I think that determining the representation that you want to use should be your starting point. So you need to create a mapping for each (group of) 'genes' in your individual to a certain action or a certain change in the movement algorithm of your bot.

As soon as you've chosen a viable representation, the implementation will follow more naturally.

A very simple representation of movement would be to let the genes hard-code a certain route. You can use blocks of four genes to represent the four directions (say north, east, south, west), where a 0 represents 'don't move in this direction' and a 1 represents taking one step that way.

Then the representation 01001101 could be translated as the following movement pattern:

stand still
go one step east
stand still
stand still
go one step north
go one step east
stand still
go one step west
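In Java, that decoding might look something like this (a quick sketch; the north/east/south/west ordering within each block of four is an assumption for illustration):

// Sketch: decode a hard-coded route gene. Each block of four bits maps to
// the four compass directions (assumed order: north, east, south, west);
// a 1 means "take one step in that direction", a 0 means "stand still".
public class RouteDecoder {
  private static final String[] DIRECTIONS = {"north", "east", "south", "west"};

  public static void printRoute(String genes) {
    for (int i = 0; i < genes.length(); i++) {
      if (genes.charAt(i) == '1') {
        System.out.println("go one step " + DIRECTIONS[i % 4]);
      } else {
        System.out.println("stand still");
      }
    }
  }

  public static void main(String[] args) {
    printRoute("01001101"); // prints the movement pattern listed above
  }
}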
Josien
+3  A: 

BadHorse's answer is good if you want to solve one specific maze; you simply interpret your bit string as a sequence of precise instructions to guide the agent through the maze. In this case your fitness is not the sum of the bit string (as you state in your question) but rather some metric measuring how successful the agent was in solving the problem. For example, your fitness might be defined as "straight-line distance from the end of the maze after processing 20 instructions".

Hence, when evaluating each individual you allow it to process the first 20 instructions from your bit string, then compute its fitness, perform any crossovers / mutations, and then create the next generation of individuals.
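As a concrete (hypothetical) example of such a metric, inverted so that a higher value means a fitter individual:

// Sketch of a distance-based fitness measure: straight-line distance from
// the agent's final position to the goal, inverted so that closer = fitter.
public class FitnessExample {
  public static double fitness(int x, int y, int goalX, int goalY) {
    double distance = Math.hypot(goalX - x, goalY - y);
    return 1.0 / (1.0 + distance); // 1.0 at the goal, tending to 0 far away
  }
}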

If you wish to develop your agent to solve any maze, you need to encode rules within your bit string rather than a sequence of instructions. You could define rules based on whether a wall is immediately behind, in front, to the left or to the right of the robot; e.g.

FBLR Action
0000 Move Forward
0001 Move Forward
0010 Turn Right
etc

This gives you a bit string consisting of 16 actions, each action encoded as 2 bits (00 = Move Forward, 01 = Turn Right, 10 = Turn Left, 11 = Move Backwards). When evaluating your agent it simply determines its current state and uses the bit string as a lookup table to determine how it should respond. It then repeats this a certain number of times after which point you evaluate its fitness.

Given this encoding the agent could evolve the rule humans typically use, which is "follow the left-hand wall continuously". Obviously this approach will fail if the maze is not simply connected (i.e. if it contains loops or walls detached from the outer boundary), and in that case you need to encode more state into your rules-based approach (e.g. the agent could respond differently if going over "old ground").
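Just to illustrate the behaviour such a rule table can encode, here is the left-hand rule written out directly (not evolved; purely for reference):

// The classic left-hand-wall rule written out by hand, to show the kind
// of policy a well-evolved rule table could approximate.
public class LeftHandRule {
  enum Move { FORWARD, TURN_LEFT, TURN_RIGHT }

  static Move next(boolean wallInFront, boolean wallToLeft) {
    if (!wallToLeft) {
      return Move.TURN_LEFT;   // keep a hand on the left wall
    } else if (!wallInFront) {
      return Move.FORWARD;     // follow the wall
    } else {
      return Move.TURN_RIGHT;  // blocked left and ahead, so turn away
    }
  }
}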

Hope that helps.

EDIT

In response to your latest edit:

The agent's "sensors", which detect whether it is next to a wall, are not themselves part of the bit string (your gene); perhaps I slightly confused things with my example. The gene only encodes the actions (move forward, etc.), not the sensor states.

You should therefore write code to look up the relevant part of the bit string given a particular combination of sensor readings; e.g.

/**
 * Enumeration describing the four available actions to the agent
 * and a method for decoding a given action from the "bit" string
 * (actually represented using booleans).
 */
public enum Action {
  MOVE_FORWARD, TURN_RIGHT, TURN_LEFT, REVERSE;

  /**
   * Decodes two booleans into an action, treating b1 as the high bit:
   * 00 = Move Forward, 01 = Turn Right, 10 = Turn Left, 11 = Reverse
   * (matching the encoding described above).
   */
  static Action decodeAction(boolean b1, boolean b2) {
    Action ret;

    if (b1) {
      ret = b2 ? Action.REVERSE : Action.TURN_LEFT;
    } else {
      ret = b2 ? Action.TURN_RIGHT : Action.MOVE_FORWARD;
    }

    return ret;
  }
}

/**
 * Class encapsulating the 32-bit "bit string" represented using booleans.
 * Given the state of the four agent inputs the gene will provide a specific
 * action for the agent to perform.
 */
public class Gene {
  private final boolean[] values = new boolean[32];

  public Action getActionForSensorInputs(boolean wallInFront,
    boolean wallBehind, boolean wallToLeft, boolean wallToRight) {

    int i=0;

    // Encode the four sensor inputs as a single integer value by
    // bitwise-ORing each sensor value with a power of 2.
    // The encoded value will be in the range [0, 15].
    if (wallToRight) {
      i |= 0x01;
    }

    if (wallToLeft) {
      i |= 0x02;
    }

    if (wallBehind) {
      i |= 0x04;
    }

    if (wallInFront) {
      i |= 0x08;
    }

    // The look-up index is i * 2 because each action is encoded as 2
    // booleans.
    int index = i * 2;

    // Retrieve the two action bits from the bit string.
    boolean b1 = this.values[index];
    boolean b2 = this.values[index + 1];

    // Finally decode the action to perform.
    return Action.decodeAction(b1, b2);
  }

  // TODO: Add method to support crossover and mutation with other Genes.
}
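For that TODO, something along these lines would do (an untested sketch to drop into the Gene class; the single crossover point and per-bit mutation rate are arbitrary choices, not part of the code above):

  // Single-point crossover: copy this gene up to a random point, then the
  // other parent's gene from that point onwards.
  public Gene crossover(Gene other, java.util.Random rand) {
    Gene child = new Gene();
    int point = rand.nextInt(values.length);
    for (int i = 0; i < values.length; i++) {
      child.values[i] = (i < point) ? this.values[i] : other.values[i];
    }
    return child;
  }

  // Bit-flip mutation: flip each boolean independently with a small
  // probability (e.g. 0.01).
  public void mutate(java.util.Random rand, double mutationRate) {
    for (int i = 0; i < values.length; i++) {
      if (rand.nextDouble() < mutationRate) {
        values[i] = !values[i];
      }
    }
  }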

Given this simple definition of a Gene you could embed this class within an Agent implementation and record how the agent performs with the current gene "installed"; e.g.

// Note: a top-level type cannot be private in Java, so either leave this
// package-private or nest it inside Agent.
enum Direction { NORTH, SOUTH, EAST, WEST }

public class Agent {
  private final Gene gene;
  private int x; // x position in maze (not final: updated as the agent moves)
  private int y; // y position in maze
  private Direction currentDirection;

  public double evaluate() {
    double fitness = 0.0;

    // Perform up to 20 actions and then evaluate fitness.
    for (int i=0; i<20; ++i) {
      // TODO Determine sensor inputs.

      Action action = gene.getActionForSensorInputs(...);

      // TODO: Now apply action to update agent's state.
      // If agent has reached goal exit loop and return fitness 1.0 (max fitness).
      // If agent has exited the maze then exit loop and return 0.0 (min fitness).
    }

    // Calculate fitness after the 20 steps are taken.  For example, take
    // the straight-line distance to the goal,
    // sqrt((goal.x - x) ^ 2 + (goal.y - y) ^ 2), and invert it
    // (e.g. 1.0 / (1.0 + distance)) so that closer to the goal means fitter.

    return fitness;
  }
}
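Finally, to hook this into the GA you already have, the overall loop might look roughly like this (a sketch only; nextGeneration() and repaintMaze() are hypothetical placeholders for whatever your Pop class and GUI provide):

import java.util.List;

// Rough outline of the overall wiring between the GA and the GUI.
public abstract class Evolver {
  protected List<Agent> population;

  // Placeholder for your existing selection / crossover / mutation code.
  protected abstract List<Agent> nextGeneration(List<Agent> current);

  // Placeholder for updating the maze of labels in your GUI.
  protected abstract void repaintMaze(List<Agent> current);

  public void run(int maxGenerations) {
    for (int gen = 0; gen < maxGenerations; gen++) {
      for (Agent agent : population) {
        double fitness = agent.evaluate(); // up to 20 actions in the maze
        // TODO: record fitness against the agent's gene for selection.
      }
      population = nextGeneration(population);
      repaintMaze(population);
    }
  }
}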
Adamski
The latter part is how I want to do it, so it will solve any given maze using rules. The part I have trouble with is assigning the rules. As I understand it, the agent will have its own "lookup" with the values you've given at the top, and then a string of 32 bits, split into 16 sets of 2 bits (actions), is obtained from the genes, with four actions available. I've edited the question to show where I am having trouble so that I won't run out of space here.
AlexT
@AlexT: Sorry for the delay; I've edited my answer and added some sample code to get you started.
Adamski
This looks great so far! I'm going to toy with the code for a bit and try to build a Maze class; if everything goes as planned I'll come back and accept this answer.
AlexT
Ok, no worries. Be aware that I put the code together very quickly and haven't tested it. Also, remember that because there's no "memory" encoded into each state (e.g. the previous 3 squares), the solving power of the solution will be fairly limited, but I think it's a good place to start.
Adamski