ansaurus

Question

C: Using a select call, when I am reading, how do I keep track of the data?

Answer 1

+2 A:

Yes, the usual approach is to have a buffer of "data I've received but not processed" for each client, large enough to hold the biggest protocol message.

You read into that buffer (always keeping track of how much data is currently in the buffer), and after each read, check to see if you have a complete message (or message(s), since you might get two at once!). If you do, you process the message, remove it from the buffer and shift any remaining data up to the start of the buffer.

Something roughly along the lines of:

for (i = 0; i < nclients; i++)
{
    if (!FD_ISSET(client[i].fd, &read_fds))
        continue;

    nbytes = recv(client[i].fd, client[i].buf + client[i].bytes, sizeof(client[i].buf) - client[i].bytes, 0);

    if (nbytes > 0)
    {
        client[i].bytes += nbytes;

        while (check_for_message(client[i]))
        {
            size_t message_len;

            message_len = process_message(client[i]);
            client[i].bytes -= message_len;
            memmove(client[i].buf, client[i].buf + message_len, client[i].bytes);
        }
    }
    else
        /* Handle client close or error */
}

By the way, you should check for errno == EINTR if select() returns -1, and just loop around again - that's not a fatal error.

caf 2010-01-29 03:13:52

Answer 2

+2 A:

I would keep a structure around for each client. Each structure contains a pointer to a buffer where the command is read in. Maybe you free the buffers when they're not used, or maybe you keep them around. The structure could also contain the client's fd in it as well. Then you just need one array (or list) of clients which you loop over.

The other reason you'd want to do this, besides the fact that 256 bytes might not be enough, is that recv doesn't always fill the buffer. Some of the data might still in transit over the network.

If you keep around buffers for each client, however, you can run into the "slowloris" attack, where a single client keeps sending little bits of data and takes up all your memory.

Dietrich Epp 2010-01-29 03:15:28

Great! Never knew about the slowloris attack... Thanks a lot to others as well...

Legend 2010-01-29 03:20:04

Answer 3

+1 A:

It can be a serious pain when you get tons of data like that over a network. There is a constant trade between allocating a huge array or multiple reads with data moves. You should consider getting a ready made linked list of buffers, then traverse the linked list as you read the buffers in each node of the linked list. That way it scales gracefully and you can quickly delete what you've processed. I think that's the best approach and it's also how boost asio implements buffered reads.

Chris H 2010-01-29 03:18:56

Answer 4

+1 A:

If you're dealing with multiple clients a common approach to to fork/exec for each connection. Your server would listen for incoming connections, and when one is made it would fork and and exec a child version of itself that would then handle the "command interpreter" portion of the problem.

This way you're letting the OS manage the client processes--that is, you don't have to have a data structure in your program to manage them. You will still need to clean up child processes in your server as they terminate.

As for managing the buffer...How much data do you expect before you post a response? You may need to be prepared to dynamically adjust the size of your buffer.

Rob Jones 2010-01-29 03:19:17

I was actually thinking of checking for a EOF character which my protocol has... So I guess what I would do is keep appending this buffer to the main data line until I actually see this character.. What do you think?

Legend 2010-01-29 03:26:35

I would be afraid of not having some kind of guard. Do you trust the sender of the data to behave--to always send well-formed data? What about the network? What about the OS? I wouldn't blindly append data of unknown origin to a buffer without a bit of protection in place. Even if you wrote the client, the data is still of unknown origin, i.e. don't trust yourself.

Rob Jones 2010-01-29 03:39:51

I see... Got your logic... Thanks for the pointers...

Legend 2010-01-29 03:47:32

ansaurus

tags:

views:

answers:

C: Using a select call, when I am reading, how do I keep track of the data?

related questions