ansaurus

Question

TCP Connection Seems to Receive Incomplete Data

Answer 1

+8 A:

The recv function can receive as little as 1 byte, you may have to call it multiple times to get your entire payload. Because of this, you need to know how much data you're expecting. Although you can signal completion by closing the connection, that's not really a good idea.

Update:

I should also mention that the send function has the same conventions as recv: you have to call it in a loop because you cannot assume that it will send all your data. While it might always work in your development environment, that's the kind of assumption that will bite you later.

Tim Sylvester 2009-12-03 03:31:44

Answer 2

A:

recv returns right away with whatever is in the buffer (upto MAXBUF). If the buffer is being written to at the same time you might not get all the data

gnibbler 2009-12-03 03:35:07

Answer 3

+1 A:

You should probably have some kind of sequence of characters to signal termination of the file transfer, and only when you read those at the end of a block do you break out of your recv loop.

Of course, you will have to find a sequence that won't occur in your files, or that can be easily escaped. If you're working with text files this is pretty easy, but if not you'll have to be clever.

Alternatively, the client could first send the file size (in a separate send call), so the server knows how many bytes to expect in the file transfer.

Platinum Azure 2009-12-03 03:56:32

Is this really necessary? It seems like the while loop get the job done a lot more cleanly.

SooDesuNe 2009-12-03 05:30:14

Well, if the client machine goes down in the middle of a transfer, the server would have no way of knowing the file transfer is incomplete. This is a way to insure that the server knows exactly when a successful transfer has happened. Otherwise, either the file size is smaller than it should be or the sentinel control sequence hasn't been seen yet to signal the end of the transmission-- but with just a naive while loop, the server wouldn't recognize an interruption if the client's end of the socket is shut down "against its wishes"... no?

Platinum Azure 2009-12-03 05:38:27

@Platinum Azure A leading size value is simpler and more efficient than special control sequences. It lets you allocate buffers appropriately, pre-allocate files to reduce fragmentation, etc., as well as eliminating any complexity having to do with escaping and searching/matching sequences.

Tim Sylvester 2009-12-03 17:30:46

I'm aware. I just presented both options.

Platinum Azure 2009-12-03 19:55:16

Answer 4

A:

What TCP ensures is that your message will get to the remote peer correctly. As long as it fits in the send buffer, it will be automatically split into smaller chunks and sent by the local peer, and reordered and reassembled by the remote peer. It is not uncommon for a route to dynamically change while you are sending a message, which you would have to reorder manually (the smaller chunks) before delivering to your application.

As for your actual data transfer, your application needs to agree on a custom protocol. For instance, if you are only sending one message (the file), the sender could signal the receiver that it does not intend to write anymore to the socket (with shutdown(sock, SHUT_WR)), this way recv() returns with 0 and you know the transfer is complete (this is how a HTTP/1.0 server signals the client the transfer is complete). If you intend to send more data, then this alternative is not appropriate.

Another way would be to let the receiver know how much data the sender is going to transmit by including a header, for instance. It does not need to be overly elaborate, you could simply reserve the first 8 bytes to send the length as a 64-bit unsigned integer. In this case, you still need to be careful about byte ordering (big-endian / little-endian).

There is a very useful tutorial on network programming for UNIX environments:

Beej's Guide to Network Programming

You could refer to it to get a quick start, then refer back to the book for completeness, if you need. Even though you did not ask for additional references, TCP/IP Illustrated Vol. 1 and UNIX Network Programming Vol. 1 (both by W. Richard Stevens, the latter with a recent third edition) are excellent references.

rnsanchez 2009-12-03 04:10:33

I have read Beej's guide several times (some times it takes me a few times), and I agree, it is a great reference.It should be noted, that some of the functions Beej mentions, such as gethostbyname() are not available on iPhone (or rather, they're not included in the headers)

SooDesuNe 2009-12-03 04:29:57

Right, I'd assume you also need something iPhone-specific in this case, so that you can use equivalent calls (if you need them, of course).

rnsanchez 2009-12-03 05:45:46

Answer 5

+3 A:

Tim Sylvester and gnibbler both have very good answers, but I think the most clear and complete is a combination of the two.

The recv() fuction return immediatly with whatever is the in the buffer. This will be somewhere between 1 byte and MAXBUF. If the buffer is being written to while recv returns, you wont have the entire data that was sent in the buffer yet.

So you need to call recv() multiple times, and concatenate the data, to get everything that was sent.

A convenient way to do this (since we are working in cocoa) is to use NSMutableData like:

NSMutableData *fileData = [[NSMutableData alloc] init];  //Don't forget to release
while ((readCounter = recv(socket2, file, MAXBUF, 0)) > 0){  
    if (readCounter == -1){
     NSLog(@"server could not read filename from socket");
     close(socket2);
     continue;
    }
    else{
     NSLog(@"server reading file of size: %i", readCounter);
     [fileData appendData:[NSData dataWithBytes:file length:readCounter]];
    }
    bzero(file, MAXBUF);
    readCounter = 0;
}

SooDesuNe 2009-12-03 04:24:19

ansaurus

tags:

views:

answers:

TCP Connection Seems to Receive Incomplete Data

related questions