
I'm trying to write some Python code that will establish an invisible relay between two TCP sockets. My current technique is to set up two threads, each one reading and subsequently writing 1kb of data at a time in a particular direction (i.e. 1 thread for A to B, 1 thread for B to A).

This works for some applications and protocols, but it isn't foolproof - sometimes particular applications will behave differently when running through this Python-based relay. Some even crash.

I think this is because when I finish performing a read on socket A, the application on that side considers its data to have already arrived at B, when in fact I - the devious man in the middle - have yet to send it on to B. If B isn't ready to receive the data (so that my send() blocks for a while), we are now in a state where A believes it has successfully sent data to B, yet I am still holding that data, waiting for the send() call to complete. I suspect this is the cause of the behavioural differences I've seen in some applications while using my current relaying code. Have I missed something, or does that sound correct?

If so, my real question is: is there a way around this problem? Is it possible to only read from socket A when we know that B is ready to receive data? Or is there another technique that I can use to establish a truly 'invisible' two-way relay between [already open & established] TCP sockets?
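For reference, the two-thread technique described above can be sketched roughly like this (the function names and the 1 KB buffer size are illustrative, not the asker's actual code):

```python
import socket
import threading

def pipe(src: socket.socket, dst: socket.socket) -> None:
    """Copy bytes from src to dst, 1 KB at a time, until EOF."""
    while True:
        data = src.recv(1024)
        if not data:           # peer closed its side of the connection
            break
        dst.sendall(data)      # may block while we still "hold" src's data

def relay(a: socket.socket, b: socket.socket) -> None:
    # One thread per direction: A -> B and B -> A.
    t1 = threading.Thread(target=pipe, args=(a, b), daemon=True)
    t2 = threading.Thread(target=pipe, args=(b, a), daemon=True)
    t1.start()
    t2.start()
    t1.join()
    t2.join()
```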

+1  A: 

I don't think that's likely to be your problem.

In general, the sending application can't tell when the receiving application actually calls recv() to read the data: the sender's send() may have completed, but the TCP implementations in the source & destination OS will be doing buffering, flow control, retransmission, etc.

Even without your relay in the middle, the only way for A to "consider its data to have already arrived at B" is to receive a response from B saying "yep, I got it".

David Gelhar
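The point that send() can complete long before the peer ever calls recv() is easy to observe with a local socket pair (a minimal sketch; the function name is made up for illustration):

```python
import socket

def send_completes_early() -> int:
    """Show that send() returns once the kernel has buffered the data."""
    a, b = socket.socketpair()
    # b has not called recv() yet, and never needs to for send() to succeed:
    # the kernel accepts the bytes into its buffer and returns immediately.
    n = a.send(b"hello")
    a.close()
    b.close()
    return n
```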
+4  A: 

Is it possible to only read from socket A when we know that B is ready to receive data?

Sure: use select.select on both sockets A and B (if it returns saying only one of them is ready, use it on the other one), and only read from A and write to B when you know they're both ready. E.g.:

import select

def fromAtoB(A, B):
    # select() returns three lists (readable, writable, exceptional).
    r, w, _ = select.select([A], [B], [])
    if not r:
        select.select([A], [], [])   # block until A has data to read
    elif not w:
        select.select([], [B], [])   # block until B can accept data
    B.sendall(A.recv(4096))
Alex Martelli
I've changed my code to use this, but the original problem is still there - the app I'm testing with still behaves differently with a relay in place. Also, putting this in a while loop causes Python to use a lot of CPU cycles. I should note that the data is being sent with fairly high throughput. If you have any other suggestions I'd love to hear them.
flukes1
I've fixed the CPU issue but still no dice on my original problem. Here's my code: http://pastie.org/910900
flukes1
(1) I'm surprised to hear about the high CPU consumption, since my code doesn't spend any until the data's ready -- maybe data's being sent in tiny packets, but even then, if you want to relay it very promptly, it's hard to think how to improve this (unless you're willing to add, potentially, very long latencies). (2) I'm not surprised that this has little to do with your actual problem -- I just answered the question of yours that I quoted, about reading from A only when we know B is ready for writing; behavior differences may be due to A checking its peer, which you can't fake.
Alex Martelli
Indeed; but I'm actually working with UNIX sockets and there's no way to check their legitimacy, as far as I know. I move the original socket to a safe location before creating my own fake one in its place. Truly baffled by this!
flukes1
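For completeness, a bidirectional relay driven by a single select() loop (a sketch under the same assumptions; not the code from the pastie above) blocks until one side actually has data, so it should not spin the CPU:

```python
import select
import socket

def relay(a: socket.socket, b: socket.socket) -> None:
    """Relay both directions until either side closes."""
    sockets = [a, b]
    other = {a: b, b: a}
    while True:
        # Block here until at least one socket is readable.
        readable, _, _ = select.select(sockets, [], [])
        for s in readable:
            data = s.recv(4096)
            if not data:          # one side closed: stop relaying
                return
            other[s].sendall(data)
```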
A: 

If you're using blocking operations (not async), you'll probably stumble on many problems. My advice is to use asynchronous IO, and for that I suggest using pyevent (python bindings for libevent).

IMO, your concern makes sense if there's very high throughput, which may cause a general slowdown. In that extraordinary situation, your application could drop data due to full buffers, and retransmission (protocol dependent) is very likely to occur. Another possibility in the same scenario is that endpoint applications with strict timeout handling might behave differently.

Additionally, a common problem I see in this kind of application is a missing flush upon disconnect. For example, if endpoint A sends data and disconnects right after that, you must send this data to endpoint B prior to closing the connection with B.

jweyrich
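The flush-on-disconnect behaviour described above can be handled with a TCP half-close: when one endpoint signals EOF, forward any remaining data and then shutdown() the write side toward the other peer rather than closing the whole connection (a sketch; the helper name is hypothetical):

```python
import socket

def forward_chunk(src: socket.socket, dst: socket.socket) -> bool:
    """Relay one chunk; on EOF, propagate a half-close. Returns False at EOF."""
    data = src.recv(4096)
    if data:
        dst.sendall(data)               # flush before ever signalling EOF
        return True
    # src sent FIN: tell dst's peer there is no more data in this
    # direction, but leave the opposite direction open for replies.
    dst.shutdown(socket.SHUT_WR)
    return False
```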
+1  A: 

Perhaps the application you're proxying is poorly written.

For instance, if I call recv(fd, buf, 4096, 0); I'm not promised 4096 bytes. The system makes a best-effort to provide it.

If 1k isn't a multiple of your application's recv or send sizes, and the application is broken, then grouping the data sent into 1k blocks will break the app.

rescrv
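Robust stream handling of the short-read behaviour described above usually means looping until the expected byte count has arrived; something like this hypothetical helper:

```python
import socket

def recv_exactly(sock: socket.socket, n: int) -> bytes:
    """Keep calling recv() until exactly n bytes have arrived (or EOF)."""
    chunks = []
    remaining = n
    while remaining:
        chunk = sock.recv(remaining)    # may return fewer bytes than asked
        if not chunk:                   # connection closed early
            raise ConnectionError(
                "socket closed with %d bytes outstanding" % remaining)
        chunks.append(chunk)
        remaining -= len(chunk)
    return b"".join(chunks)
```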
I see. Is there a way around that?
flukes1
Encourage the application writer to write better software. If you're trying to make a generic proxy, then no, there isn't. If it is for a particular application, try emulating the sizes of send and recv used by the client and server.
rescrv