I am implementing a simple TCP server using the select() method. Everything works and performance is quite acceptable, but when benchmarking with ab (ApacheBench), the "longest request" time is insanely high compared to the averages:

I am using: ab -n 5000 -c 20 http://localhost:8000/

snippet:

Requests per second:    4262.49 [#/sec] (mean)
Time per request:       4.692 [ms] (mean)
Time per request:       0.235 [ms] (mean, across all concurrent requests)

Percentage of the requests served within a certain time (ms)
  50%      2
  66%      2
  75%      2
  80%      2
  90%      2
  95%      3
  98%      3
  99%      4
 100%    203 (longest request)

and the same against apache:

Requests per second:    5452.66 [#/sec] (mean)
Time per request:       1.834 [ms] (mean)
Time per request:       0.183 [ms] (mean, across all concurrent requests)

Percentage of the requests served within a certain time (ms)
  50%      1
  66%      2
  75%      2
  80%      2
  90%      3
  95%      3
  98%      4
  99%      4
 100%      8 (longest request)

For reference, I am using stream_select(), and the sockets are non-blocking.
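For context, here is a minimal, self-contained sketch of the kind of non-blocking stream_select() readiness check described above. A Unix socket pair stands in for a real client connection, and all names are illustrative, not the asker's actual server code:

```php
<?php
// Hypothetical demo: a Unix socket pair stands in for a real client
// connection so the stream_select() readiness check can be shown
// self-contained.
$pair = stream_socket_pair(STREAM_PF_UNIX, STREAM_SOCK_STREAM, STREAM_IPPROTO_IP);
list($a, $b) = $pair;
stream_set_blocking($a, false);
stream_set_blocking($b, false);

fwrite($b, "GET / HTTP/1.0\r\n\r\n");   // simulate a client sending a request

$read   = [$a];                          // watch $a for readability
$write  = null;
$except = null;
// 0 s / 200000 us: poll with a 200 ms cap instead of blocking forever
$ready = stream_select($read, $write, $except, 0, 200000);

$data = '';
if ($ready > 0) {
    $data = fread($a, 8192);             // non-blocking read of the request
}
fclose($a);
fclose($b);
echo strlen($data), " bytes ready\n";    // prints "18 bytes ready"
```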

Is this a common effect of using the select() call?
Are there any performance considerations I should worry about?

Update:

When using a concurrency value <= 6, the longest request is "normal" (about 2x or 3x the average), but anything above 6 goes wild (for example, 7 concurrent requests may benchmark the same as 20: around 200 ms).

Update2:

After replacing the stream functions with the equivalent socket functions, and some proper testing/benchmarking, the issue no longer occurs, so I will attribute this behavior to some obscure detail in the PHP implementation of streams.
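A hedged sketch of the socket-extension equivalent the update refers to: socket_select() in place of stream_select(). This assumes the PHP sockets extension is loaded; the AF_UNIX pair is a stand-in for real TCP clients, and the names are illustrative:

```php
<?php
// Hedged sketch: socket_select() in place of stream_select().
// The AF_UNIX pair stands in for real TCP client connections.
socket_create_pair(AF_UNIX, SOCK_STREAM, 0, $pair);
list($a, $b) = $pair;
socket_set_nonblock($a);
socket_set_nonblock($b);

socket_write($b, "ping");                // simulate client traffic

$read   = [$a];
$write  = null;
$except = null;
// same 200 ms polling cap as the stream version
$ready  = socket_select($read, $write, $except, 0, 200000);

$data = '';
if ($ready > 0) {
    $data = socket_read($a, 8192);       // drain the readable socket
}
socket_close($a);
socket_close($b);
```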

A: 

Since 99% of your requests complete within 4 ms, that tends to implicate a one-off cost, such as a DNS lookup or swapping a large chunk of your code in from disk.

caf
Thank you, that makes sense - however: this happens on localhost/local network too, the code is a single PHP file already running, and the benchmarks were repeated several times...
jcinacio
+1  A: 

You could use Wireshark or another sniffer to track the TCP/IP traffic. That way you can see whether the problem has to do with low-level issues (retransmissions, packet loss, etc.).

Toad
I have considered it, but I have NO clue how to "debug" hundreds of requests... any ideas?
jcinacio
Only one request is the one that takes forever. Make sure every request has a unique ID that is logged client-side, and transmit that ID somewhere in the TCP stream. Once you know which ID is the slowest, search the recorded Wireshark logs for that same ID. Then examine just that request and compare it to the ones that were fast.
Toad
+1  A: 

200ms sounds like a scheduler time quantum.

Just to be sure: are you using a NULL or nonzero timeout for select()? Are you writing to sockets that are only ready for reads, or vice versa? Are you processing every fd that select() returns before calling select() again? It would be really nice to see some code...
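To make those questions concrete, here is a sketch of the loop shape being asked about; every name (serviceReadyFds, $server, $clients) is invented for illustration. The key point is that every fd select() reports ready gets serviced before select() is called again:

```php
<?php
// Sketch of the loop shape the questions above ask about. Key point:
// drain EVERY fd select() reports ready before selecting again.
function serviceReadyFds($server, array &$clients): int
{
    $read   = array_merge([$server], $clients);
    $write  = null;
    $except = null;
    // bounded 200 ms timeout so a stall is visible rather than a hang
    if (stream_select($read, $write, $except, 0, 200000) < 1) {
        return 0;
    }
    $handled = 0;
    foreach ($read as $fd) {                      // handle ALL ready fds
        if ($fd === $server) {
            if ($c = @stream_socket_accept($server, 0)) {
                stream_set_blocking($c, false);   // keep clients non-blocking
                $clients[] = $c;
            }
        } else {
            fread($fd, 8192);                     // ...parse, queue reply...
        }
        $handled++;
    }
    return $handled;
}

// Listening socket on an ephemeral port, for demonstration only;
// a real server wraps serviceReadyFds() in a while (true) loop.
$server = stream_socket_server("tcp://127.0.0.1:0", $errno, $errstr);
stream_set_blocking($server, false);
$clients = [];
```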

I don't think it would be the network if you're testing against localhost. But reinier is right: it looks a lot like what you'd see if there were a TCP retransmit (200 ms is the minimum TCP retransmission timeout on a reasonably modern Linux).

Keith Randall
OK, I have managed to debug the TCP traffic AND I am seeing a retransmission of the initial GET request from the client to the server. Debugging the client connections does indeed show a very short lifetime, so the problem SHOULD be caused by the retransmit. So... any clues on what I could try next?
jcinacio
Retransmission times are typically set globally. On Windows it's done with some obscure registry setting; on Linux I don't know.
Toad