ansaurus

Question

Answer 1

+1 A:

You are starting more threads than your system can handle. There is a limit to the number of threads that can be active in one process.

Your application is starting threads faster than they run to completion. If you need to start many threads, you need to do it in a more controlled manner. I would suggest using a thread pool.
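As a rough illustration of the thread pool idea, here is a minimal sketch that caps the number of threads at a fixed size and feeds them work through a queue. The name run_in_pool and the default of 10 workers are hypothetical choices for this example, not something taken from the question's code.

```python
import threading

try:
    import queue              # Python 3
except ImportError:
    import Queue as queue     # Python 2


def run_in_pool(func, items, num_workers=10):
    """Apply func to every item using a fixed number of worker threads.

    Results come back in the same order as items. If func raises,
    the exception object is stored in place of the result.
    """
    items = list(items)
    tasks = queue.Queue()
    results = [None] * len(items)

    def worker():
        while True:
            job = tasks.get()
            if job is None:           # sentinel: no more work for this worker
                break
            index, item = job
            try:
                results[index] = func(item)
            except Exception as exc:
                results[index] = exc  # hand the error back to the caller

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for job in enumerate(items):
        tasks.put(job)
    for _ in threads:
        tasks.put(None)               # one sentinel per worker
    for t in threads:
        t.join()
    return results
```

However many items are submitted, no more than num_workers threads exist at any time, so the per-process thread limit is not hit.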

Tendayi Mawushe 2009-12-02 18:56:01

Answer 2

+2 A:

The "can't start new thread" error almost certainly means that you already have too many threads running within your Python process, and that a resource limit of some kind is causing the request to create a new thread to be refused.

You should probably look at the number of threads you're creating; the maximum number you will be able to create is determined by your environment, but it should be on the order of hundreds at least.
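One way to look at the number of threads is to ask the threading module which threads are currently alive. The helper below is only a sketch; thread_report is a hypothetical name, and where you call it from (for example, just before each Thread.start()) is up to you.

```python
import threading


def thread_report():
    """Return (count, names) for the threads currently alive in this process."""
    threads = threading.enumerate()
    return len(threads), [t.name for t in threads]


count, names = thread_report()
print("%d live thread(s): %s" % (count, ", ".join(names)))
```

If the count keeps growing while the application runs, threads are being started faster than they finish.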

It would probably be a good idea to re-think your architecture here; seeing as this is running asynchronously anyhow, perhaps you could use a pool of threads to fetch resources from another site instead of always starting up a thread for every request.

Another thing to reconsider is your use of Thread.join and Thread.stop; the same effect would probably be better achieved by passing a timeout value to the constructor of HTTPSConnection.

gab 2009-12-02 18:56:42

Answer 3

+2 A:

I think the best way in your case is to set a socket timeout instead of spawning a thread:

h = httplib.HTTPSConnection(self.config['server'], 
                            timeout=self.config['timeout'])

You can also set a global default timeout with the socket.setdefaulttimeout() function.
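A small sketch of the global default, assuming a timeout of 10 seconds (the value is a placeholder for this example). Sockets created after the call pick up the default, which is how it reaches library code that creates its own sockets.

```python
import socket

DEFAULT_TIMEOUT = 10.0  # seconds; hypothetical value for this example

# Applies to every socket object created from now on in this process.
socket.setdefaulttimeout(DEFAULT_TIMEOUT)

# A freshly created socket inherits the default timeout.
sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
inherited = sock.gettimeout()
sock.close()
```

Because the setting is process-wide, it also affects unrelated code that opens sockets, so the per-connection timeout argument is usually the narrower choice.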

Update: See the answers to the question "Is there any way to kill a Thread in Python?" (several are quite informative) to understand why. Thread.__stop() doesn't terminate the thread; it only sets an internal flag so that the thread is considered already stopped.
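Since a thread cannot be killed from outside, the usual alternative is to have it stop itself cooperatively. The sketch below uses a threading.Event as a stop flag; the class name StoppableWorker and the 0.01 second polling interval are hypothetical and only serve the illustration.

```python
import threading


class StoppableWorker(threading.Thread):
    """A thread that checks a flag and exits on its own when asked to."""

    def __init__(self):
        threading.Thread.__init__(self)
        self._stop_requested = threading.Event()
        self.iterations = 0

    def stop(self):
        # Only requests a stop; the thread ends when run() notices the flag.
        self._stop_requested.set()

    def run(self):
        while not self._stop_requested.is_set():
            self.iterations += 1              # stand-in for one unit of real work
            self._stop_requested.wait(0.01)   # sleep, but wake early on stop()
```

This only works if each unit of work is short or has its own timeout; a thread blocked forever in a socket call never gets back to checking the flag, which is another reason to set a socket timeout.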

Denis Otkidach 2009-12-03 11:03:21

This could be useful for me. Thank you.

Oduvan 2009-12-03 18:41:38

Answer 4

+1 A:

If you are trying to set a timeout, why don't you use urllib2?

Prashanth 2009-12-03 15:57:48

urllib2 doesn't have a connection timeout.

Oduvan 2009-12-03 18:39:35

urllib2 does have a timeout: urllib2.urlopen(url[, data][, timeout])

Prashanth 2009-12-04 04:02:00

The `timeout` argument is new in Python 2.6.
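For illustration, here is a minimal sketch of urlopen with the timeout argument. It assumes Python 2.6 or later; the import fallback covers Python 3, where the same function lives in urllib.request. The wrapper name fetch and the 5 second default are hypothetical.

```python
import socket

try:
    from urllib2 import urlopen, URLError       # Python 2.6+
except ImportError:
    from urllib.request import urlopen          # Python 3
    from urllib.error import URLError


def fetch(url, timeout=5.0):
    """Return the response body, or None if the request fails or times out."""
    try:
        response = urlopen(url, timeout=timeout)
        try:
            return response.read()
        finally:
            response.close()
    except (URLError, socket.timeout):
        # Connection problems arrive as URLError; a stalled read can
        # surface as socket.timeout directly.
        return None
```

A call such as fetch("https://example.com/", timeout=2.0) then blocks for a bounded time instead of needing a watchdog thread.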

Denis Otkidach 2009-12-04 07:43:14

Answer 5

+1 A:

I completely rewrote the code, switching from httplib to pycurl.

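import StringIO
import pycurl

c = pycurl.Curl()
c.setopt(pycurl.FOLLOWLOCATION, 1)                   # follow redirects...
c.setopt(pycurl.MAXREDIRS, 5)                        # ...but at most 5 of them
c.setopt(pycurl.CONNECTTIMEOUT, CONNECTION_TIMEOUT)  # seconds allowed for connecting
c.setopt(pycurl.TIMEOUT, COOPERATION_TIMEOUT)        # seconds allowed for the whole transfer
c.setopt(pycurl.NOSIGNAL, 1)                         # no signals for timeouts (needed with threads)
c.setopt(pycurl.POST, 1)
c.setopt(pycurl.SSL_VERIFYHOST, 0)                   # note: disables certificate checks
c.setopt(pycurl.SSL_VERIFYPEER, 0)
c.setopt(pycurl.URL, "https://" + server + path)
c.setopt(pycurl.POSTFIELDS, sended_data)

b = StringIO.StringIO()                              # collects the response body
c.setopt(pycurl.WRITEFUNCTION, b.write)

c.perform()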

Something like that.

I'm testing it now. Thanks to all of you for your help.

Oduvan 2009-12-03 18:46:52

error: can't start new thread