ansaurus

Question

Python Selenium handling Timeout Exceptions with a long list of URLs

Answer 1

+2 A:

Try the following:

from selenium import selenium
import unittest, time, re, csv, logging

class Untitled(unittest.TestCase):
    def setUp(self):
        self.verificationErrors = []
        self.selenium = selenium("localhost", 4444, "*firefox", "http://example.com")
        self.selenium.start()
        self.selenium.set_timeout("60000")

    def test_untitled(self):
        sel = self.selenium
        spamReader = csv.reader(open('SubDomainList.csv', 'rb'))
        for row in spamReader:
            try:
                sel.open(row[0])
            except Exception, e:
                ofile = open('outputTest.csv', 'ab')
                ofile.write("error on %s: %s" % (row[0],e))
            else:
                time.sleep(5)
                html = sel.get_html_source()
                ofile = open('output4001-5000.csv', 'ab')
                ofile.write(html.encode('utf-8') + '\n')
            ofile.close()

    def tearDown(self):
        self.selenium.stop()
        self.assertEqual([], self.verificationErrors)

if __name__ == "__main__":
     unittest.main()

Some comments:

You don't need a wait_for_page_to_load after an open, that will cause you timeouts because once the page is loaded after the opeen, it will start waiting again and the page will not be loading.
Most of the failures you get from selenium (timeouts, object not found) can be caught with try-except statements
You should set the timeout in your tests withing the test itself (using set_timeout), that way it doesn't depend on the way you start the server, it will always wait the time you wanted

Santi 2009-10-25 21:48:28

you da man - testing now.

KenBurnsFan1 2009-10-25 22:01:26

I've got issues somewhere, but I'm sure this is the answer. I will continue tweaking tonight. Thanks!

KenBurnsFan1 2009-10-25 22:06:42

Ups, the failure was caused for some identation issues, fixed now.I also added the encoding before writing the file, just in case your website has some unicode chars on it.

Santi 2009-10-26 03:46:06

You hit the nail on the head. I spent the better half of the day trying to figure out how to escape non-ascii chars -- when I could have just looked back at your answer to this question. You da man!

KenBurnsFan1 2009-10-31 06:34:00

Hah! Unicode and all it's madness.. Let's hope Python3 comes soon so we can forget about this kind of issues.

Santi 2009-10-31 15:03:03

ansaurus

tags:

views:

answers:

Python Selenium handling Timeout Exceptions with a long list of URLs

related questions