ansaurus

Question

Python performance: search large list vs sqlite

Answer 1

A:

I imagine using a python dictionary would allow for much faster searching than using a python list. (Just set the values to 0, you won't need them, and hopefully a '0' stores compactly.)

As for the larger question, I'm curious too. :)

sarnold 2010-08-04 10:25:22

Or just store them in a `set` instead of a `dict`...

tzaman 2010-08-04 10:29:09

Thanks tzaman, didn't realize sets were fast :)

sarnold 2010-08-04 10:42:46

Answer 2

+8 A:

If I understand your question correctly, it seems like you could simply use SQLite's built in conflict handling mechanism.

Assuming you have a UNIQUE constraint on field1, you could simple use:

INSERT OR REPLACE INTO table VALUES (...)

The following syntax is also supported (identical semantics):

REPLACE INTO table VALUES (...)

EDIT: I realise that I am not really answering your question, just providing an alternative solution which should be faster.

Nick 2010-08-04 10:32:33

+1, I was just about to post this myself.

tzaman 2010-08-04 10:35:20

Cool, thanks! Virtual +1, I'm out of votes today. Sigh :)

sarnold 2010-08-04 10:43:21

Great answer! Great functionality. A shame it's not in the SQL-92 specification, so that *all* RLDB's implement this :)

Wez 2010-08-04 11:29:49

Thanks Nick, this is a good solution. As with all good solutions, it has made me realise that I wasn't quite asking the right question.

Andrew Ho 2010-08-04 13:52:09

Answer 3

+1 A:

I'm not familiar with sqlite but a general approach like this should work:

If there's a unique index on field1 and you're trying to insert a value that's already there you should get an error. If insert fails, you go with the update.

Pseudocode:

try
{
    insert into table (value1, value2)
}
catch(insert fails)
{
    update table set field2=value2 where field1=value1
}

Dan Stocker 2010-08-04 10:37:25

Answer 4

A:

You appear to be comparing apples with oranges.

A python list is only useful if your data fit into the address-space of the process. Once the data get big, this won't work any more.

Moreover, a python list is not indexed - for that you should use a dictionary.

Finally, a python list is non-persistent - it is forgotten when the process quits.

How can you possibly compare these?

MarkR 2010-08-21 07:50:27

ansaurus

tags:

views:

answers:

Python performance: search large list vs sqlite

related questions