ansaurus

Question

Why does SQLite take such a long time to fetch the data?

Answer 1

+4 A:

I'm not sure about sqlite, but most databases do not calculate the entire result set right away when you execute a query. They compile and optimize the query, then begin running it by fetching the first few rows.

Sort of like how it is quick to open a large file, but then it takes a long time to read all of the contents.

Harold L 2010-04-17 19:10:59

Answer 2

+2 A:

Why not take a look at the query plan and determine the answer yourself? Here's a reference: http://stackoverflow.com/questions/1454188/how-can-i-analyse-a-sqlite-query-execution

dcp 2010-04-17 19:12:49

Have tried this and updated my post.

Derk 2010-04-17 19:33:40

Answer 3

A:

Use Begin Transaction and End Transaction anytime you write an sql query. This is explained here plus lots of other useful info:

SQLite Optimization FAQ

You can check the indexing details on where clause here:

SQLite Query Optimizer Overview

Aseem Gautam 2010-04-17 19:22:51

If you don't use BEGIN/END TRANSACTION, each statement gets its own transaction. Using explicit transactions improves efficiency with multiple statements, but the OP is only executing one.

dan04 2010-04-17 19:32:18

I am not writing.

Derk 2010-04-17 19:33:16

Answer 4

+3 A:

You probably should be using JOINs here and your use of correlated subqueries might well be slowing things down. Whilst it is true that often the optimizer can execute a cross join with a where clause as if it were a join, and can also execute correlated subqueries as joins, I wouldn't count on SQLite being able to do this in all cases. Rewriting your query with JOINs I think gives this:

SELECT DISTINCT prod2.value AS id
FROM product_to_value
JOIN features ON product_to_value.feature = features.int
JOIN product_to_value as prod2 ON product_to_value.product = prod2.product
JOIN featurevalues as featval3 ON prod2.value = featval3.id
JOIN featurevalues ON product_to_value.value = featurevalues.id
WHERE features.id = ?
AND featurevalues.id IN (?,?)
AND featval3.feature IN (?,?,?,?)

Try this and see it is faster (and still gives the correct result).

Mark Byers 2010-04-17 19:29:17

SQLite translates JOINs to the WHERE clause. I am pretty sure the query does what has to do.

Derk 2010-04-17 19:33:04

The problem I get with the query you posted is that it returns duplicate values. I could solve this with a DISTINCT or a temporary table, but this would be very slow, especially when it has to deal with a huge result set.I therefore figured I might go around that using a sub-query on every row of a smaller result set from featval3.IN(?,?,?,?).

Derk 2010-04-17 20:12:48

@Derk: OK, I missed that. It was difficult to understand what your query was doing. I have added the DISTINCT. Correlated subqueries can be slow because you might need multiple scans over the tables/indexes instead of just one scan.

Mark Byers 2010-04-17 20:17:15

Because the 4 joins create a huge result set. SQLite compares each row with the previous in case of a DISTINCT. I could speed it up with a temporary table, but that would still be slow :(

Derk 2010-04-17 20:21:39

The sub-query takes about 0.3ms to perform. The sub-query should run about 150 times (amount of featval3 records left after IN (?,?,?,?)), so that would be 50ms for the query to run. That's why I don't get whats going on..

Derk 2010-04-17 20:28:36

@Derk: About 0.3ms? For every single input? Are you sure that when you were timing it, that the results weren't already cached for the inputs you used?

Mark Byers 2010-04-17 20:45:43

I timed it wrong, I see now. Yes, 0.3ms for the query to run, but also 1.2ms for fetching that single result. So 1.2 times 150 featval3 rows makes 180ms :P. I now see why it's so "slow". Is there a way of going around this so I don't have to read the value from the database. Instead use an index or something? And why is the fetching so slow?

Derk 2010-04-17 20:49:02

@Derk: The optimizer should automatically select the appropriate index. So make sure that you have all the appropriate indexes (primary keys are also indexes) - post your table definitions if you are unsure. I am not familiar with the SQLite explain syntax unfortunately, so I can't see if you are missing an index from reading that output.

Mark Byers 2010-04-17 20:54:34

What I just said isn't right either. Let me try again: To fetch one row from the sub-query takes less than 0.01ms, the query itself takes 0.3ms. Therefore 0.31 times 150 is <30ms. So it still doesn't make any sense to me. I figure it's still an index problem.

Derk 2010-04-17 21:05:06

It looks like `prod2.value = featval3.id` doesn't seem to be making use of an index. When I recreate it in this way `prod2.value IN ('string')` it is extremely fast.

Derk 2010-04-17 21:35:54

@Derk: Strange.... I can't explain why that works. But it's good you found a solution.

Mark Byers 2010-04-17 21:48:40

ansaurus

tags:

views:

answers:

Why does SQLite take such a long time to fetch the data?

related questions