I have a MySQL database that I'm porting to PostgreSQL (because of GIS features).

Many of the tables have hundreds of thousands of rows, so I need to keep performance in mind.

My problem is that PostgreSQL seems abysmally slow...

For example, if I do a simple SELECT * FROM [table] on a particular table in the MySQL database, let's say one that has 113,000 rows, the query takes around 2 seconds to return the data. In PostgreSQL, the exact same query on the same table takes almost 10 seconds.

Similarly, I have another table with fewer rows (88,000), and it's worse! MySQL takes 1.3 seconds, PostgreSQL takes 30 seconds!

Is this what I can expect from PostgreSQL, or is there something I can do to make it better?

My OS is XP, and I'm running a 2.7 GHz dual core with 3 GB RAM. The MySQL database is version 5.1, running stock standard. The PostgreSQL database is version 8.4, and I have edited the configuration as follows:

shared_buffers = 128MB
effective_cache_size = 512MB
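For reference, the values actually in effect can be confirmed from a live session; a quick sketch using SHOW and the standard pg_settings catalog view:

-- Confirm which settings the server is actually running with:
SHOW shared_buffers;
SHOW effective_cache_size;

SELECT name, setting, unit
FROM pg_settings
WHERE name IN ('shared_buffers', 'effective_cache_size');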

Thanks!

Here is the structure of the second table that has around 88,000 rows:

CREATE TABLE nodelink
(
  nodelinkid serial NOT NULL,
  workid integer NOT NULL,
  modifiedbyid integer,
  tabulardatasetid integer,
  fromnodeid integer,
  tonodeid integer,
  materialid integer,
  componentsubtypeid integer,
  crosssectionid integer,
  "name" character varying(64) NOT NULL,
  description character varying(256) NOT NULL,
  modifiedbyname character varying(64) NOT NULL, -- Contains the values from the old engine's ModifiedBy field, since they don't link with any user
  linkdiameter double precision NOT NULL DEFAULT 0, -- The diameter of the Link
  height double precision NOT NULL,
  width double precision NOT NULL,
  length double precision NOT NULL,
  roughness double precision NOT NULL,
  upstreaminvert double precision NOT NULL,
  upstreamloss double precision NOT NULL,
  downstreaminvert double precision NOT NULL,
  downstreamloss double precision NOT NULL,
  averageloss double precision NOT NULL,
  pressuremain double precision NOT NULL,
  flowtogauge double precision NOT NULL,
  cctvgrade double precision NOT NULL,
  installdate timestamp without time zone NOT NULL,
  whencreated timestamp without time zone NOT NULL,
  whenmodified timestamp without time zone NOT NULL,
  ismodelled boolean NOT NULL,
  isopen boolean NOT NULL,
  shapenative geometry,
  shapewgs84 geometry,
  CONSTRAINT nodelink_pk PRIMARY KEY (nodelinkid),
  CONSTRAINT componentsubtype_nodelink_fk FOREIGN KEY (componentsubtypeid)
      REFERENCES componentsubtype (componentsubtypeid) MATCH SIMPLE
      ON UPDATE NO ACTION ON DELETE NO ACTION,
  CONSTRAINT crosssection_nodelink_fk FOREIGN KEY (crosssectionid)
      REFERENCES crosssection (crosssectionid) MATCH SIMPLE
      ON UPDATE NO ACTION ON DELETE NO ACTION,
  CONSTRAINT fromnode_nodelink_fk FOREIGN KEY (fromnodeid)
      REFERENCES node (nodeid) MATCH SIMPLE
      ON UPDATE NO ACTION ON DELETE NO ACTION,
  CONSTRAINT material_nodelink_fk FOREIGN KEY (materialid)
      REFERENCES material (materialid) MATCH SIMPLE
      ON UPDATE NO ACTION ON DELETE NO ACTION,
  CONSTRAINT tabulardataset_nodelink_fk FOREIGN KEY (tabulardatasetid)
      REFERENCES tabulardataset (tabulardatasetid) MATCH SIMPLE
      ON UPDATE NO ACTION ON DELETE NO ACTION,
  CONSTRAINT tonode_nodelink_fk FOREIGN KEY (tonodeid)
      REFERENCES node (nodeid) MATCH SIMPLE
      ON UPDATE NO ACTION ON DELETE NO ACTION,
  CONSTRAINT user_nodelink_fk FOREIGN KEY (modifiedbyid)
      REFERENCES awtuser (userid) MATCH SIMPLE
      ON UPDATE NO ACTION ON DELETE NO ACTION,
  CONSTRAINT work_modellink_fk FOREIGN KEY (workid)
      REFERENCES "work" (workid) MATCH SIMPLE
      ON UPDATE NO ACTION ON DELETE NO ACTION
)
WITH (
  OIDS=FALSE
);
ALTER TABLE nodelink OWNER TO postgres;
COMMENT ON TABLE nodelink IS 'Contains all of the data that describes a line between any two nodes.';
COMMENT ON COLUMN nodelink.modifiedbyname IS 'Contains the values from the old engine''s ModifiedBy field, since they don''t link with any user';
COMMENT ON COLUMN nodelink.linkdiameter IS 'The diameter of the Link';


I've played a bit more with the select statement. If I just do "Select NodeLinkID from NodeLink", the query is much quicker - less than a second to get 88,000 rows. If I do "Select NodeLinkID, shapenative from NodeLink", the query takes a long time - around 8 seconds. Does this shed any light on what I'm doing wrong?
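It may: if those geometry columns are large, much of the extra time is likely spent detoasting and transferring them rather than scanning the table. A rough way to gauge their per-row size, sketched with the standard pg_column_size function against the schema above:

-- Rough per-row size of the geometry columns vs. the whole row
-- (pg_column_size reports the stored, possibly compressed, datum size):
SELECT avg(pg_column_size(shapenative)) AS avg_native_bytes,
       avg(pg_column_size(shapewgs84))  AS avg_wgs84_bytes,
       avg(pg_column_size(nodelink.*))  AS avg_row_bytes
FROM nodelink;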


More findings:

CREATE INDEX nodelink_lengthIDX on nodelink(length);

ANALYZE nodelink;

-- Executing query: SELECT * FROM nodelink WHERE Length BETWEEN 0 AND 3.983
-- Total query runtime: 3109 ms. 10000 rows retrieved.

-- Executing query: SELECT nodelinkID FROM nodelink WHERE Length BETWEEN 0 AND 3.983
-- Total query runtime: 125 ms. 10000 rows retrieved.

In MySQL, the first query is done in around 120ms, the second is done in around 0.02ms.
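To separate server-side execution time from client fetch-and-render time, the queries can be run under EXPLAIN ANALYZE, which executes the plan but discards the result rows; a sketch:

-- Reports the chosen plan and the server-side runtime only:
EXPLAIN ANALYZE SELECT * FROM nodelink WHERE length BETWEEN 0 AND 3.983;

EXPLAIN ANALYZE SELECT nodelinkid FROM nodelink WHERE length BETWEEN 0 AND 3.983;

If the two runtimes are close, the gap seen in the client is spent fetching and rendering rows, not executing the query.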



Question resolution:

Well folks, it seems that it was all a storm in a teacup...

mjy had it right:

"How did you measure those timings - in your application or the respective command line interfaces?"

To test this theory, I put together a simple console app that ran the same query on the MySQL db, and the PGSQL database. Here is the output:

Running MySQL query: [SELECT * FROM l_model_ldata]
MySQL duration = [2.296875]
Running PGSQL query: [SELECT * FROM nodelink]
PGSQL duration = [2.875]

So the results are comparable. It seems that the pgAdmin tool that comes with PostgreSQL is quite slow. Thanks to everyone for their suggestions and assistance!
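For future readers: the same check can be made without writing a console app by timing the statement in psql, which avoids pgAdmin's result-grid rendering; a sketch of an interactive session:

-- psql session; \timing prints the wall-clock time of each statement
\timing
SELECT * FROM nodelink;
-- the reported time should be close to the console app's figure above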

mjy, if you want to post an answer, I can tick it as the correct answer for future reference.

+2  A: 

Here is a useful article about tuning Postgres - it has definitions and a few tips.

This performance tuning article offers a pretty decent overview with some specific methods of optimization.

Diakonia7
Hey - thanks, I initially read through both of those pages before actually posting here. I've tweaked a few things, but I still get the same poor performance.
Robert
Is there anything in particular that I should check?
Robert
@Robert - what did you tweak?
Diakonia7
shared_buffers = 128MB and effective_cache_size = 512MB. There didn't seem to be anything else that would cause the symptoms I'm seeing, but I could be overlooking something :)
Robert
@Robert - It looks like you have already played with some of the config options- But in case you haven't considered these, here are some more: http://www.postgresql.org/docs/8.3/static/runtime-config-resource.html
Diakonia7
@Robert - maybe also take a look at shared memory, particularly SHMMAX: http://www.postgresql.org/docs/8.3/static/kernel-resources.html#SYSVIPC
Diakonia7
Hey - thanks for that. It looks like Windows has its own way of dealing with those parameters ("On Windows, PostgreSQL provides its own replacement implementation of these facilities, and so most of this section can be disregarded.")
Robert
A: 

PostgreSQL uses an MVCC architecture, which means it uses a more complicated format for storing data on disk than MySQL. It is slower for single-user access and faster for multi-user access.

a) Check whether your tables have been vacuumed - see the VACUUM statement.

b) Use indexes - PostgreSQL has a bigger repertoire of index types than MySQL (there are GiST and GIN indexes, for example), so use them.
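To make those two suggestions concrete, a sketch against the nodelink table from the question (the index name is illustrative; GiST applies to the PostGIS geometry columns):

-- (a) Vacuum and refresh planner statistics in one pass:
VACUUM ANALYZE nodelink;

-- (b) GiST index on a geometry column, for spatial queries:
CREATE INDEX nodelink_shapenative_gist
  ON nodelink
  USING GIST (shapenative);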

Pavel Stehule
MySQL uses MVCC for the InnoDB, Falcon, PBXT, and solidDB storage engines.
Alex
I've tried running vacuum, and vacuum analyse, but it doesn't seem to make a difference. I'm just doing a select * - surely indices are irrelevant for such a query?
Robert
A: 

Sounds like you suffer from fragmentation. Do you have lots of updates without running vacuum? Do you update indexed columns so HOT-updates are not used?

What's the output of SELECT relpages, reltuples FROM pg_class WHERE relname = 'nodelink'? That'll show you how many disk pages your tuples are stored on.
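A sketch extending that check with the table's on-disk size, using standard admin functions (relpages are 8 kB each by default):

SELECT relpages,
       reltuples,
       pg_size_pretty(pg_relation_size('nodelink')) AS on_disk_size
FROM pg_class
WHERE relname = 'nodelink';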

@Pavel: PostgreSQL certainly is more flexible wrt. indexes, but an index will not help in this case, since he's selecting everything in the table.

Many of the tables have hundreds of thousands of rows, so I need to keep performance in mind.

These are not particularly large tables ...

Is this what I can expect from PostgreSQL, or is there something I can do to make it better?

... so there's probably something else you're doing wrong.

Alex Brasetvik
I've tried running vacuum, and vacuum analyse, but it doesn't seem to make a difference. The output of the query is: relpages: 4345, reltuples: 84936.
Robert
"Do you have lots of updates..." : well, no - not really. Perhaps it's worth mentioning that this database is being created from a migration tool I've written. It looks at the MySQL database, and migrates the data to my new schema. I'm running the slow queries immediately after the migration, and there is no other system using the database, so there are no updates occurring.
Robert
A: 

Did you have the GIS features in MySQL as well? IIRC, that means that you were using MyISAM and not a transaction-capable storage manager, which means you're really not comparing apples to apples.

Also, is your application actually ever going to do this? A completely unqualified SELECT of all the rows? If not, you are better off looking at the performance of the things you are actually going to be doing, which would probably involve at least WHERE clauses. (Though this, of course, also cannot be fairly compared to a non-crash-safe, non-transactional system.)

Magnus Hagander
We do have the GIS features in MySQL - but we are moving to pgSQL because of the faster spatial queries. I've tried running these queries before I've PostGIS-ified the database, and the performance is similar. No - the app will never get everything from this table, but take a look at the amendments I've made to the original post - doing a select that yields around 10,000 rows, even on an indexed column, is still a lot slower than MySQL. Therein lies the problem :)
Robert
Well, that means you are comparing a safe system with an unsafe one, since you are on MyISAM. It's obviously going to be slower when you need to deal with actual data safety. Not necessarily *that* much slower, of course :-)
Magnus Hagander
A: 

If you have a table which has hundreds (let alone hundreds of thousands) of records, what possible reason do you have for running the query SELECT * FROM? Perhaps you should think about what data you're actually querying for, and how you could get just the relevant rows from the database.
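A sketch of the kind of qualified query being suggested - explicit columns, a predicate, and a limit (the column choices and the workid value are illustrative):

SELECT nodelinkid, name, length
FROM nodelink
WHERE workid = 42
ORDER BY name
LIMIT 100;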

nickf
No reason. But the "select *" highlights a performance issue that I get. If I do a SELECT * FROM NodeLink LIMIT 10000, then the time is roughly 1/8th of the time to get everything. Similarly, if I limit to 1000, it's roughly 1/80th of the time. If I employ an index-based search, it's still slow to get the data. Certainly 30-50 times slower than MySQL. So accessing data in this table is slow - that's my concern.
Robert
A: 

This is much too long for a normal 100,000-row table, so I think there is a problem in PostGIS, not PostgreSQL. Try to get all rows without the shapenative and shapewgs84 columns - if that is much faster, then it looks like PostGIS is responsible for the slowdown.
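Spelled out against the schema in the question, the suggested test would be every column except the two geometry ones:

SELECT nodelinkid, workid, modifiedbyid, tabulardatasetid, fromnodeid,
       tonodeid, materialid, componentsubtypeid, crosssectionid,
       name, description, modifiedbyname, linkdiameter, height, width,
       length, roughness, upstreaminvert, upstreamloss, downstreaminvert,
       downstreamloss, averageloss, pressuremain, flowtogauge, cctvgrade,
       installdate, whencreated, whenmodified, ismodelled, isopen
FROM nodelink;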

Tometzky
Yes - I actually found this and amended my original post. However, even if I do a SELECT [everythingButThePostGISColumns], it's still really slow.
Robert