ansaurus

Question

mysql 7columns pk vs. 1 column md5 unique constraint

Answer 1

A:

until mysql gets partition pruning, i suggest (gulp) denormalizing your tables to fake partitioning. do something like take the modulo 32 of your first value and make 32 tables.

update: apparently mysql 5.1.6 and later do support pruning (http://dev.mysql.com/doc/refman/5.1/en/partitioning-pruning.html) so my stronger advice is to upgrade, then allow mysql to handle the partitioning for you, possibly using a hash value of one of your 7 columns.

longneck 2009-10-14 19:42:33

Answer 2

A:

If you can find a good hash that matches your record lookup, then applying your unique constraint on each partition shouldn't be that big of a deal. Smaller partition sizes will make your unique constraint less expensive. (If I'm wrong, someone here will school me I'm sure).

I'm stuck on MySQL 5.0. I'm facing manual partitioning a few tables over 40M rows. I have a document id that I can hash in my application: floor(docID/10)%100. This can give me 100 partitions and that should keep my index size down significantly. I did a query on the table and counted up the number of rows by hash:

select count(docID), floor(docID/10)%100 as partno
from documents 
group by partno

Luckily, I found a very even distribution on my first try. Your own formula will be different, I have no idea what your distribution would be like. Are you concerned that your unique constraint will not hold up in the face of partitioning?

If you can take advantage of MySQL partitioning, it will be more powerful and less of an impact on your application.

memnoch_proxy 2009-11-05 05:29:52

ansaurus

tags:

views:

answers:

mysql 7columns pk vs. 1 column md5 unique constraint

related questions