views: 76
answers: 2
I have an inner join that matches on regular expressions, and it is very slow. Is there any easy way to speed this up? I am using Postgres.

FROM A
INNER JOIN B ON trim(lower(replace(replace(replace(B.enginequery,',',' '),'"',' '),'+',' '))) = trim(lower(A.keyphrase))
             OR trim(lower(replace(replace(replace(B.enginequery,',',' '),'"',' '),'+',' '))) ~ (trim(lower(A.keyphrase)) || '$')
             OR trim(lower(replace(replace(replace(B.enginequery,',',' '),'"',' '),'+',' '))) ~ (trim(lower(A.keyphrase)) || ' ')
+3  A: 

Is there any easy way to speed this up?

Performance suffers because of all the operations, not least the regex match, that have to be evaluated for every candidate pair of rows just to test the join condition. You need to simplify the relationship so that this work doesn't have to happen at join time.

OMG Ponies
It certainly looks like your schema needs rethinking. Joining on expressions like this is always going to be slow. You could possibly side-step the problem with a derived table to limit the work done (I don't know your full schema), but that's a workaround, not a real solution.
d11wtq
A functional index *could* help (see the sketch below), but you really need to rethink the design that requires it.
rfusca
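
A minimal sketch of the expression ("functional") index rfusca mentions, assuming the table and column names from the question. Only the equality branch of the join can use it; the two regex branches are not anchored to the start of the string, so they will still force a scan.

    -- Index B on the same expression the join condition uses (assumed names).
    CREATE INDEX b_enginequery_expr_idx
        ON B (trim(lower(replace(replace(replace(enginequery,',',' '),'"',' '),'+',' '))));

    -- Refresh statistics so the planner considers the new index.
    ANALYZE B;

The planner will only pick up an expression index when the query spells out the identical expression, which the join condition above already does.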
+1  A: 

I would start by placing the result of:

 trim(lower(replace(replace(replace(B.enginequery,',',' '),'"',' '),'+',' ')))

into a column in the table. At least then it wouldn't have to be recalculated over and over. How you implement that in Postgres I don't know; in MS SQL Server I would try a computed column, so that my applications wouldn't have to worry about keeping B.enginequery and its cleaned-up version in sync.

Then I would probably end up putting an index on that cleaned-up column (sketched below).

MaasSql
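
A sketch of the Postgres equivalent of what MaasSql describes, assuming PostgreSQL 12 or later for the generated column (earlier versions would need a trigger to keep it in sync); the column and index names are illustrative.

    -- Store the cleaned-up value once, maintained automatically by Postgres.
    ALTER TABLE B
        ADD COLUMN enginequery_clean text
        GENERATED ALWAYS AS (
            trim(lower(replace(replace(replace(enginequery,',',' '),'"',' '),'+',' ')))
        ) STORED;

    -- A plain b-tree index on the stored column serves the equality branch of the join.
    CREATE INDEX b_enginequery_clean_idx ON B (enginequery_clean);

The join can then compare B.enginequery_clean directly against trim(lower(A.keyphrase)) instead of recomputing the nested replace() calls for every row.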