ansaurus

Question

Poorly performing Mysql subquery -- can I turn it into a Join?

Answer 1

+4 A:

Use:

   SELECT a.id
     FROM ORDERHISTORYTABLE AS a
LEFT JOIN (SELECT e.EmailAddress,
                  e.product,
                  MAX(OrderDate) AS max_date
             FROM OrderHistoryTable AS e
            WHERE e.Product IN ('ProductA','ProductB','ProductC','ProductD')
         GROUP BY e.EmailAddress) b ON b.emailaddress = a.emailaddress
                                   AND b.max_date = a.orderdate
                                   AND b.product = a.product
    WHERE x.emailaddress IS NULL
      AND a.Product IN ('ProductA','ProductB','ProductC','ProductD')

OMG Ponies 2010-10-28 15:54:07

This looks good, but I'm getting "#1054 - Unknown column 'emp1.Product' in 'where clause'"

BrianAdkins 2010-10-28 17:51:00

@ OMG Ponies:So my original query takes about 90 seconds to run (but I have to run it many times over again with different product sets)... I just tried your revised query and I killed the process at the 3 minute mark as it was marked as **DEAD** ... any ideas?

BrianAdkins 2010-10-28 18:29:22

@BrianAdkins: I re-added the filtration in the derived table for the LEFT JOIN -- that should minimize the amount of processing, but I'd hoped to consolidate it. You've got separate indexes -- what about a covering index using emailaddress, product and orderdate columns?

OMG Ponies 2010-10-28 18:51:22

Answer 2

A:

Make view of this sub query and do join with view. It will multiply by 5 to ur query performance

seed_of_tree 2010-10-28 16:08:00

Answer 3

+1 A:

My MySQL is a bit rusty (I'm used to MSSQL), but here's my best guess. It might need a bit of tweaking in the GROUP BY and HAVING clauses. Also, I assumed from your duplicate IN statements that you want the Products to match in both tables. If this isn't the case, I'll adjust the query.

SELECT a.id
FROM OrderHistoryTable a
INNER JOIN OrderHistoryTable b
    ON a.Product = b.Product AND
       a.Employee = b.Employee
WHERE a.Product IN ('ProductA','ProductB','ProductC','ProductD')
GROUP BY a.id, a.OrderDate, b.OrderDate, 
HAVING b.OrderDate < MAX(a.OrderDate)

Edit: removed extraneous AND.

jwiscarson 2010-10-28 19:18:47

ansaurus

tags:

views:

answers:

Poorly performing Mysql subquery -- can I turn it into a Join?

related questions