views:

111

answers:

3

Hi,

I am trying to do this:

<?php
  $good_customer = 0;
  $q = mysql_query("SELECT user FROM users WHERE activated = '1'"); // this gives me about 40k users

  while($r = mysql_fetch_assoc($q)){
    $money_spent = 0;

    $user = $r['user'];
    // Do queries on another 20 tables
    for($i = 1; $i <= 20; $i++){
      $tbl_name = 'data' . $i;

      $q2 = mysql_query("SELECT money_spent FROM $tbl_name WHERE user = '{$user}'");
      while($r2 = mysql_fetch_assoc($q2)){
        $money_spent += $r2['money_spent'];
      }
    }

    // Check the total across all 20 tables, so each user is counted at most once
    if($money_spent > 1000000){
      $good_customer += 1;
    }
  }

This is just an example. I am testing on localhost; for a single user it returns very fast, but when I try 1000 users it takes forever, not to mention 40k users.

Is there any way to optimise/improve this code?

EDIT: By the way, each of the other 20 tables has ~20-40k records.

EDIT2:

Okay, drop the "money spent" idea. This is my current structure:

user table => user is PK

logs_week_1 table => user is FK

logs_week_2 table => user is FK

logs_week_3 table => user is FK

... will have more logs tables in future.

I want to find the "average time" they spend on my site, with the time stored in each of the logs tables.

So you guys are saying that storing the logs weekly is a bad idea? I should merge them into one table?

+2  A: 

Sounds like you have a problem with your model. Why do you have 20 data tables instead of one with a week column?
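For example, the single table could look roughly like this (the column types are just a guess based on your description):

Create Table data (
  user        Int Not Null,            -- FK to users
  week        Int Not Null,            -- replaces the 20 separate tables
  money_spent Decimal(12,2) Not Null,
  Index ( user )
)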

Then you could do a

Select user, Sum( money_spent ) As total_money_spent
From data
Group By user

or even

Select Count(*) As good_customer_count
From
  ( Select user
    From data
    Group By user
    Having Sum( money_spent ) > 1000000
  ) t

With your current structure you can only do something like this:

Select u.user, d1.money_spent + d2.money_spent + ...
From users u
Join data1 d1 On ( d1.user = u.user )
Join data2 d2 On ( d2.user = u.user )
...

or

Select Count(*) As good_customer_count
From
  ( Select u.user, d1.money_spent + d2.money_spent + ... As total_money_spent
    From users u
    Join data1 d1 On ( d1.user = u.user )
    Join data2 d2 On ( d2.user = u.user )
    ...
  ) t
Where total_money_spent > 1000000

This will certainly be faster than your current solution.


And the time spent on a page should be stored in a numeric field.

Peter Lang
If money_spent is a "time" type, how do I do the sum? E.g. 00:10:23, 00:12:01, etc.
mysqllearner
I think we need information about your table structures and about your data to answer that.
Peter Lang
`money_spent` did sound like some sort of number column to me...
Peter Lang
@peter: I will edit my question to add the structures.
mysqllearner
+1, but I updated my answer with a UNION ALL option :)
Unreason
A: 

You should store the time spent on your site as a number (in minutes or seconds), not as a time. Then you can calculate averages and sums on this value. And keep your logs in one table.
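For example, assuming a single logs table with a numeric duration_seconds column (the names are just illustrative):

SELECT user,
       AVG(duration_seconds) AS avg_seconds,
       SEC_TO_TIME(AVG(duration_seconds)) AS avg_time
FROM logs
GROUP BY user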

Riho
+1  A: 

As Peter already gave a good answer, I will only post how the query would look with a proper design (all log data in one table):

SELECT user, SEC_TO_TIME(AVG(TIME_TO_SEC(TIMEDIFF(end_time, start_time)))) AS average_time
FROM logs
GROUP BY user

You can apply further WHERE conditions to the above to get stats for only a certain period (week, month, etc.), or you can group by another level.

You can also get MAX and COUNT in the same query (as well as standard deviation and other aggregate functions) in an efficient manner.
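For example, assuming the log table also has a log_date column to filter on (that name and the dates are just illustrative), one week could be summarised like this:

SELECT user,
       COUNT(*) AS visits,
       SEC_TO_TIME(MAX(TIME_TO_SEC(TIMEDIFF(end_time, start_time)))) AS longest_visit,
       SEC_TO_TIME(AVG(TIME_TO_SEC(TIMEDIFF(end_time, start_time)))) AS average_time
FROM logs
WHERE log_date BETWEEN '2010-04-05' AND '2010-04-11'
GROUP BY user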

Of course, take care of your indexes for best performance with larger data sets.

EDIT:

Just as I was giving Peter +1, I noticed that he did not mention the UNION ALL option.

So, you could do this (it is not optimal, and it does not contradict the design warnings given by others):

SELECT user, SEC_TO_TIME(AVG(TIME_TO_SEC(TIMEDIFF(end_time, start_time)))) AS average_time
FROM (
    SELECT * FROM logs_week_1
    UNION ALL
    SELECT * FROM logs_week_2
    UNION ALL
    SELECT * FROM logs_week_3
    ...
) U
GROUP BY user

You can also create a VIEW for this union.
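A rough sketch of such a view (logs_all is just an illustrative name; extend the union as new weekly tables are added):

CREATE VIEW logs_all AS          -- one virtual table over all weekly log tables
    SELECT * FROM logs_week_1
    UNION ALL
    SELECT * FROM logs_week_2
    UNION ALL
    SELECT * FROM logs_week_3

The GROUP BY query above can then simply select FROM logs_all.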

Unreason
@Unreason: What's the difference between using UNION and INNER JOIN? Currently I am using UNION, and the result seems a bit weird to me. I am trying to use INNER JOIN now.
mysqllearner
@mysqllearner: If your inner join is 1-1, it puts your tables next to each other, so you can select all columns from all tables in a single row. If you use UNION, the number of columns remains the same (it must be the same) and the results are appended one after another (with a big difference in performance between UNION and UNION ALL: UNION returns only unique rows, which requires building an index to de-duplicate, while UNION ALL can return duplicate records, but I assumed that your logs don't overlap).
Unreason
@mysqllearner: I posted the UNION ALL solution because it is conceptually equivalent (for selects) to fixing your design - keeping all logs in one table (but it is still not the same performance-wise).
Unreason