How to do this on Google App Engine (Python):

SELECT COUNT(DISTINCT user) FROM event WHERE event_type = "PAGEVIEW" 
AND t >= start_time AND t <= end_time

Long version:

I have a Python Google App Engine application with users that generate events, such as pageviews. I would like to know in a given timespan how many unique users generated a pageview event. The timespan I am most interested in is one week, and there are about a million such events in a given week. I want to run this in a cron job.

My event entities look like this:

from google.appengine.ext import db

class Event(db.Model):
    t = db.DateTimeProperty(auto_now_add=True)
    user = db.StringProperty(required=True)
    event_type = db.StringProperty(required=True)

With an SQL database, I would do something like

SELECT COUNT(DISTINCT user) FROM event WHERE event_type = "PAGEVIEW" 
AND t >= start_time AND t <= end_time

First thought that occurs is to get all PAGEVIEW events and filter out duplicate users. Something like:

query = Event.all()
query.filter("event_type =", "PAGEVIEW")
query.filter("t >=", start_time)
query.filter("t <=", end_time)
usernames = []
for event in query:
    usernames.append(event.user)
answer = len(set(usernames))

But this won't work, because a single query will only return up to 1000 entities. The next thing that occurs to me is to get 1000 events, then when those run out get the next thousand, and so on. But that won't work either, because going through a thousand queries and retrieving a million entities would take over 30 seconds, which is the request time limit.

Then I thought I should ORDER BY user so I could skip over duplicates faster. But that is not allowed, because I am already using the inequality filter t >= start_time AND t <= end_time, and the datastore requires the first sort order to be on the inequality property.

It seems clear this cannot be accomplished in under 30 seconds, so it needs to be fragmented. But finding distinct items doesn't seem to split well into subtasks. The best I can think of is, on every cron job call, to find 1000 pageview events, extract the distinct usernames from those, and put them into an entity like Chard. It could look something like

class Chard(db.Model):
    usernames = db.StringListProperty(required=True)
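
For concreteness, a rough sketch of what each cron job call might do, assuming the query cursor is carried over between calls (memcache is used here for brevity; a small datastore entity would survive eviction better). The function and key names are just illustrative:

from google.appengine.api import memcache
from google.appengine.ext import db

def build_one_chard(start_time, end_time, batch_size=1000):
    # One cron call: fetch the next batch of pageview events and
    # store the distinct usernames it contains as a Chard.
    query = Event.all()
    query.filter("event_type =", "PAGEVIEW")
    query.filter("t >=", start_time)
    query.filter("t <=", end_time)
    cursor = memcache.get("chard_cursor")
    if cursor:
        query.with_cursor(cursor)  # resume where the previous call left off
    events = query.fetch(batch_size)
    if events:
        Chard(usernames=list(set(e.user for e in events))).put()
        memcache.set("chard_cursor", query.cursor())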

So each chard would have up to 1000 usernames in it, fewer if there were duplicates that got removed. After about 16 hours (which is fine) I would have all the chards and could do something like:

chards = Chard.all()
all_usernames = set()
for chard in chards:
    all_usernames = all_usernames.union(chard.usernames)
answer = len(all_usernames)

It seems like it might work, but it's hardly a beautiful solution. And with enough unique users this loop might take too long. I haven't tested it in hopes that someone will come up with a better suggestion, so I don't know whether this loop would turn out to be fast enough.

Is there any prettier solution to my problem?

Of course all of this unique user counting could be accomplished easily with Google Analytics, but I am constructing a dashboard of application specific metrics, and intend this to be the first of many stats.

+1  A: 

Google App Engine, and more particularly GQL, does not support a DISTINCT function.

But you can use Python's set function as described in this blog and in this SO question.

tomlog
Thanks. I was aware of that SO question and the blog post, but they do not apply to this situation because of the size of the task.
Bemmu
+1  A: 

Here is a possibly-workable solution. It relies to an extent on using memcache, so there is always the possibility that your data would get evicted in an unpredictable fashion. Caveat emptor.

You would have a memcache variable called unique_visits_today or something similar. Every time a user has their first pageview of the day, you use memcache's incr() function to increment that counter.

Determining that this is the user's first visit is accomplished by looking at a last_activity_day field attached to the user. When the user visits, you look at that field, and if it is not today, you update it to today and increment your memcache counter.
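
In code, that check might look something like the sketch below; the UserInfo model and the counter key name are illustrative names, not part of the answer:

import datetime

from google.appengine.api import memcache
from google.appengine.ext import db

class UserInfo(db.Model):
    # Hypothetical per-user entity, keyed by username, holding the last day seen.
    last_activity_day = db.DateProperty()

def record_pageview(username):
    # Count each user at most once per day. Not transactional, so two
    # simultaneous first visits could double-count; acceptable for rough stats.
    today = datetime.date.today()
    info = UserInfo.get_or_insert(username)
    if info.last_activity_day != today:
        info.last_activity_day = today
        info.put()
        memcache.add("unique_visits_today", 0)   # create the counter if it is missing
        memcache.incr("unique_visits_today")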

At midnight each day, a cron job would take the current value in the memcache counter and write it to the datastore while setting the counter to zero. You would have a model like this:

class UniqueVisitsRecord(db.Model):
    # be careful setting date correctly if processing at midnight
    activity_date = db.DateProperty()
    event_count = db.IntegerProperty()
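
A sketch of the midnight cron handler described above, assuming it runs shortly after midnight and records the day that just ended (the counter key name is illustrative):

import datetime

from google.appengine.api import memcache

def midnight_rollup():
    # Persist yesterday's unique-visit count, then reset the counter for the new day.
    count = memcache.get("unique_visits_today") or 0
    UniqueVisitsRecord(
        activity_date=datetime.date.today() - datetime.timedelta(days=1),
        event_count=int(count)).put()
    memcache.set("unique_visits_today", 0)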

You could then quickly get all of the UniqueVisitsRecord entities that match any date range and add up the numbers in their event_count fields.
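
For example, a minimal sketch of that rollup query (note it sums per-day uniques, so a user who visits on several days is counted once per day):

def unique_visits_between(start_date, end_date):
    # Sum the per-day unique-visit counts over an inclusive date range.
    query = UniqueVisitsRecord.all()
    query.filter("activity_date >=", start_date)
    query.filter("activity_date <=", end_date)
    return sum(record.event_count for record in query)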

Adam Crossland
This relies on your value staying in memcache for an entire day. memcache is a cache, not reliable storage; this is only a good answer if you're happy to lose your count all the time.
Wooble