ansaurus

Question

Efficiently retrieving entities that match any element of a set of ids

Answer 1

+1 A:

You might be able to do this using a RelationIndex. Depending on how exactly you will want to allow user to view and query the data, it should work.

The idea is pretty straight forward, basically you will store a list of "standards" for each employee. And possibly a list of employee's for each standard. Then you'll be able to ask questions such as all employee's who 'smell good'.

Because you have scores for each standard, you might want to do something like store the "score" and "standard number" as a pair in the list ("3:12") so that you can find everyone who has a score of 3 on standard 12.

edit: Updated based on comment.

It sounds like you need to deal with a few different issues. First, you need to deal with editing and maintaining the data. Second, you need to deal with querying the data. Third, you are going to need to handle displaying the data.

For querying the data efficiently you will probably need some approach similar to what I initially suggested. What is more common, editing or viewing the data? That will impact how you setup your models.

If you are only dealing with 30 or 40 employees and 30 or 40 standards, maybe you could use something like the following:

class Evaluations(db.Model):
    period = db.StringProperty()
    standards = db.TextProperty()
    scores = db.TextProperty()

class EvaluationsIndex(db.Model):
    index = db.StringListProperty()

Use the standards property on Evaluations to store a list of standards evaluated. Then store your employee-standard-score grid in the scores property. Obviously you'll need to serialize both the standards list and the evaluation grid, perhaps using something like JSON. Use the EvaluationsIndex model as I mentioned above.

With this (or something really similar) you will have pretty easy edits, very easy display, and support for queries.

You could add an additional model to track which supervisor entered the evaluation and her notes.

Robert Kluin 2010-10-06 18:32:37

Thanks for the link to that talk; it was interesting. I don't think a relation index would help here, though, because each score only has a single "receiver" (in users x standards). Since there are just two ids there, the deserialization cost would be negligible. The real problem I'm having is that I need to find the scores connected to any of 1200 receivers - it's like 1200 people logging on to his microblogging service at once... _per user!_

Riley 2010-10-06 21:48:26

Perhaps I miss-understood your issue. I thought you were only looking for a method to query and identify employees with a particular value for some standard. You are going to need a combination of approaches to make this work on App Engine.

Robert Kluin 2010-10-06 22:20:19

Thanks for your further thought. This scheme of storing all of the scores in a big text column is what I was trying to get at in option #1 of my original post, where I used "grid" instead of "TextProperty." For the serialization, I was thinking of just using a List<Score> kind of mechanism, and have appengine do the serialization for me. I assume that deserializing 1200 objects will be much faster than performing 1200 queries, and will certainly _cost_ less. I'll just go ahead and try this method and time it. Thanks again!

Riley 2010-10-07 15:55:26

Just do some testing first, you may encounter an issue storing 1,200 items in one db.ListProperty. For one of my apps, after testing, I went with a design more similar to what I detailed above. I *never* deserialize the large RelationIndex list, I use key names and rewrite the entire thing when it changes. And, I render the grid client-side -- so storing my data as a JSON string save a bunch of work server-side.

Robert Kluin 2010-10-07 16:08:47

Great point about sending to the client. I'll try JSON first.

Riley 2010-10-07 16:17:51

ansaurus

tags:

views:

answers:

Efficiently retrieving entities that match any element of a set of ids

related questions