ansaurus

Question

Best practices for manipulating database result sets in Python?

Answer 1

+1 A:

Have you considered using an ORM? SQLAlchemy is pretty good, and Elixir makes it beautiful. It can really reduce the amount of boilerplate code needed to deal with databases. Also, a lot of the gotchas mentioned have already shown up and the SQLAlchemy developers dealt with them.

nosklo 2008-09-22 19:30:31

I don't think an ORM is appropriate. I don't expect to write to the database at all -- this is strictly a data display project. And writing the queries is not a problem (in fact, I plan to simplify the first-draft code I posted above to remove the queries-in-a-loop issue).

Will Rogers 2008-09-22 19:33:50

Answer 2

A:

Using an ORM for an iPhone app might be a bad idea because of performance issues: you want your code to be as fast as possible, so you can't avoid boilerplate code. If you are considering an ORM, besides SQLAlchemy I'd recommend Storm.

Vasil 2008-09-22 19:35:55

He says "formatted for the iPhone", a web page I would guess. The ORM would be on the server side. I know ORMs add a little overhead, but the cell phone network will be the real latency inducer.

alif 2008-09-22 20:01:02

ORM is in Python. Can be on the client. ORM ships pure SQL to the server.

S.Lott 2008-09-22 22:11:32

Like I said in the first sentence of my question, this is a web app. The Python code lives on the server. Also, I'm not really interested in ORM because I have only a handful of set read-only queries to implement.

Will Rogers 2008-09-22 22:16:05

SQLAlchemy is not an ORM in the ActiveRecord style but a DataMapper, which can be very efficient. SQLAlchemy also generates the ideal query for your desired result set and does not do excessive multi-querying...

pi 2008-09-23 12:13:20

Answer 3

A:

Depending on how much you want to do with the data you may not need to populate an intermediate object. The cursor's header data structure will let you get the column names - a bit of introspection will let you make a dictionary with col-name:value pairs for the row. You can pass the dictionary to the % operator. The docs for the odbc module will explain how to get at the column metadata.

This snippet of code shows the application of the % operator in this manner.

>>> a={'col1': 'foo', 'col2': 'bar', 'col3': 'wibble'}
>>> 'Col1=%(col1)s, Col2=%(col2)s, Col3=%(col3)s' % a
'Col1=foo, Col2=bar, Col3=wibble'
>>>
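
As a rough sketch of the introspection step, the following builds one dictionary per row from the cursor's column metadata. It assumes a DB-API 2.0 style cursor whose `description` attribute lists the column names; the stdlib `sqlite3` module stands in for the odbc module here, and the table `t` and helper `rows_as_dicts` are made up for illustration.

```python
import sqlite3

def rows_as_dicts(cursor):
    """Return the remaining rows of an executed cursor as {column_name: value} dicts."""
    # DB-API 2.0: cursor.description holds one 7-item sequence per column,
    # and the first item of each is the column name.
    names = [col[0] for col in cursor.description]
    return [dict(zip(names, row)) for row in cursor.fetchall()]

# Hypothetical in-memory table, only so the example is self-contained.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (col1 TEXT, col2 TEXT, col3 TEXT)")
conn.execute("INSERT INTO t VALUES ('foo', 'bar', 'wibble')")

cur = conn.execute("SELECT col1, col2, col3 FROM t")
template = 'Col1=%(col1)s, Col2=%(col2)s, Col3=%(col3)s'
lines = [template % row for row in rows_as_dicts(cur)]
print(lines[0])
conn.close()
```

With another driver the connection call will differ, but the `description` lookup should carry over if the module follows the DB-API.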

ConcernedOfTunbridgeWells 2008-09-22 21:30:03

Are there any benefits or drawbacks to using a dict vs. using an object for my intermediate rows? I do need some kind of intermediate step because I need to calculate and add additional columns in each row.

Will Rogers 2008-09-22 22:22:36

If you are just rendering something from the contents of the query results, then the '%' operator can substitute the dictionary values into placeholders of the form %(xxx)s. For simple tasks this is simpler than instantiating a model or DB row object.
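
For calculated columns, a dict row can simply take an extra key before it is rendered. A minimal sketch, with a made-up row and column names (`node`, `rt`, `da`, `rtda` are assumptions, not taken from the real schema):

```python
# Hypothetical row, as it might come back from a query.
row = {'node': 'A', 'rt': 42.5, 'da': 40.0}

# A dict accepts new keys freely, so a calculated column is just another entry.
row['rtda'] = row['rt'] - row['da']

line = '%(node)s: RT=%(rt).2f DA=%(da).2f RT-DA=%(rtda).2f' % row
print(line)
```

The trade-off is that nothing ties the calculation to the row, which is where an object with its own methods starts to pay off.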

ConcernedOfTunbridgeWells 2008-09-23 12:44:55

Answer 4

+1 A:

The empty Record class and the free-floating function that (generally) applies to an individual Record are a hint that you haven't designed your class properly.

class Record(object):
    """Assuming rtda and pnl must exist."""
    def __init__(self):
        self.da = 0
        self.rt = 0
        self.rtda = 0  # or whatever
        self.pnl = None
        self.sink = None  # Not clear what this is

    def setPnl(self, node_prices):
        # fill RT and DA prices from the hash retrieved above
        # calculate dependent values: RT-DA and PNL
        pass  # placeholder so the class parses; the real calculation goes here

Now, your calculate_pnl( records, node_prices ) is simpler and uses the object properly.

def calculate_pnl( records, node_prices ):
    for record in records:
        record.setPnl( node_prices )

The point isn't to trivially refactor the code in small ways.

The point is this: A Class Encapsulates Responsibility.

Yes, an empty-looking class is usually a problem. It means the responsibilities are scattered somewhere else.

A similar analysis holds for the collection of records. This is more than a simple list, since the collection -- as a whole -- has operations it performs.
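
One way this could look, purely as a sketch: a small collection class that owns the whole-set operations and delegates per-row work to each record. The names `RecordCollection`, `set_prices`, `total_rtda` and `node` are hypothetical, and `node_prices` is assumed to map a node name to an `(rt, da)` pair; only the RT-DA difference is calculated, since the PNL formula isn't given.

```python
class Record(object):
    """One result row that updates its own dependent values."""
    def __init__(self, node):
        self.node = node  # hypothetical key into node_prices
        self.rt = 0
        self.da = 0
        self.rtda = 0

    def set_prices(self, node_prices):
        # Assumption: node_prices maps node name -> (rt, da).
        self.rt, self.da = node_prices[self.node]
        self.rtda = self.rt - self.da


class RecordCollection(object):
    """Hypothetical wrapper: operations on the set as a whole live here."""
    def __init__(self, records):
        self.records = list(records)

    def set_prices(self, node_prices):
        # Delegate the per-row work to each record.
        for record in self.records:
            record.set_prices(node_prices)

    def total_rtda(self):
        return sum(record.rtda for record in self.records)


records = RecordCollection([Record('A'), Record('B')])
records.set_prices({'A': (42.5, 40.0), 'B': (30.0, 31.0)})
total = records.total_rtda()
print(total)
```

Whatever renders the page then only asks the collection and its records for state; it never does the arithmetic itself.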

The "Request-Transform-Render" isn't quite right. You have a Model (the Record class). Instances of the Model get built (possibly because of a Request.) The Model objects are responsible for their own state transformations and updates. Perhaps they get displayed (or rendered) by some object that examines their state.

It's that "Transform" step that often violates good design by scattering responsibility all over the place. "Transform" is a hold-over from non-object design, where responsibility was a nebulous concept.

S.Lott 2008-09-22 22:23:44

Thanks for actually taking the time to read what I posted and offer a thoughtful response. I'll have to think about your suggestion; it does make sense.

Will Rogers 2008-09-23 13:54:57

Wrote a book on OO design. Not very good, but pretty complete. Hits this point pretty hard. See http://homepage.mac.com/s_lott/books/oodesign.html

S.Lott 2008-09-23 15:48:09
