ansaurus

Question

GAE load data into datastore without using CSV

Answer 1

+2 A:

Here's an extremely simplified example of what we're doing to use the bulkloader to load JSON data instead of CSV data:

class JSONLoader(bulkloader.Loader):
    def generate_records(self, filename):
        for item in json.load(open(filename)):
            yield item['fields']

In this example, I'm assuming a JSON format that looks something like

[
    {
        "fields": [
            "a", 
            "b", 
            "c", 
            "d"
        ]
    }, 
    {
        "fields": [
            "e", 
            "f", 
            "g", 
            "h"
        ]
    }
]

which is oversimplified.

Basically, all you have to do is create a subclass of bulkloader.Loader and implement (at a minimum) the generate_records method, which should yield lists of strings. This same strategy would work for loading data from XML files or ROT13-encrypted files or whatever.

Note that the list of strings yielded by the generate_records method must match up (in length and order) with the "properties" list you provide when you initialize the loader (ie, the second argument to the AlbumLoader.__init__ method in this example).

This approach actually provides a lot of flexibility: We're overriding the __init__ method on our JSONLoader implementation and automatically determining the kind of model we're loading and its list of properties to provide to the bulkloader.Loader parent class.

Will McCutchen 2009-09-14 16:52:21

Answer 2

A:

You may find this post useful - it details how to load data direct from an RDBMS, but applies equally to loading from any other source.

Nick Johnson 2009-10-15 22:52:09

ansaurus

tags:

views:

answers:

GAE load data into datastore without using CSV

related questions