Hi,

Problem Statement: I would like to create an offline database to look up prices and other info on the n most useful books to sell in the United States (where n is probably 3 million or so).

Question: I am trying to figure out how to use Amazon AWS to download a list of the n books with the highest sales rank, along with some information about each book (title, prices, etc.). I am open to other approaches as well.

What I have done so far: First, something like this exists already (asellertool.com). However, I thought this would be an interesting project to work on and, quite frankly, we aren't serious enough to justify paying the $30/month subscription.

Now, AWS is great (and easy) if you have a few items you want to look up, but I can't seem to figure out how to enumerate items by sales rank. Originally, I was hoping to enumerate all of the book items Amazon has by ISBN, but that wasn't available either. Then I thought I could find a list of all ISBNs out there, but that was a dead end too. Finally, I thought I could generate my own list of ISBNs, but after some back-of-the-envelope calculations I thought better of it: at 100 lookups/second, it would take roughly a year to go through a third of the 10-digit space (and it was overkill anyway).
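For reference, the back-of-the-envelope estimate above can be reproduced with a few lines of Python. This is only a sketch of the arithmetic: it treats every 10-digit string as a candidate, as the post does, and the 100 lookups/second rate is the figure from the post, not a measured or documented limit. The function name `days_to_scan` is made up for illustration.

```python
# Rough check of the brute-force ISBN scan estimate.
# Assumption: every 10-digit string is a candidate (the check digit is ignored).
ISBN10_SPACE = 10 ** 10
LOOKUPS_PER_SECOND = 100      # rate quoted in the post
SECONDS_PER_DAY = 86_400


def days_to_scan(candidates: int, rate: float = LOOKUPS_PER_SECOND) -> float:
    """Return how many days it takes to look up `candidates` items at `rate` per second."""
    return candidates / rate / SECONDS_PER_DAY


third_of_space_days = days_to_scan(ISBN10_SPACE // 3)
print(f"{third_of_space_days:.0f} days")  # a little over a year
```

Since the last digit of an ISBN-10 is a check digit, only about one in ten of those strings could be a valid ISBN, so a smarter generator would cut the time considerably, but it would still be a slow and wasteful way to find the top sellers.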

So, I am back to sales rank, which currently seems like a dead end as well. If you have any thoughts, I would appreciate them.

A: 

Which of Amazon's AWS services do you want to use? Or don't you know yet?

And why do you need an offline database? Why not request whatever you need on demand and keep a temporary cache?

ju
The reason is that the PDA would not be online when it is time to look up the ISBN. This would be used in a bookstore, for instance.
So what sort of database were you thinking of using?
George Stocker
SQL Server Compact (for a PDA running Windows Mobile) or SQLite would be the best choice for the database.
balexandre
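As a rough illustration of the SQLite suggestion, here is a minimal sketch of an offline lookup table keyed by ISBN, using Python's built-in `sqlite3` module. The table name, column names, and helper functions are all assumptions made up for this example; nothing in the thread specifies a schema.

```python
import sqlite3


def build_db(path=":memory:"):
    """Open (or create) the database and make sure the books table exists."""
    conn = sqlite3.connect(path)
    # Hypothetical schema: the ISBN is the primary key, so lookups use its index.
    conn.execute(
        """CREATE TABLE IF NOT EXISTS books (
               isbn        TEXT PRIMARY KEY,
               title       TEXT NOT NULL,
               price_cents INTEGER,
               sales_rank  INTEGER
           )"""
    )
    return conn


def upsert(conn, rows):
    """Insert rows of (isbn, title, price_cents, sales_rank), replacing existing ISBNs."""
    conn.executemany(
        "INSERT OR REPLACE INTO books (isbn, title, price_cents, sales_rank) "
        "VALUES (?, ?, ?, ?)",
        rows,
    )
    conn.commit()


def lookup(conn, isbn):
    """Return (title, price_cents, sales_rank) for an ISBN, or None if it is unknown."""
    cur = conn.execute(
        "SELECT title, price_cents, sales_rank FROM books WHERE isbn = ?", (isbn,)
    )
    return cur.fetchone()
```

The database file would be built on a desktop machine, for example with `upsert(conn, [("0000000000", "Example Title", 1234, 1)])`, and then copied to the device, where only `lookup` is needed.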
+1  A: 

Take a look at AWS Zone, in the Amazon E-Commerce Service section.

alexandrul
This API has nothing to do with getting the entire inventory. It is for small scale requests.
A: 

I'm not a programmer either, just curious and cheap. asellertool.com is probably great, but they wouldn't be around if not for Amazon's free database. Well, I guess nobody cares about used book prices?

+1  A: 

Amazon has a data feed service that provides gzipped XML files of all their products, organized by top-level category. It's updated once a day and totals about 20GB compressed (110GB uncompressed). Since you only need books, it's more in the area of 4GB compressed (31GB uncompressed). The only thing is that I'm not sure who is allowed to use this or what's involved in getting an account. As far as I know they don't list anything about it on their website, so you will most likely have to contact someone there to find out more. We use this at work for the stuff we do with them, and it's some of the craziest XML processing I've had to do.
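Files of that size are too large to load into memory at once, so they would need to be parsed as a stream. The following is a minimal sketch of that idea using only the Python standard library. The feed's real layout is not described here, so the element name `item` and the assumption that each product's fields are simple child elements are hypothetical placeholders; the function name `iter_books` is also made up.

```python
import gzip
import xml.etree.ElementTree as ET


def iter_books(path, item_tag="item"):
    """Yield one dict per <item_tag> element from a gzipped XML file.

    Assumption: each product is an element named `item_tag` whose direct
    children hold plain-text fields. The actual feed schema may differ.
    """
    with gzip.open(path, "rb") as fh:
        context = ET.iterparse(fh, events=("start", "end"))
        _event, root = next(context)  # first event is the start of the root element
        for event, elem in context:
            if event == "end" and elem.tag == item_tag:
                yield {child.tag: (child.text or "").strip() for child in elem}
                # Drop already-processed elements so memory use stays flat.
                root.clear()
```

Each yielded dict could then be filtered (for example by sales rank) and written to the offline database, so that only the n books of interest are kept.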

Brian Surowiec
This makes sense. Thanks for the answer. I will look around, but if you come across any contact information, please update this!
I'll see if I can find out any more info next time I talk to someone over there.
Brian Surowiec