tags:

views:

1697

answers:

18

I'm looking into developing a used cars web app, but I've searched and I cannot find any avaliable database for download (paid or free) that contains all the makers and models of automobiles.

How should I go about this? Should I scrap someone else's website? What guarantees do I have that the said website is updated frequently? Where do they get it from?

Does anyone knows where can I find such data?

+4  A: 

While looking for a solution to this, keep in mind that auto manufacturers often sell the same car branded under a different model name in different markets. Also, different markets will get different car models entirely. Depending on the scope of your application, this may or may not be a problem for you.

Greg Hewgill
Now that you've mention it that would be a problem, do you have any ideas on how I can circumvent this issue? My app will serve a local market only.
Alix Axel
@eyze Actually it's called Punto accross whole Europe, Punto means 'dot' in Italian.
Niteriter
A: 

If you can't find someone willing to vender you their database of this info, then you may be forced to just data entry it. Maybe pick up one of the blue books and just pound away at it. Better yet, hire an intern to do it for you!

Jagd
+7  A: 

You might like to checkout the free databases from the EPA.

"This website provides links to EPA's test vehicle fuel economy data. The data has been collected by Model Year since 1984 and been formatted to downloadable and comma-delimited files for import into databases or spreadsheets."

JonnyBoats
+12  A: 

You may want to key off of the VIN. You can get the database from SAE for $500 a year.

Bramha Ghosh
And there you have it ... just what I was talking about ... the man knows!
ldigas
I though about using VIN as well but all the services I've seen are a little expensive (25 > x < 100 cents per each query) and would make the business model unreliable. I downloaded the sample database from SAE (http://www.sae.org/standardsdev/groundvehicle/wmisample.xls) but it only has manufacturers information, VINs and model names are absent.
Alix Axel
A: 

There are such commercial databases, I've seen a few. Unfortunatelly, its names escape me. Try looking for a database of spare parts and engine numbers ... I gotta warn you, they're not free, not even cheap.

ldigas
Thanks but if they are so expensive where do all those small websites get it from? I bet they don't pay that much money to get it.
Alix Axel
Can't say, sorry. Automobile business is not my branch, I've just picked this up along the way.
ldigas
A: 

I worked in the used cars sales world and we have evaluated a number of commercial car databases. I can't remember the names of them but they were not cheap and not so easy as you would think to integrate. We even didn't buy one of them. After some research, we found out that 80% of the cars we offered on our site, were a list of 200 types of cars (and with type I mean, not only the model and brand of the car, but also a list of options in common). We decided then to create a templating system (based on chassis number), that we could use to enter a car of common brand/model/type into our system. This saved a lot of work with relatively little effort.

Michael
+4  A: 

Another option would be to load the data from an RDF data source like dbpedia.org or freebase.com (e.g. have a look at DBpedia Compact Cars, DBpedia Mid Size Cars or Freebase Automobile Make). Simply parsing out the information from the RDF resources will be much more reliable than html scrapping and easier to update. If you have the time and don't fear a steep learning curve you could also look into using some more powerful semantic web techniques like SPARQL querying to implement this :)

VolkA
+1 those are great resources, thank you for poiting me in that direction
Alix Axel
+3  A: 

Mitchell International is considered to be the gold standard for collision repair databases. As their database contains information about all the parts on a vehicle, they also maintain a list of Year, Manufacturer, Make, Model and Style.

Here is the page about their database.

They make their data available for private label use as a subscription service.

(I became aware of Mitchell when I worked in the Auto Glass industry and we used their "NAGS" database for auto glass numbering.)

Should you go with the DIY route, there are some design considerations you should make. The main table is going to be the Make table as sometimes a vehicle make is shifted between manufacturers (the Geo Metro and Dodge Neon in the US for example). One might assume it is a hierarchy but it is not. The table structure might look like this:

       Manufacturer
            |
 Year --- Make --- Model --- Style
y0mbo
I guess the price of their DB is so high they don't even display it on the website, anyway thank you for the heads up regarding the DB structure - I was currently kinda struggling with that.
Alix Axel
A: 

Read up on how a VIN is constructed. There is a list of VINs as well as a decoder at decodethis.com or you can scrape from manufacturer web sites like this one at Ford.

jiggy
I've given a lot of research to this subject however, there are two main reasons why VINs don't work in my case: 1) with the VIN alone I have no way of knowing the model name for instance and 2) every manufacturer (WMI) has its own way of VIN coding (and it even changes over the years!) - decodethis.com is too expensive and it doesn't even find the VIN for my car.
Alix Axel
+1  A: 

My biz partner and I developed a website that aggregated used and new cars for sale.

You'll most likely want to determine this information from the VIN. However, note that there is no one ultimate authority for VINs. (See wikipedia for more info on VINs)

We researched free databases and for pay services that would handle VIN decoding. Using a free database was eventually discarded as you aren't guaranteed the information is accurate and up to date.

We considered DataOneSoftware and Vinquery. DataOne didn't post their pricing tiers like Vinquery but Josh Pereira was very helpful and responsive.

Furthermore, I found this GA post which is contains a list of VIN service providers we weren't aware of at the time. http://answers.google.com/answers/threadview/id/592533.html

Jonathan R. Wallace
Thank you, Vinquery is more attractive than decode-this.com but still a little expensive.I'll take a more in depth look at the Google question later on.
Alix Axel
+1  A: 

Like I said in my comment earlier. Free data is hard to find. Not to mention the maintenance up keeping it updated. You'll have to keep hunting information of new models. It's better off getting a subscription and spend the saved time on another project. Here is a link

Saif Khan
That DB seems very complete and at an affordable price, I'll consider this as one of my main options. Have you buyed any of their databases? Are they trustworthy?
Alix Axel
I haven't used their database (as yet) but heard some good things about them.
Saif Khan
I bought the pluup one. The data was incomplete.
Josh Ribakoff
+2  A: 

In the UK I would speak to the DVLA and see what they have to offer.

Greg B
That is a good idea!
Alix Axel
+1  A: 

I work in the automotive sector, and I can tell you that this is a nut that has yet to be cracked!

I've made a couple of integrations with such databases.

One of them is JATO. They provide you with a database with such information. I know they have a team of people working full time on updating that database... We receive daily updates and they are sometimes big! And I know they aren't cheap either. Here's a demo. I know they also have pictures of every make and model.

Another is Eurotax which is used to do used vehicle valuation, so they have a big history of such data. They have a webservice (which isn't free) that can be used to query the data you need (and also valuate used cars). European based though...

fretje
I already saw both JATO and Eurotax, for JATO I couldn't find any prices (I guess they are too high for public eyes), regarding Eurotax their database is pretty complete but they charge about 5 USD for web service each request, that's WAY more than what I was looking to spend.
Alix Axel
5 USD for each web service request - typo =)
Alix Axel
+2  A: 

You won't get it for free. You should use chrome They store historical data for vins, and also have a lot of services for new cars.

salietata
Thank you for the link, do you have any idea how much they charge for the web service?
Alix Axel
We bought the chrome data and have been fairly happy with it. It's in a somewhat funky (1990's?) format with occasional inconsistencies or omissions but works for the most part.
Brian Armstrong
+1  A: 

You can buy Vehicle data and pricing from Kelly Blue book, BlackBook and Nada.

You can try to get Year, Make, Model, Trim from the VIN number since they are unique but you will run into problems doing this because sometimes the manufacture will reuse models.

Robert Kozak
Thanks for the links.
Alix Axel
+3  A: 

Free data - www.vehicle-data.com Disclaimer: My site

Data is FREE, I just ask for a backlink.

1/4 million records from 3 countries dating back to early '90s. Twice that much data in staging.

Josh Ribakoff
A: 

Alix,

I was wondering what solution you finally went with. I am building a car dealer's web site and I need the year, make, model info. Any insight you have would be very much appreciated.

Thank you!

dsapala
+1  A: 

There is a great Vehicle Make Model database that comes with dependent drop downs ready to use in forms. They have an upgrade program too. Great price and validated data.

Its called TheVehicleList.com

BC