suggest database for storing metadata regarding 200 million images (1 million books) (NoSQL? SQL?) | ansaurus

tags:

views:

59

answers:

1

+2 Q:

suggest database for storing metadata regarding 200 million images (1 million books) (NoSQL? SQL?)

Friends,

We will be undertaking a knowledge preservation project for scanning more than 1 million books. We need some suggestions on implementing database for storing and retrieving metadata as well as use it for tracking the scanning status of each object (book)

Can you guys suggest should we go for SQL or NoSQL (The metadata could vary from project to project say this project could have 15 fields)
We are thinking something based on Lucene/Solr or some Scalable RDF database
Any open source solution where we have the ability to define custom metadata fields and store information with a search feature?

A:

Disclaimer: Never attempted this type of project

I have seen very good performance from MSSQL server's "Filestream" type. It uses the NTFS file APIs for storing binary data, and keeps a pointer in the rows of your table.

If you have no structure on the metadata you could use XML, but if you do have a repeating structure shove it into relation data and then you can use indexing etc. to help you get your performance.

Filestream Type

Spence 2010-06-24 11:18:56

related questions

When and why is ns0pred used?

Handling historical calendar dates

Versioned RDF store

Is there an RDF ontology for blogs?

SPARQL query - Class and subclass give a class name and namespace

Which Triplestore for rapid semantic web development?

Is there a free (LGPL< BSD, etc. ) RDF editor component for swing ?

what is an rdf triple?

Storage transactions in Redland's Python bindings?

Can anyone help me convert this ANTLR 2.0 grammar file to ANTLR 3.0 syntax?

What is a good RDF library for .net?

Blending RDF and ORM approaches

Source for Flash based RSS1/RSS2/ATOM/RDF reader?

What is the best PHP lib/class to generate RSS/Atom

Object-to-Triple mapping using AllegroGraph RDF store?

What is the best way to learn about RDF / OWL?

Do you use Microformats, RDF, Dublin Core or another type of sematic markup?

For your semantic web type application, do you use RDF or a proprietary model for the internal representation?

Do you leverage Semantic Web technologies? Why or why not?

Tools to enable translation of A-Box ontology data

Visualize Friend of a Friend (foaf) graph

What are some good Java RDF libraries.

Are there any tools to visualize a RDF graph? (please include a screenshot)

Examples of using semantic web technologies in real world applications

RDF storage