nosql

Getting started with massive data

I'm a math guy and occasionally do some statistics/machine learning analysis consulting projects on the side. The data I have access to are usually on the smaller side, at most a couple hundred of megabytes (and almost always far less), but I want to learn more about handling and analyzing data on the gigabyte/terabyte scale. What do I n...

relating data stored in NoSQL DB to data stored in SQL DB

Whats the best way to use a SQL DB along side a NoSQL DB? I want to keep my users and other data in postgres but have some data that would be better suited for a NoSQL DB like redis. I see a lot of talk about switching to NoSQL but little talk on integrating it with existing systems. I think it would be foolish to throw the baby ou...

How to modernize an enormous legacy database?

I have a question, just looking for suggestions here. So, my application is 'modernizing' a desktop application by converting it to the web, with an ICEFaces UI and server side written in Java. However, they are keeping around the same Oracle database, which at current count has about 700-900 tables and probably a billion total records...

Simple / Smart, Pythonic database solution, can use Python types + syntax? (Key / Value Dict, Array, maybe Ordered Dict)

Looking for solutions that push the envelope and: Avoid Manually writing SQL queries(Python can be more OO not passing DSL strings) Using non-Python datatypes for a supposedly required model definition Using a new class of types rather than perfectly good native Python types Boast Using Python objects Using Object Oriented and key...

Neo4j spatial within distance query

I would like to store coordinates as nodes in neo4j to do simple querys For example a list of 10,000 nodes: Node[1] - lon=-74.007124, lat=40.714550 Node[2] - lon=-94.626824, lat=39.113380 Node[3] - lon=-105.937406, lat=35.691543 Node[4] - lon=-122.329439, lat=47.603560 Node[5] - lon=-122.420139, lat=37.779600 .. .. ...

Cassandra - transaction support

I am going through apache cassandra and working on sample data insertion, retrieving etc. The documentation is very limited. I am interested in knowing can we completely replace relation db like mysql/ oracle with cassandra? does cassandra support rollback/ commit? does cassandra clients (thrift/ hector) support fetching associated o...

Django Models / SQLAlchemy are bloated! Any truly Pythonic DB models out there?

"Make things as simple as possible, but no simpler." Can we find the solution/s that fix the Python database world? Update: A 'lustdb' prototype has been written by Alex Martelli - if you know any somewhat lightweight, high-level database libraries with multiple backends we could wrap in syntax sugar honey, please weigh in! from some...

Retrieve every key of a column family in Cassandra

Hi all, I have found no way to translate a simple select like SELECT * FROM USER into Cassandra. Is it possible to simply retrieve all the keys in a ColumnFamily? The only one I have found is a select with a key range (get_range_slices). Is there a way to get the entire key list without the need to define a key range? Thanks Tobia Losc...

When I remove rows in Cassandra I delete only columns not row keys

Hi, If I delete every keys in a ColumnFamily in a Cassandra db using remove(key), then if I use get_range_slices, rows are still there but without columns. How could I remove entire rows? Thanks Tobia Loschiavo ...

Data store appropriate for sensor data

Hi, What do you think is a data store appropriate for sensor data, such as temperature, humidity, velocity, etc. Users will have different sets of fields and they need to be able to run basic queries against data. Relational databases like MySQL is not flexible in terms of schema, so I was looking into NoSql approaches, but not sure ...

Drive space hungry NoSQL's databases

I've tested NoSQL databases like CouchDB, MongoDB and Cassandra and observed tendence to absorbing very large amount of drive space relative to inserted key-value pairs. When comparing CouchDB and MySQL schemaless databases CouchDB is consuming much more drive space than MySQL. I know about that key-value DBs by default are versioning ...

MongoDB vs. Redis vs. Cassandra for a fast-write, temporary row storage solution

Hi there, I'm building a system that tracks and verifies ad impressions and clicks. This means that there are a lot of insert commands (about 90/second average, peaking at 250) and some read operations, but the focus is on performance and making it blazing-fast. The system is currently on MongoDB, but I've been introduced to Cassandra ...

Which of CouchDB or MongoDB suits my needs?

Where I work, we use Ruby on Rails to create both backend and frontend applications. Usually, these applications interact with the same MySQL database. It works great for a majority of our data, but we have one situation which I would like to move to a NoSQL environment. We have clients, and our clients have what we call "inventories"--...

Simulating relations in MongoDB

Being one of the most popular NoSQL solutions MongoDB has most of the advantages of this approach. But one issue I'm still struggling with is how reflect object relations in NoSQL data store, specifically - MongoDB. For example, let's consider a simple data model: User, Post and Comment. It is clear to me that comments have no value on ...

Berkeley DB java edition, any LGPL or BSD alternatives in Java?

Hi All, I am dealing with a huge dataset consisting of key-value pairs. The queries are always in the form of range queries on the key space (keys are numbers) hence any persistent B-Tree like structure will handle the situation. I would like to use BDB-Java Edition but the product is closed source and my company doesn't want to buy BDB...

Any Open Source software using Orient DB database? Have you any experiences with that database?

Do you know any open source software that uses Orient DB? Or have you used that product yourself? Any experiences to share? I have recently looked into Orient DB, and it has nice and interesting feature set (fast, embeddable in Java, simple API) but it seems that it is not widely used. Is it just because the Orient DB is a new player o...

MongoDB equivalent of SQL "OR"

So, MongoDB defaults to "AND" when finding records. For example: db.users.find({age: {'$gte': 30}, {'$lte': 40}}); The above query finds users >= 30 AND <= 40 years old. How would I find users <= 30 OR >= 40 years old? ...

Storing one column in a NoSQL DB?

In an app I am working on, we use a MySQL database and want to store articles in a table. Rather than store them in a SQL DB, we were looking at just storing the key to the article in a NoSQL db. Is this a good problem to solve using NoSQL, or should we just create another table in MySQL and store the large amounts of text there? We a...

inheritance in document database?

i am wondering because i searched the pdf "xxx the definitive guide" and "beginning xxx" for the word "inheritance" but i didn't find anything? am i missing something? because i am doing a tablePerHierarchy inheritance with hibernate and mysql, does that become deprecated for some reason in xxx? (replace xxx with the "not only sql" data...

What is an example of relational data and what is an example of a document?

I am looking for examples of when to use a relational data base vs when to use a document database. ...