Performance lost when open a db multiple times in BerkeleyDB | ansaurus

tags:

views:

92

answers:

1

Q:

Performance lost when open a db multiple times in BerkeleyDB

I'm using BerkeleyDB to develop a small app. And I have a question about opening a database multiple time in BDB.

I have a large set of text ( corpus ), and I want to load a part of it to do the calculation. I have two pseudo-code (mix with python) here

@1

def getCorpus(token):
    DB.open()
    DB.get(token)
    DB.close()

@2

#open and wait
def openCorpus():
    DB.open()

#close database
def closeCorpus():
    DB.close()

def getCorpus(token):
    DB.get(token)

In the second example, I'll open the db before the calculation, load token for each loop and then close the db.

In the first example, each time the loop ask for the token, I'll open, get and then close the db.

Is there any performance lost ?

I also note that I'm using a DBEnv to manage the database

+3 A:

If you aren't caching the opened file you will always get performance lost because:

you call open() and close() multiple times which are quite expensive,
you lose all potential buffers (both system buffers and bdb internal buffers).

But I wouldn't care too much about the performance before the code is written.

Piotr Czapla 2009-09-09 19:19:15

related questions

What language do you use for Postgresql triggers and stored procedures?

Are Multiple DataContext classes ever appropriate?

Which tools do people use to create Data Dictionaries?

Any experiences with Protocol Buffers?

Mechanisms for tracking DB schema changes

How big can a MySQL database get before performance starts to degrade.

How do I index a database field

How does database indexing work?

How do I connect to a database and loop over a recordset in C#?

Editing database records by multiple users

Object Oriented vs Relational Databases

VFP .NET OLEdb provider does not work in Win 64-Bits. Help

Embedded Database for .net that can run off a network

Connect PHP to an AS/400

Swap unique indexed column values in database.

cx_Oracle - what is the best way to iterate over a result set?

cx_Oracle - How do I access Oracle from Python?

.NET Migrations Engine

Is there a version control system for database structure changes?

SQLite and XSD

How do I version my MS SQL database in SVN?

XSD DataSets and ignoring foreign keys

Flat File Databases in PHP

Throw Error In MySQL Trigger

Binary Data in MySQL