What are the best ways to understand an unfamiliar database?

views:

701

answers:

+9 Q:

What are the best ways to understand an unfamiliar database?

What kind of approaches and techniques can you employ to become familiar with an existing database if you are tasked with supporting and/or modifying it? How can you easily and effectively ramp up your knowledge of a database you have never seen before?

+5 A:

The data dictionary is your friend. Also, try reverse-engineering the database with the reverse-engineering tool on Visio and building your own set of diagrams. Because reverse engineering is interactive - you build the diagrams - it's much more engaging than reading through a data dictionary. The activeness of the process is its advantage and I find it quite relaxing to do this.

Most of the work I do is in data warehousing, where poking around source system database schemas is something of a core activity. I've done this sort of thing on quite a number of occasions and find it works really well.

Visio pro is not that expensive and the Visio modelling engine lets you share a model amongst multiple diagrams. As a bonus, you can add in missing foreign keys in the diagrams and you get a useful set of documentation for the system out the end.

ConcernedOfTunbridgeWells 2008-11-10 16:50:17

This technique, combined with Bill Karwin's, saved my bacon when we inherited a very large, disorganized product from one of our clients' former developers ...

John Rudy 2008-11-10 16:57:15

+1 A:

See if the option of a Knowledge Transfer session is available to you, and if so, take full advantage of it.

Also, many DBMS's come with tools that allow you to draw/print the database schema with some helpful information (i.e. foreign keys).

Additionally, (stolen from NXC) you can reverse engineer the database via tools like Visio.

Kon 2008-11-10 16:51:17

+16 A:

The first thing I do is create an Entity-Relationship Diagram (ERD). Sometimes you can simply describe the metadata with command-line tools but to save time there are some tools that can generate a diagram automatically.
Second, examine each table and column make sure I learn the meaning of what it stores.
Third, examine each relationship and make sure I understand how the tables relate to one another.
Fourth, read any views or triggers to understand custom data integrity enforcement or cascading operations.
Fifth, read any stored procedures. Also read SQL access privileges if there are such.
Sixth, read through parts of the application code that use the database. That's where some additional business rules and data integrity rules are enforced.

update: I just read an interesting article "9 Things to Do When You Inherit a Database" with a good checklist.

Summary:

Backups
Research (the schema documentation steps I mention above)
Talk to the former developers
A bug database
Source code control
Talk to the users and/or business owners
Establish credibility with the users by fixing a few things or making some enhancements
Create a development environment
Drop obsolete objects

Bill Karwin 2008-11-10 16:55:58

Put a backup copy on a sandbox server and then start writing and running test queries. I always find a complex system easier to understand if I can get my hands on it and not worry about breaking it.

Also, I like to have The Daily WTF open in a browser window. Taking over someone else's design usually involves a lot of "I can't believe they did {WTF}" moments, and it helps to have somewhere to go where people understand your pain.

MusiGenesis 2008-11-10 16:57:07

+4 A:

In addition to Bill Karwin's ideas, I suggest talking to the users - occasionally users know quite a bit about what their database is used for, especially if they do any reporting from it.

Kramii 2008-11-10 16:58:59

Good point! A comment from fallen888 also advocates knowledge transfer.

Bill Karwin 2008-11-10 17:25:02

+2 A:

Schema Spy is a really nice tool for generating an ERD.

Don 2008-11-10 18:17:15

+1 A:

Bill gave an excellent answer. I would add that I would login to the user interface as a test user and try to understand exactly what the users do with the data. It will help you understand the why behind some of the stored procs or design. Understanding what the data means and is used for is critical to understanding a a database.

If the database is on a business function or subject you are in general unfamiliar with (say it does flight planning and you have previously only worked on financial applications), then ask the users for some reading material on the subject matter or go to the library yourself or search the Internet about the subject matter. Ask the users if there are legal or regulatory issues you need to be aware of. Again some of this subject matter background may explain what seem to be odd design choices.

HLGEM 2008-11-10 18:28:09

+1 A:

In addition to discovering the structure of the database, I've found that it's also important to look at the data quality. Once you understand the meaning of each column you can look for any places where there are a lot of missing values. As you become more familiar with the data you can also examine where there are inconsistencies between the values in different columns.

Eric Ness 2008-11-11 17:19:59

http://code.google.com/p/database-diagram/

reverse engineer the DB into a diagram

2009-07-24 13:41:11

I like to fire up a query profiler and watch what goes by on a production system. Gives me some idea of what tables are 'hot' and the kind of queries there are against them.

n8wrl 2009-07-24 13:43:16

ansaurus

tags:

views:

answers:

What are the best ways to understand an unfamiliar database?

related questions