ansaurus

Question

Need help deciding on a database scheme for a reporting project (PHP)

Answer 1

+1 A:

Without a sample of the data you hope to parse and then store it's quite hard to say where you need to go from where you already are. It's possible that someone could suggest a reasonable database schema, or at the very least say that whether a 3NF schema is viable.

Based on what you've provided, the question of whether to learn more or just proceed with a denormalised database requires a consideration of time, effort and the extent to which the database is to be used.

A normalised 3NF schema will:

take the greatest design time
enable a minimal-complexity application
provide an excellent position from which to denormalised with respect to performance

A denormalised schema will:

take the least design time (quickest option: one DB row per CSV line)
shift complexity from the database layer to the application layer
present future maintenance nightmares

The application layer can always compensate for shortcomings of the database layer at the cost of increased application complexity. Complexity can be managed through intelligent software design, and a good OO design can make the complex look simple. Consider where your strongest skills lie.

If your skills aren't in DB schema design and you can handle the increased application complexity, go for a quick schema design and crack on with getting the application working. Results trump perfection in a business environment.

If you have plenty of time, learn more about DB schemas and find a 3NF form that works for your data.

Remember that performance is relative to the frequency of use. Performance can be a pain for users if you need to generate reports from the application one per minute, less of a pain if the reports are run daily, no pain at all if the report generation is automated and happens once a day over night.

An ideal approach would be to:

determine the relationships between data (someone knows how those scripts you don't control work)
create a good 3NF schema
create the least-complex application needed (aided by a good 3NF schema)
iterate: denormalise as needed based on performance and user feedback

Keep in mind the business considerations. Getting out something that works in two weeks instead of something perfect in two months may be a better option. You may have difficulty convincing management of the time and cost sink of the two month solution (which may require extensive learning on your part).

If you're not sure which direction to take:

go for the quickest option that is most management friendly that you know you can handle
make sure you are aware of the shortcomings in your design
make management aware of the shortcomings of your quick option and what time/cost investment is required to overcome the shortcomings
you never know until you start whether any perceived shortcomings will actually be a problem
based on usage levels, your quick-and-it-works option might well be good enough

Jon Cram 2010-07-08 16:30:41

Most of this are all things I've considered. I've managed to scare myself into inaction, and need a kick in the pants from someone who's been there before. Since I'm a one man team, I need that kick from SO :)

John Bruckler 2010-07-08 17:34:32

I slept on it, and considering the requirements I was given, I think I can just go forward with what I already know, and get as close to 3NF as I can, then go from there. Analysis paralysis is a bear.

John Bruckler 2010-07-09 11:25:26

Better to do something that nothing!

Jon Cram 2010-07-09 19:11:38

ansaurus

tags:

views:

answers:

Need help deciding on a database scheme for a reporting project (PHP)

related questions