normalizing

Looking for algorithm that reverses the sprintf() function output

I am working on a project that requires the parsing of log files. I am looking for a fast algorithm that would take groups of messages like this: Input: The temperature at P1 is 35F. The temperature at P1 is 40F. The temperature at P3 is 35F. Logger stopped. Logger started. The temperature at P1 is 40F. and puts out something in th...

Address book DB schema

I need to store contact information for users. I want to present this data on the page as an hCard and downloadable as a vCard. I'd also like to be able to search the database by phone number, email, etc. What do you think is the best way to store this data? Since users could have multiple addresses, etc., complete normalization would be...

How do you determine how far to normalize a database?

When creating a database structure, what are good guidelines to follow or good ways to determine how far a database should be normalized? Should you create an un-normalized database and split it apart as the project progresses? Should you create it fully normalized and combine tables as needed for performance? ...

What's the better database design: more tables or more columns?

A former coworker insisted that a database with more tables with fewer columns each is better than one with fewer tables with more columns each. For example, rather than a customer table with name, address, city, state, zip, etc. columns, you would have a name table, an address table, a city table, etc. He argued this design was more ef...

Conflicting desires in Database Design, with fields of two similar functions

Okay, so I'm making a table right now for "Box Items". Now, a Box Item, depending on what it's being used for/the status of the item, may end up being related to a "Shipping" box or a "Returns" box. A Box Item may be defective: if it is, a flag will be set in the Box Item's row (IsDefective), and the Box Item will be put in a "Returns" ...

Data structure for non-overlapping ranges within a single dimension

I need a data structure that can store non-overlapping ranges within a single dimension. The entire range of the dimension need not be completely covered. An example would be a conference room scheduler. The dimension is time. No two schedules may overlap. The conference room isn't always scheduled. In other words, for a given time ther...
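One way such a structure could look, as a minimal Python sketch rather than a definitive answer: keep the ranges sorted by start and use binary search to check only the two neighbours of a new range. The class name RangeSet, the methods add and find, and the choice of half-open ranges [start, end) are all assumptions made for illustration; the question does not specify them.

```python
import bisect


class RangeSet:
    """Stores half-open ranges [start, end) that never overlap (hypothetical helper)."""

    def __init__(self):
        # Parallel lists kept sorted by start; because ranges never overlap,
        # the end points are sorted as well.
        self._starts = []
        self._ends = []

    def add(self, start, end):
        """Insert [start, end); return False if it would overlap an existing range."""
        if start >= end:
            raise ValueError("start must be less than end")
        i = bisect.bisect_right(self._starts, start)
        # The range just before the insertion point must end at or before `start`.
        if i > 0 and self._ends[i - 1] > start:
            return False
        # The range just after the insertion point must begin at or after `end`.
        if i < len(self._starts) and self._starts[i] < end:
            return False
        self._starts.insert(i, start)
        self._ends.insert(i, end)
        return True

    def find(self, point):
        """Return the (start, end) range containing `point`, or None."""
        i = bisect.bisect_right(self._starts, point) - 1
        if i >= 0 and point < self._ends[i]:
            return (self._starts[i], self._ends[i])
        return None


# Conference-room example with hours as integers.
room = RangeSet()
room.add(9, 10)    # accepted
room.add(10, 12)   # accepted: touches 9-10 but does not overlap it
room.add(11, 13)   # rejected: overlaps 10-12
```

Lookups are O(log n); insertion is O(n) because of the list shift, so a balanced tree or interval tree may be a better fit if the set is large and changes often.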

Database denormalization opportunity

I'm looking for a strategy for stopping the repetitive problem of branching out tables. For example, as a fictitious use case, say I have a table with users that contains their name, login, password and other metadata. In this particular scenario, say the user is restricted to logging in from a specific subset of IP(s). Thus, we have ...

Hierarchical tagging in SQL

I have a PHP web application which uses a MySQL database for object tagging, in which I've used the tag structure accepted as the answer to this SO question. I'd like to implement a tag hierarchy, where each tag can have a unique parent tag. Searches for a parent tag T would then match all descendants of T (i.e. T, tags whose parent is T...

Normalizing an existing MS Access Database

I have one large Access database that I need to normalize into five tables and a lookup table. I understand the theory behind normalization and have already sketched out the look of the tables, but I am lost on how to transform my table to get the database normalized. The Table Analyzer doesn't offer the breakdown that I want. ...

How to prevent orphaned records in detail tables of normalized database?

I have to maintain an old database which is not properly normalized. For instance there is a project table that has grown (or maybe mushroomed) to have 5 or more different date columns, for different milestones of the project from being ordered to the delivery date. There are also several tables each with columns for street addresses, ma...

Dealing with "hypernormalized" data

My employer, a small office supply company, is switching suppliers and I am looking through their electronic content to come up with a robust database schema; our previous schema was pretty much just thrown together without any thought at all, and it's pretty much led to an unbearable data model with corrupt, inconsistent information. T...

Database Design/Modeling Question - Constraints or No Constraints?

Given the following structure: City, Area, User. Every Area has 1 and only 1 City. Every User has at least one but possibly multiple Areas. Every User has 1 and only 1 City. What is the most elegant way to model this? Currently, I have: User, UserArea, Area, City, where UserArea is a 1:M relationship w/ User, and Area is 1:1 with Cit...

MS Access 03 - Normalizing Data from an Excel spreadsheet

Hey guys, I have a question about a method to break something apart. I get this Excel spreadsheet that provides me with data that I need to do one report. It's pretty simple and straightforward; however, there is one particular part of it that is giving me some grief. In the Excel spreadsheet there is a column that lists "parties involved...

Normalizing dataset with Ruby

Hi folks, I have a data set that ranges from 1 to 30,000. I want to normalize it so that it becomes 0.1 to 10. What is the best method/function to do that? Would greatly appreciate it if you could give some sample code! ...
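A common approach to this kind of request is a linear (min-max) rescale. The sketch below is in Python rather than Ruby, and the function name rescale and its parameters are assumptions for illustration; the arithmetic carries over to Ruby line for line.

```python
def rescale(values, new_min=0.1, new_max=10.0):
    """Linearly map values from their observed range onto [new_min, new_max]."""
    old_min, old_max = min(values), max(values)
    if old_min == old_max:
        # Degenerate input (all values equal): map everything to the midpoint.
        return [(new_min + new_max) / 2.0 for _ in values]
    scale = (new_max - new_min) / (old_max - old_min)
    return [new_min + (v - old_min) * scale for v in values]


# The smallest input maps to 0.1 and the largest to (approximately) 10.
scaled = rescale([1, 15000, 30000])
```

If the data is heavily skewed, a logarithmic mapping may spread the values more usefully than a linear one, but that is a different transformation from the one sketched here.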

Using JavaScript to normalize Url Rewriting strings entered by users

String.prototype.NormaliseUrl = function () {
    var url = this;
    var preserveNormalForm = /[,_`;\':-]+/gi
    url = url.replace(preserveNormalForm, ' ');
    // strip accents
    url = stripVowelAccent(url);
    //remove all special chars
    url = url.replace(/[^a-z|^0-9|^-|\s]/gi, '').trim();
    //replace spaces with a -
    u...

SQL database design problem.

I'm creating a database for monitoring the status of applications' functionalities. The logic is as follows: Each application has its own specific list of functionalities that I'm monitoring. Each functionality belongs to only one application. There is a Functionality table that has a foreign key to Application. Each application runs on one...