ansaurus

Question

Answer 1

A:

If you have a unique index on ResourseName, the lookup should be very fast even on a big table. However, it has disadvantages. For instance, if you log a lot of data and have to archive it off periodically and want to archive the previous month or year of logdata, you are forced to keep all of resoursepaths. You can come up with solutions for all of that.

Matt Wrock 2009-09-20 16:37:12

I recognize that the lookups can be fast, the real problem with the simple table design is that the size of the tables grows very rapidly. This affects both the efficiency of queries against the table (due to increased IO activity) and more importantly, the amount of data I have to transfer in when I am inserting.

FlipThePig 2009-09-20 16:54:55

Personally, I would go with the two table design. If this is going to grow fast like you say, I would add a partition key to both tables. Lets say you partition by month. You would have a month key in both tables. The resoursenames would be unique only within the month key. This will allow you to archive partitions of both tables when you need to purge for space and the partition scheme should provide performance gains as well.

Matt Wrock 2009-09-20 17:25:24

Answer 2

A:

yes inserting from existing data doing the lookup as part of the insert

Given @resource, @time and @data as inputs

insert( ResourcePathId, EventTime, ExtraData)
    select ResourcePathId, @time, @data
        from ResourcePaths
        where ResourceName = @resource

Mark 2009-09-20 16:37:47

This only works if the resource path is already defined. If the path doesn't already exist, I have to INSERT it...

FlipThePig 2009-09-20 16:52:44

True but it is quicker to do this and then if this fails to insert then do the insert into ResourcePaths. especially if adding a new resource is much less frequent then adding a log message

Mark 2009-09-20 17:03:14

ansaurus

tags:

views:

answers:

Enumerated text columns in SQL

related questions