ansaurus

Question

Answer 1

A:

In the past, I've created physical tables for read purposes only. The structure you have above is GREAT for storage, but terrible for reporting.

So you could do the following: Write a script (that is scheduled nightly) or a trigger (on data change) that does the following tasks:

First, you would dynamically go through each Product and build a static table "Product_[ProductName]"

Then go through each ProductAttributeTypes for each product and create/update/delete a physical column on the corresponding Product table.

Then, fill that table with the proper values based on OrderItemsToProductAttributes and ProductAttributes

This is just a rough idea. Make sure you are storing OrderID in the "Static"/"Flattened" tables. And make sure you do everything else you need to do. But after that, you should be able to start pulling from those flattened tables to get the data you need.

mschmidt42 2009-08-30 23:01:18

I get your point. I agree that database design for storage vs. reporting has its conflicts and limitations.But I don't think this query is heavy enough to make use of aggregated tables, as it gives me other problems like redundancy, delays in updates and so on.

MartinHN 2009-08-30 23:08:44

Answer 2

+1 A:

If you are using sql2005 or 2008 you can use the pivot command. See here.

In the example below the OrderAttributes set will look like:

 OrderItemId AttName AttValue
       -----  ------  -----
        100 Color Red
        100 Size Small
        101 Color Blue
        101 Size Small
        102 Color Red
        102 Size Small
        103 Color Blue
        103 Size Large

The final results after the PIVOT will be:

OrderItemId Size  Color
  -----    ------  -----
    100 Small Red
    101 Small Blue
    102 Small Red
    103 Large Blue

WITH OrderAttributes(OrderItemId, AttName, AttValue)
   AS (
      SELECT 
         OrderItemId, 
         pat.Name AS AttName,
         pa.Name AS AttValue
      FROM OrderItemsToProductAttributes x
      INNER JOIN ProductAttributes pa 
      ON x.ProductAttributeId = pa.id
      INNER JOIN ProductAttributeTypes pat
      ON pa.ProductAttributeTypeId =  pat.Id
   )

SELECT AttrPivot.OrderItemId,
[Size] AS [Size],
[Color] AS Color
FROM OrderAttributes
PIVOT ( 
      MAX([AttValue])
      FOR [AttName] IN ([Color],[Size])
   ) AS AttrPivot
ORDER BY AttrPivot.OrderItemId

There is a way to dynamically build the columns (i.e. the Color and Size columns), as can be seen here. Make sure your database compatibility level on your database is set to something greater than 2000 or you will get strange errors.

JBrooks 2009-08-30 23:07:12

How? When I'm not using any aggregate functions? I don't want to sum up Ids or anything like that.

MartinHN 2009-08-30 23:11:23

I think this one is very close now - but the value of both Color and Size columns are null. Maybe just a small detail missing?

MartinHN 2009-08-31 10:45:44

One thing might cause a diff, you had ProductAttributeTypeId in the table OrderItemsToProductAttributes, I didn't put this in my sample b/c this should be referenced thru the joins (third normal form and all). If you are saying that they could have nulls, then something should be changed to something like:SELECT AttrPivot.OrderItemId,isnull([Size],'NA') AS [Size],isnull([Color],'NA') AS ColorFROM OrderAttributes

JBrooks 2009-08-31 14:20:47

It doesn't have null values.I don't know why it returns null, though.If I run the query inside the WITH statement, I get all the correct data returned - no nulls. But running the entire query, gives me null values for the Size and Color columns.

MartinHN 2009-08-31 15:48:40

Post or send me your code...

JBrooks 2009-08-31 16:26:06

I got it working. The reason was that the name of each attribute is not stored as Color, Size and so. It is stored as an XML string with a number of culture specific values to accommodate internationalization.

MartinHN 2009-09-03 20:38:38

Answer 3

A:

Pivot is your best bet, but what I did for reporting purposes, and to make it work well with SSIS is to create a view, which then has this query:

SELECT     [InputSetID], [InputSetName], CAST([470] AS int) AS [Created By], CAST([480] AS datetime) AS [Created], CAST([479] AS int) AS [Updated By], CAST([460] AS datetime) 
                      AS [Updated]
FROM         (SELECT     st.InputSetID, st.InputSetName, avt.InputSetID AS avtID, avt.AttributeID, avt.Value
                       FROM          app.InputSetAttributeValue avt JOIN
                                              app.InputSets st ON avt.InputSetID = st.InputSetID) AS p PIVOT (MAX(Value) FOR AttributeID IN ([470], [480], [479], [460])) AS pvt

Then I can just interact with the view, but, I have a trigger on the table that any new dynamic attributes must be added to, which recreates this view, so I can assume the view is always correct.

James Black 2009-08-31 00:31:30

ansaurus

tags:

views:

answers:

SQL make rows into columns, PIVOT maybe

related questions