ansaurus

Question

Generating a two-level hierarchy from a set of data with more than one common grouping field in SQL

Answer 1

A:

I don't think that a recursive CTE is going to work here. The query is based entirely upon sequential logic and there's no logical "next set" for any given state because you can't know where the stop point is without first scanning each row one-by-one; in other words, the results essentially need to be evaluated row-by-row. The purpose of using a recursive CTE is to be able to append sets; if you're just appending rows then what you end up with is no better than a cursor.

I would actually use a CLR user-defined aggregate for something like this, because I can't think of a pure SQL solution that performs well. If you do need pure SQL, though, here is one using regular (non-recursive) CTEs and a windowing function:

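;WITH Rows_CTE AS
(
    -- Number the rows in ID order so each row can be paired with its predecessor.
    SELECT
        ID, Name, Code,
        ROW_NUMBER() OVER (ORDER BY ID) AS RowNum
    FROM @Tbl
),
Changes_CTE AS
(
    -- A row starts a new group unless it shares a Name or a Code with the previous row.
    -- The first row has no predecessor (r2 is all NULL), so it always starts a group.
    SELECT
        r1.ID, r1.Name, r1.Code,
        CASE
            WHEN r1.Name = r2.Name OR r1.Code = r2.Code THEN NULL
            ELSE r1.ID
        END AS BeginGroupID
    FROM Rows_CTE r1
    LEFT JOIN Rows_CTE r2
        ON r2.RowNum = r1.RowNum - 1
),
Groups_CTE AS
(
    -- For each group start, find the ID at which the next group starts.
    -- Rows with a NULL BeginGroupID are not group starts and drop out in the final join.
    SELECT c1.ID, c1.Name, c1.Code, c1.BeginGroupID, m.EndGroupID
    FROM Changes_CTE c1
    CROSS APPLY
    (
        SELECT MIN(ID) AS EndGroupID
        FROM Changes_CTE c2
        WHERE c2.ID > c1.BeginGroupID
        AND c2.BeginGroupID IS NOT NULL
    ) m
)
-- Attach every row to its group; the row that starts the group gets a NULL ParentID.
-- Note: the final group has no later group start, so its EndGroupID is NULL and the
-- range test below does not match it; add a NULL check if the last group must appear.
SELECT
    t.*,
    CASE
        WHEN t.ID = g.BeginGroupID THEN NULL
        ELSE g.BeginGroupID
    END AS ParentID
FROM Groups_CTE g
INNER JOIN @Tbl t
    ON t.ID >= g.BeginGroupID
    AND t.ID < g.EndGroupID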

That gets the results you're asking for. It could be written in a more compact form but I've tried to aim for readability.
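For cross-checking the query on a small data set, the same grouping rule can be written as a plain sequential scan, which is roughly what a cursor would do. The following is only a minimal sketch in Python, not part of the SQL solution: the `Row` type, the `assign_parents` function, and the sample rows are hypothetical names and data made up for illustration, and it assumes `Name` and `Code` are never NULL. Unlike the query as written, where the last group's `EndGroupID` comes back NULL, this sketch also emits the final group.

```python
from typing import List, NamedTuple, Optional, Tuple


class Row(NamedTuple):
    """Hypothetical stand-in for one row of @Tbl (ID, Name, Code)."""
    id: int
    name: str
    code: str


def assign_parents(rows: List[Row]) -> List[Tuple[int, Optional[int]]]:
    """Return (id, parent_id) pairs, scanning the rows in ID order.

    A row continues the current group when it shares a name or a code with
    the row immediately before it; otherwise it starts a new group. The row
    that starts a group gets parent_id None, every other row in the group
    gets the ID of that first row.
    """
    result: List[Tuple[int, Optional[int]]] = []
    prev: Optional[Row] = None
    group_start: Optional[int] = None
    for row in sorted(rows, key=lambda r: r.id):
        continues_group = prev is not None and (
            row.name == prev.name or row.code == prev.code
        )
        if continues_group:
            result.append((row.id, group_start))
        else:
            group_start = row.id  # this row becomes the parent of its group
            result.append((row.id, None))
        prev = row
    return result


# Made-up sample data; the original table contents are not shown in the thread.
SAMPLE_ROWS = [
    Row(1, "Alpha", "A1"),
    Row(2, "Alpha", "B2"),  # same name as row 1
    Row(3, "Beta", "B2"),   # same code as row 2
    Row(4, "Rogue", "Z9"),  # shares nothing with row 3: a group of one
    Row(5, "Gamma", "C3"),
    Row(6, "Gamma", "C4"),  # same name as row 5
]

if __name__ == "__main__":
    for row_id, parent_id in assign_parents(SAMPLE_ROWS):
        print(row_id, parent_id)
```

Because each row is compared only with the one directly before it, the scan needs a single pass and constant extra state, which is the sequential dependency that makes the problem awkward to express as set operations.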

(Addendum: We could use a recursive CTE and improve this significantly if it were known at the beginning that each and every name/code can only go under one "parent" - but that assumption is not documented anywhere, so we really have to assume worst-case.)

Aaronaught 2010-02-10 05:28:28

Works well. However, it did leave the "Rogue" one in the result when it should be left out, since it has no children (solvable by doing a count on the "group"). I didn't realize you could use ROW_NUMBER() without a PARTITION to create a row number. I had also never seen CROSS APPLY used on a sub-query; I've only used it with a function. The downside is that I realized from your answer that I was missing items from my requirements, and I apologize. I tried to simplify for clarity and left out some key parts. I will create a new question, since this is a problem with a correct answer.

Ben Dempsey 2010-02-11 15:25:52
