ansaurus

Question

Answer 1

+3 A:

Solutions are commonly presented using generate-id(), but personally I prefer a slightly different variation that doesn't use it:-

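<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

<!-- Index every item element by its id attribute -->
<xsl:key name="items" match="item" use="@id" />

<xsl:template match="root">
    <root>
        <!-- Keep an item only if it is the first node the key returns for its id -->
        <xsl:copy-of select="item[count(key('items',@id)[1]|.)=1]" />
    </root>
</xsl:template>

</xsl:stylesheet>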

First you create a key which holds all the item elements, using the id attribute as the lookup key. xsl:key generates an efficient index which can be used to look up items.

The technique relies on the fact that when you create a node-set using the | operator you get a unique set of nodes. In other words, if the same node is found on both sides of the | operator it only appears in the resulting set once.
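A minimal sketch of that union behaviour, assuming a hypothetical input whose root element contains at least two item children (the stylesheet below is only an illustration, not part of the solution):

```xml
<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
  <xsl:output method="text" />

  <xsl:template match="/root">
    <!-- Same node on both sides of |: the union holds one node -->
    <xsl:value-of select="count(item[1] | item[1])" />
    <xsl:text> </xsl:text>
    <!-- Two different nodes: the union holds two nodes -->
    <xsl:value-of select="count(item[1] | item[2])" />
  </xsl:template>
</xsl:stylesheet>
```

Under that assumption the output should be "1 2": the first count sees the same node twice and keeps it once, the second sees two distinct nodes.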

The expression:-

 key('items',@id)

will return the set of item nodes that have a specific ID. So:-

 key('items',@id)[1]

will return only one of the nodes that were found to have that specific ID (the first in document order), and it is repeatable (that is, using this expression repeatedly always returns the same node).

Hence the expression:-

 count(key('items',@id)[1]|.)=1

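can only be true for one item node with a specific id value. If the current node is the first node for its id, the union contains the same node twice and so has a count of 1; for any later node with the same id, the union contains two different nodes and has a count of 2.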

The copy-of therefore makes a deep copy of exactly one item node for each distinct id.
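As a rough illustration, assume a hypothetical input like the following. The element names root and item come from the stylesheet above; the id values and text content are made up for this example:

```xml
<root>
  <item id="a">first a</item>
  <item id="b">only b</item>
  <item id="a">second a</item>
</root>
```

With that input, the transform should keep the first item with id "a" and the item with id "b", and drop the second item with id "a", giving a result along these lines (exact whitespace and the XML declaration depend on the processor's output settings):

```xml
<root><item id="a">first a</item><item id="b">only b</item></root>
```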

AnthonyWJones 2010-02-12 10:52:44

@Anthony: My $0.02 - while the `count()` approach takes less space, it is also harder to understand. Proof: the long explanation. :) The `generate-id()` approach is less opaque, which is why I would always recommend it. There *are* cases where the `count()` way is the only option, but they are few and far between. (edit: still, +1)

Tomalak 2010-02-12 13:04:52

Answer 2

+3 A:

Here is the generate-id() way @AnthonyWJones mentioned. I find this one much easier on the human mind. It makes no difference to the result; choose whichever you like best.

<xsl:stylesheet 
  version="1.0" 
  xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
>
  <xsl:key name="kItemById" match="item" use="@id" />

  <xsl:template match="root">
    <copy>
      <xsl:copy-of select="
        item[generate-id() = generate-id(key('kItemById', @id)[1])]
      " />
    </copy>
  </xsl:template>
</xsl:stylesheet>

In short:

item[generate-id() = generate-id(key('kItemById', @id)[1])]

means: "All <item>s whose unique ID is equal to the unique ID of the first item with the same @id value".

Tomalak 2010-02-12 13:10:51

+1 I agree the use of generate-id is easier to understand.

AnthonyWJones 2010-02-12 14:56:29


Unique xml nodes based on attribute
