tags:

views:

146

answers:

1

I have the following XML structure:

<charsets>
  <charset>
    <name>ANSI_X3.4-1968</name>
    <aliases>
      <alias>iso-ir-6</alias>
      <alias>ANSI_X3.4-1986</alias>
      <alias>ISO_646.irv:1991</alias>
      <alias>ASCII</alias>
      <alias>ISO646-US</alias>
      <alias>US-ASCII</alias>
      <alias>us</alias>
      <alias>IBM367</alias>
      <alias>cp367</alias>
      <alias>csASCII</alias>
    </aliases>
  </charset>
  <charset>
    <name>ISO-10646-UTF-1</name>
    <aliases>
      <alias>csISO10646UTF1</alias>
    </aliases>
  </charset>
</charsets>

I can grab the text contents of the the "name" nodes using Ruby & Nokogiri as follows:

require 'nokogiri'
require 'open-uri'

doc = Nokogiri::XML(File.open("StandardCharsets.xml"))
@charsets = doc.css("charsets name").map {|node| node.children.text }

But, what I want is the text contents of all "name" and "alias" nodes in the order as they are shown in the source document. Everything I try fails.

Does anyone have a good example of how to do this?

Thanks,
Chip Castle
http://invoicethat.com
+3  A: 

Change the line

@charsets = doc.css("charsets name").map {|node| node.children.text }

to

@charsets = doc.css("charsets name, charsets alias").map {|node| node.children.text }
Adrian