ansaurus

Question

How to parse XML in Bash?

Answer 1

+1 A:

Well, you can use xpath utility. I guess perl's XML::Xpath contains it.

alamar 2009-05-21 15:39:23

Answer 2

+2 A:

I am not aware of any pure shell XML parsing tool. So you will most likely need a tool written in an other language.

My XML::Twig Perl module comes with such a tool: xml_grep, where you would probably write what you want as xml_grep -t '/html/head/title' xhtmlfile.xhtml > titleOfXHTMLPage.txt (the -t option gives you the result as text instead of xml)

mirod 2009-05-21 15:43:58

The command I gave you would be from the command line. xml_grep itself is written in Perl, but you can call it from whatever language you want, like any other tool.

mirod 2009-05-21 16:53:35

Answer 3

+6 A:

Command-line tools that can be called from shell scripts include:

4xpath - command-line wrapper around Python's 4Suite package
XMLStarlet
xpath - command-line wrapper around Perl's XPath library

I also use xmllint and xsltproc with little XSL transform scripts to do XML processing from the command line or in shell scripts.

Nat 2009-05-21 18:18:38

Answer 4

A:

Check out XML2 from http://www.ofb.net/~egnor/xml2/ which converts XML to a line-oriented format.

simon04 2009-11-07 15:31:22

Answer 5

+4 A:

You can do that very easily using only bash. You only have to add this function:

rdom () { local IFS=\> ; read -d \< E C ;}

Now you can use rdom like read but for html documents. When called rdom will assign the element to variable E and the content to var C.

For example, to do what you wanted to do:

while rdom; do
    if [[ $E = title ]]; then
        echo $C
        exit
    fi
done < xhtmlfile.xhtml > titleOfXHTMLPage.txt

Yuzem 2010-04-09 14:13:13

Answer 6

A:

Here's a function which will convert XML name-value pairs and attributes into bash variables.

http://www.humbug.in/2010/parse-simple-xml-files-using-bash-extract-name-value-pairs-and-attributes/

freethinker 2010-08-01 06:46:44

Answer 7

A:

After some research for translation between Linux and Windows formats of the file paths in XML files I found interesting tutorials and solutions on:

General informations about XPaths
Amara - collection of Pythonic tools for XML
Develop Python/XML with 4Suite (2 parts)

2010-10-24 01:00:37

ansaurus

tags:

views:

answers:

How to parse XML in Bash?

related questions