ansaurus

Question

path to element with conditions on parent(s) attributes using xpath,lxml,python

Answer 1

+2 A:

This XPath expression:

/PatientsTree 
  /Patient[@PatientID='SKU55527']     
    /Study[@StudyInstanceUID =
           '25.2.9.2.1107.5.1.4.49339.30000007010207164403100000013'] 
      /Series

Results in this node selected:

<Series SeriesInstanceUID="2.16.840.1.113669.1919.1198835358"/>

Alejandro 2010-10-19 14:10:16

@Alejandro: +1 for a pure XPath solution.

Dimitre Novatchev 2010-10-19 16:23:51

thanks , for some reason when I tried this syntax I was getting only empty list, I thought there was something mystic about lxml that I am still to learn. I found this page http://msdn.microsoft.com/en-us/library/ms256086.aspx with some great xpath exaples

2010-10-19 17:13:30

@user480369: You are wellcome.

Alejandro 2010-10-19 18:28:17

Answer 2

+2 A:

import lxml.etree as le
with open('data.xml') as f:
    doc=le.parse( f )
patientID="SKU55527"
studyInstanceUID="25.2.9.2.1107.5.1.4.49339.30000007010207164403100000013"
xpath='''\
    /PatientsTree
        /Patient[@PatientID="{p}"]
            /Study[@StudyInstanceUID="{s}"]
                /Series'''.format(p=patientID,s=studyInstanceUID)
seriesInstanceUID=doc.xpath(xpath)
for node in seriesInstanceUID:
    print(node.attrib)
    # {'SeriesInstanceUID': '2.16.840.1.113669.1919.1198835358'}

unutbu 2010-10-19 14:16:53

Beautifully written - I love the improved readability of the format method. I also can never help but appreciate how much lxml seems like a native library.

nearlymonolith 2010-10-19 14:42:26

@unutbu: Please, when the schema is well known, don't start XPath expressions with `//` operator because this trasverse all the tree.

Alejandro 2010-10-19 15:13:33

@Alejandro: Thanks; that's a good point. I hope you don't mind my taking your nicely formatted XPath, too.

unutbu 2010-10-19 15:24:20

@unutbu: Not problem. White space handling in XPath expressions is a good feature for readability.

Alejandro 2010-10-19 15:29:52

I learnt some new things, hope this helped some others, thanks to you all

2010-10-19 17:15:53

Answer 3

+1 A:

If you want to use lxml natively instead of xpath: (otherwise, unutbu's solution is perfect)

from lxml import etree as ET
tree = ET.parse('some_file.xml')
patientID="SKU55527"
studyInstanceUID="25.2.9.2.1107.5.1.4.49339.30000007010207164403100000013"
patient_node = tree.find(patientID)
if not patient_node is None:
    study_node = patient_node.find(studyInstanceUID)
    if not study_node is None:
        for child in study_node.getchildren():
            print child.attrib
            #or do whatever useful thing you want
    else:
        #didn't find the study
else:
    #didn't find the node

nearlymonolith 2010-10-19 14:39:59

ansaurus

tags:

views:

answers:

path to element with conditions on parent(s) attributes using xpath,lxml,python

related questions