ansaurus

Question

email body from a parsed email object in jython

Answer 1

+3 A:

This will get you the contents of the message:

self.currentEmailParsedInstance.get_payload()

As for the text-only part, you will have to strip the HTML yourself, for example with BeautifulSoup.
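As a rough illustration of stripping HTML from a payload, here is a minimal sketch that uses the standard library's HTML parser in place of BeautifulSoup, so it runs without extra packages. The names strip_html and _TextExtractor are hypothetical, and the approach simply keeps text nodes and drops tags; on Python 2 / Jython, character entities such as &amp; would need extra handling.

```python
try:
    from html.parser import HTMLParser  # Python 3
except ImportError:
    from HTMLParser import HTMLParser  # Python 2 / Jython


class _TextExtractor(HTMLParser):
    """Hypothetical helper: collects text nodes and ignores all tags."""

    def __init__(self):
        HTMLParser.__init__(self)
        self.chunks = []

    def handle_data(self, data):
        # Called for every run of text between tags.
        self.chunks.append(data)


def strip_html(html):
    """Return the text content of an HTML string with the markup removed."""
    extractor = _TextExtractor()
    extractor.feed(html)
    extractor.close()  # flush any text still buffered by the parser
    return "".join(extractor.chunks)


print(strip_html("<p>Hello <b>world</b></p>"))
```

This does not skip the contents of script or style elements, so a real HTML library is probably the better choice for messy mail.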

Check this link for more information about the Message class the Parser returns. If you mean getting the plain-text part of a message that carries both an HTML and a plain-text version of its body, you can pass an index to get_payload() to get the part you want.
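To make the indexed form of get_payload() concrete, here is a small sketch. The message text in RAW is made up for illustration and assumes the plain-text version comes first, which is common for multipart/alternative mail but not guaranteed.

```python
import email

# Made-up two-part message: plain text first, HTML second.
RAW = (
    "MIME-Version: 1.0\n"
    'Content-Type: multipart/alternative; boundary="XYZ"\n'
    "\n"
    "--XYZ\n"
    "Content-Type: text/plain\n"
    "\n"
    "Plain body\n"
    "--XYZ\n"
    "Content-Type: text/html\n"
    "\n"
    "<p>HTML body</p>\n"
    "--XYZ--\n"
)

message = email.message_from_string(RAW)

# get_payload(0) returns the first sub-part as a Message object;
# calling get_payload() on that part returns its body as a string.
first = message.get_payload(0)
plain_text = first.get_payload()

print(first.get_content_type())
print(plain_text)
```

Checking get_content_type() before trusting the index avoids picking up the HTML part when a sender orders the parts differently.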

I tried with a different MIME email because the one you pasted seems malformed; hopefully that only happened when you edited it.

>>> import email.parser
>>> parser = email.parser.Parser()
>>> message = parser.parse(open('/home/vinko/jlm.txt','r'))
>>> message.is_multipart()
True
>>> parts = message.get_payload()
>>> len(parts)
2
>>> parts[0].get_content_type()
'text/plain'
>>> parts[1].get_content_type()
'message/rfc822'
>>> parts[0].get_payload()
'Message Text'

parts contains all the parts of the multipart message. You can check their content types as shown and keep only the text/plain ones, for instance.
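One way to keep only the text/plain parts is to loop over every part and test its content type. The sketch below uses Message.walk(), which also visits nested parts; the function name plain_text_parts is hypothetical.

```python
import email


def plain_text_parts(message):
    """Return the payloads of all text/plain parts, including nested ones."""
    texts = []
    # walk() yields the message itself and then every sub-part, depth first.
    for part in message.walk():
        if part.get_content_type() == "text/plain":
            texts.append(part.get_payload())
    return texts
```

Note that walk() also descends into attached messages (message/rfc822), so text from a forwarded email would be included too. If only the top-level body is wanted, iterating over message.get_payload() directly, as in the session above, is the narrower option.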

Good luck.

Vinko Vrsalovic 2008-11-11 07:27:54

Hi, I just want the text; I want to throw away the HTML. Please look at the question again.

Setori 2008-11-11 08:16:06

Please rewrite your question. What do those emails look like? What is the problem with what you get from get_payload()? Why do you use self.currentEmailParsedInstance in one example and self._email in the other?

Vinko Vrsalovic 2008-11-11 09:14:01

Answer 2

A:

I ended up with this:

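        # Requires "import email.parser" at module level.
        parser = email.parser.Parser()
        self._email = parser.parse(open('/home/vinko/jlm.txt', 'r'))
        if not self._email.is_multipart():
            # Single-part message: get_payload() returns a string, not a list.
            if self._email.get_content_type() == "text/plain":
                return self._email.get_payload()
            return "cannot obtain the body of the email"
        parts = self._email.get_payload()
        check = parts[0].get_content_type()
        if check == "text/plain":
            return parts[0].get_payload()
        elif check == "multipart/alternative":
            # The first part is itself a container; look at its first sub-part.
            part = parts[0].get_payload()
            if part[0].get_content_type() == "text/plain":
                return part[0].get_payload()
        return "cannot obtain the body of the email"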

Setori 2008-11-11 10:42:45
