You can use this regular expression, which strips out HTML tags and leaves just the text you need (fine for simple markup, though it is not a full HTML parser):
String noHTMLString = htmlString.replaceAll("<[^>]*>", "");
To split a multi-line string into a list of lines in Python, split it on the newline character:
inputString.split('\n') # --> ['Line 1', 'Line 2', 'Line 3']
The following is equivalent, but the string module's functions are deprecated (and were removed in Python 3), so they should be avoided:
import string
string.split(inputString, '\n') # --> ['Line 1', 'Line 2', 'Line 3']
Alternatively, if you want each line to keep its line-break sequence (CR, LF, or CRLF), call the splitlines method with True as the argument:
inputString.splitlines(True) # --> ['Line 1\n', 'Line 2\n', 'Line 3']
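To make the difference concrete, here is a small sketch comparing the three calls on a string with mixed line endings. The sample text and the variable names are just my assumptions for illustration, not anything from your data:

```python
# Hypothetical sample text with mixed line endings (CRLF, then LF).
input_string = "Line 1\r\nLine 2\nLine 3"

# splitlines() recognises CR, LF and CRLF and drops them.
plain = input_string.splitlines()

# splitlines(True) keeps whichever break sequence ended each line.
with_breaks = input_string.splitlines(True)

# split('\n') only knows about LF, so a stray '\r' is left behind.
naive = input_string.split("\n")

print(plain)        # ['Line 1', 'Line 2', 'Line 3']
print(with_breaks)  # ['Line 1\r\n', 'Line 2\n', 'Line 3']
print(naive)        # ['Line 1\r', 'Line 2', 'Line 3']
```

So if the text might come from Windows or from mixed sources, splitlines is probably the safer choice over splitting on '\n' by hand.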
Hope this helps. If you need any more help, do let me know.
PK