ansaurus

Question

Regex to split on successions of newline characters

Answer 1

+2 A:

re.split(r'[\n\r]+', line)

Ignacio Vazquez-Abrams 2010-04-08 00:22:43

Answer 2

+4 A:

The simplest pattern for this purpose is r'[\r\n]+' which you can pronounce as "one or more carriage-return or newline characters".

Alex Martelli 2010-04-08 00:23:27

Yup. That works.

drewk 2010-04-08 01:05:01

Answer 3

+1 A:

>>> s="Foo\r\n\r\nDouble Windows\r\rDouble OS X\n\nDouble Unix\r\nWindows\rOS X\nUnix"
>>> import re
>>> re.split("[\r\n]+",s)
['Foo', 'Double Windows', 'Double OS X', 'Double Unix', 'Windows', 'OS X', 'Unix']

ghostdog74 2010-04-08 00:34:50

Answer 4

A:

If there are no spaces at the starts or ends of the lines, you can use line.split() with no arguments. It will remove doubles. . If not, you can use [a for a a.split("\r\n") if a].

EDIT: the str type also has a method called "splitlines".

"Foo\r\n\r\nDouble Windows\r\rDouble OS X\n\nDouble Unix\r\nWindows\rOS X\nUnix".splitlines()

magcius 2010-04-08 03:19:33

ansaurus

tags:

views:

answers:

Regex to split on successions of newline characters

related questions