ansaurus

Question

The SHORTEST way to remove multiple spaces in a string in Python

Answer 1

+5 A:

>>> import re
>>> re.sub(' +',' ','The     quick brown    fox')
'The quick brown fox'

jleedev 2009-10-09 21:52:29

This solution only handles single space characters. It wouldn't replace a tab or other whitespace characters handled by \s like in nsr81's solution.

Taylor Leese 2009-10-09 22:21:52

That's true, `string.split` also handles all kinds of whitespaces.

jleedev 2009-10-10 07:55:35

Answer 2

+9 A:

import re
s = "The   fox jumped   over    the log."
re.sub("\s\s+" , " ", s)

nsr81 2009-10-09 21:52:30

I'd tend to change that regex to `r"\s\s+"` so that it doesn't try to replace already-single spaces.

Ben Blank 2009-10-09 21:55:57

updated. thanks for pointing that out.

nsr81 2009-10-09 21:56:46

If you wanted that behavior, why not just `"\s{2,}"` instead of a workaround for not knowing moderately-advanced regex behavior?

Chris Lutz 2009-10-09 22:06:26

Answer 3

+8 A:

foo is your string:

" ".join(foo.split())

Taylor Leese 2009-10-09 21:52:56

nsr81 2009-10-09 21:53:40

You mean `split` not `strip` right?

Chris Lutz 2009-10-09 21:54:09

sorry, meant split. it's a typo.

Taylor Leese 2009-10-09 21:54:51

“Without splitting and going into lists...”

Gumbo 2009-10-09 21:57:17

I ignored "Without splitting and going into lists..." because I still think it's the best answer.

Taylor Leese 2009-10-10 03:44:12

Answer 4

+4 A:

Similar to the previous solutions, but more specific: replace two or more spaces with one:

>>> import re
>>> s = "The   fox jumped   over    the log."
>>> re.sub('\s{2,}', ' ', s)
'The fox jumped over the log.'

Peter 2009-10-09 21:58:27

Answer 5

+4 A:

Have to agree with Paul McGuire's comment above. To me,

         ' '.join(the_string.split())

is vastly preferable to whipping out a regex. My measurements (Linux, Python 2.5) show the split-then-join to be almost 5 times faster than doing the "re.sub(...)", and still 3 times faster if you precompile the regex once and do the operation multiple times. And it is by any measure easier to understand -- much more pythonic.

Kevin Little 2009-10-10 02:39:51

ansaurus

tags:

views:

answers:

The SHORTEST way to remove multiple spaces in a string in Python

related questions