ansaurus

Question

vim regex replace multiple consecutive spaces with only one space

Answer 1

+1 A:

this will replace 2 or more spaces

s/ \{2,}/ /g

or you could add an extra space before the \+ to your version

s/  \+/ /g

mikerobi 2010-10-05 02:51:39

Answer 2

+2 A:

In the interests of pragmatism, I tend to just do it as a three-stage process:

:g/^    /s//XYZZYPARA/g
:g/ \+/s// /g
:g/^XYZZYPARA/s//    /g

I don't doubt that there may be a better way (perhaps using macros or even a pure regex way) but I usually find this works when I'm in a hurry. Of course, if you have lines starting with XYZZYPARA, you may want to adjust the string :-)

It's good enough to turn:

    This is a new paragraph
spanning       two lines.
    And    so    is   this but on one line.

into:

    This is a new paragraph
spanning two lines. 
    And so is this but on one line.

Aside: If you're wondering why I use :g instead of :s, that's just habit mostly. :g can do everything :s can and so much more. It's actually a way to execute an arbitrary command on selected lines. The command to execute happens to be s in this case so there's no real difference but, if you want to become a vi power user, you should look into :g at some point.

paxdiablo 2010-10-05 02:56:01

I liked your idea of dealing with the particular case (whitespaces at the beginning of the line which must be treated differently).

jedi_coder 2010-10-05 03:07:22

Yeah, the purist/idealist in me started taking a back seat a long time ago. Now I just like to get the job done, especially if the alternative is a 600-character regex with back-tracking and look-ahead, that I won't understand when I have to come back and debug it in three months :-)

paxdiablo 2010-10-05 03:12:25

+1 xyzzy plover

TokenMacGuy 2010-10-05 03:56:04

Answer 3

+1 A:

Does this work?

%s/\([^ ]\)  */\1 /g

frogstarr78 2010-10-05 03:34:44

better use `%s/[^ ]\zs \+/ /g` in this case (`:help /\zs`)

Benoit 2010-10-05 16:32:36

Ah! Nice. I agree much better. Thank you.

frogstarr78 2010-10-06 23:23:52

Answer 4

+10 A:

This will do the trick:

%s![^ ]\zs  \+! !g

Many substitutions can be done in Vim easier than with other regex dialects by using the \zs and \ze meta-sequences. What they do is to exclude part of the match from the final result, either the part before the sequence (\zs, “s” for “start here”) or the part after (\ze, “e” for “end here”). In this case, the pattern must match one non-space character first ([^ ]) but the following \zs says that the final match result (which is what will be replaced) starts after that character.

Since there is no way to have a non-space character in front of line-leading whitespace, it will be not be matched by the pattern, so the substitution will not replace it. Simple.

Aristotle Pagaltzis 2010-10-05 03:48:35

I would like to propose this alternative: `%s!\S\@<= \+! !g`. The `\@<=` is such a beautiful duck that I like using it. See also `:help /\@<=`

Benoit 2010-10-05 16:36:40

I just prefer the reduced finger acrobatics of `zs` over typing `@<=`… in much the same way (if to a lesser extent) that I enjoy Vim better than E(scape)M(eta)A(lt)C(ontrol)S(hift). :) OTOH, one’s sense of flair is always worth some sacrifice, so feel free.

Aristotle Pagaltzis 2010-10-05 16:48:52

depends on what keyboard layout you're using of course…

Benoit 2010-10-05 18:22:35

Answer 5

A:

I like this version - it is similar to the look ahead version of Aristotle Pagaltzis, but I find it easier to understand. (Probably just my unfamiliarity with \zs)

s/\([^ ]\) \+/\1 /g

or for all whitespace

s/\(\S\)\s\+/\1 /g

I read it as "replace all occurences of something other than a space followed by multiple spaces with the something and a single space".

Michael Anderson 2010-10-05 04:13:10

Of course this version is an order of magnitude more finicky to type, and to formulate on the fly – and that’s for almost as trivial a pattern as it gets. You’ll be well served to familiarise yourself with `\zs` and `\ze`, they can do wonders for the writability and readability of more complex patterns (particularly when you have reason to use both at once!).

Aristotle Pagaltzis 2010-10-05 04:43:43

I surely will look at `\zs` and `\ze`, but I also often use my regexs in python and sed. So it can be nice to have a solution that will work across multiple applications.

Michael Anderson 2010-10-05 05:11:50

Answer 6

+3 A:

There are lots of good answers here (especially Aristotle's: \zs and \ze are well worth learning). Just for completeness, you can also do this with a negative look-behind assertion:

:%s/\(^ *\)\@<! \{2,}/ /g

This says "find 2 or more spaces (' \{2,}') that are NOT preceded by 'the start of the line followed by zero or more spaces'". If you prefer to reduce the number of backslashes, you can also do this:

:%s/\v(^ *)\@<! {2,}/ /g

but it only saves you one character! You could also use ' +' instead of ' {2,}' if you don't mind it doing a load of redundant changes (i.e. changing a single space to a single space).

You could also use the negative look-behind to just check for a single non-space character:

:%s/\S\@<!\s\+/ /g

which is much the same as (a slightly modified version of Aristotle's to treat spaces and tabs as the same in order to save a bit of typing):

:%s/\S\zs \+/ /g

See:

:help \zs
:help \ze
:help \@<!
:help zero-width
:help \v

and (read it all!):

:help pattern.txt

Al 2010-10-05 10:42:41

Answer 7

A:

Answered; but though i'd toss my work flow in anyway.

%s/  / /g
@:@:@:@:@:@:@:@:@:@:@:@:(repeat till clean)

Fast and simple to remember. There are a far more elegant solutions above; but just my .02.

wom 2010-10-05 12:50:18

this is not a good solution: first it will remove leading whitespace, which the author of the question wishes to avoid. Second, you can do 100@: to run 100 times contents of register : (which is the last ex command)

Benoit 2010-10-05 16:34:41

hence I said it's not the best answer in my reply :)

wom 2010-10-06 18:18:46

ansaurus

tags:

views:

answers:

vim regex replace multiple consecutive spaces with only one space

related questions