ansaurus

Question

jQuerys $.trim(), bug or poorly written ?

Answer 1

+5 A:

As it turned out, this behavior was posted on jQuerys bugtracker one month ago:

http://dev.jquery.com/ticket/6605

Thanks to Andrew for pointing me to that.

jAndy 2010-06-28 13:37:54

That bug report is funny because Trac seems to eat uparrow characters. Thus both the broken "rtrim" and the suggested fix look incorrect.

Pointy 2010-06-28 13:44:38

jAndy - +1 I see that I had been looking at the most recent commit. (Thanks to @Pointy for giving me heads up.) But still, I can't get either the regex or `$.trim()` (which are the same) to fail in Firefox. I wonder why that is.

patrick dw 2010-06-28 13:46:53

Answer 2

+7 A:

This is a known bug, as said in comments, and Crescent is right that it's this way in 1.4.2, but it's already fixed for the next release.

You can test the speed of String.prototype.trim on your string here: http://jsfiddle.net/dLLVN/
I get around 79ms in Chrome 117ms in Firefox for a million runs...so this will fix the hanging issue :)

As for the fix, take a look at the current source that'll be in 1.4.3, the native trimming is now used.

There were 2 commits in march for this:

1.4.2 $.trim() function:

trim: function( text ) {
    return (text || "").replace( rtrim, "" );
},

1.4.3 $.trim() function:

//earlier: 
trim = String.prototype.trim

//new trim here
trim: trim ?
  function( text ) {
    return text == null ?
      "" : 
      trim.call( text ); 
  } :

  // Otherwise use our own trimming functionality
  function( text ) { 
    return text == null ? 
      "" :
      text.toString().replace( trimLeft, "" ).replace( trimRight, "" );
  }

The trimLeft and trimRight vary, depending on whether you're in IE or not, like this:

trimLeft = /^\s+/,
trimRight = /\s+$/,

// Verify that \s matches non-breaking spaces
// (IE fails on this test)
if ( !/\s/.test( "\xA0" ) ) {
  trimLeft = /^[\s\xA0]+/;
  trimRight = /[\s\xA0]+$/;
}

Nick Craver 2010-06-28 13:43:02

@Nick: it would be nice to see the `trimLeft` and `trimRight` expressions as well.

jAndy 2010-06-28 13:49:10

@jAndy - He posted them as a link. Click the first commit link.

patrick dw 2010-06-28 13:52:55

@jAndy - Added the relevant bits and links to the branch tags so you can see the changes in git :)

Nick Craver 2010-06-28 13:55:38

Answer 3

+5 A:

Normally an expression like ^\s+|\s+$ should be enough for trimming, since \s is supposed to match all space characters, even \0xa0 non-breaking spaces¹. This expression should run without causing any problems.

Now probably some browser that jQuery wants to support doesn't match \0xa0 with \s and to work around this problem jQuery added the alternative (\s|\0xa0), to trim away non-breaking spaces on that browser too.

With this change, the second part of the regex looks like (\s|\0xa0)+$, which leads to problems in browsers where \0xa0 is also matched by \s. In a string containing a long sequence of \0xa0 characters, each character can be matched by \s or \0xa0, leading to lots of alternative matches and exponentially many combinations how different matches can be combined. If this sequence of \0xa0 characters is not at the end of the string, the trailing $ condition can never be fulfilled, no matter which spaces are matched by \s and which are matched by \0xax, but the browser doesn't know this and tries all combinations, potentially searching for a very long time.

The simplified expression you suggest will not be sufficient since \s is supposed to match all unicode space characters, not just the well-known ASCII ones.

¹ According to MDC, \s is equivalent to [\t\n\v\f\r \u00a0\u2000\u2001\u2002\u2003\u2004\u2005\u2006\u2007\u2008\u2009\u200a\u200b\u2028\u2029\u3000]

sth 2010-06-28 13:53:00

Nice answer, good to know.

jAndy 2010-06-28 13:57:51

ansaurus

tags:

views:

answers:

jQuerys $.trim(), bug or poorly written ?

related questions