ansaurus

Question

How to find a Word that is enclosed from Html Tags?

Answer 1

A:

sorry the word programing

yas 2010-02-22 10:46:30

I've edited the post. This space is meant for answers. Please delete this answer. You can make use of comments. Welcome to SO :)

codaddict 2010-02-22 10:49:08

There's a edit link under your question, use that instead if you need to clarify your question.

Tatu Ulmanen 2010-02-22 10:50:12

I have done it ;-) thanks

yas 2010-02-22 10:52:29

Answer 2

A:

I would use something to pull out any HTML so that you are dealing with plaintext. I cannot speak for any tools like this in javascript but I'm sure they exists. If you can find something to 'scrub' the html out of your .text() you can run a search this way.

Try something like this: http://search.cpan.org/~podmaster/HTML-Scrubber-0.08/Scrubber.pm

Rabbott 2010-03-22 21:13:13

Answer 3

+1 A:

/([\s>"'])prog(<[^>]+>)ram(<[^>]+>)ing([\s\.,:;"'<])/g

will match your example

So roughly the following regex will find all instances of the word, even those broken with html

 var regExp = new RegExp('([\s>"\'])' + word.split('').join('(<[^>]+>)') + '([\s\.,:;"\'<])',g);

God knows how that'll help you build a spellchecker though. I suspect the approach used in spellcheckers would be more like 'do a spellcheck assuming no html, and if there is html in a word then strip it out using something like the method below, and do a spellcheck as normal for the string you get:

String.prototype.stripHtml = function() {
  return this.replace(/(<[^>]+>)/, '');
}

wheresrhys 2010-03-22 22:23:45

ansaurus

tags:

views:

answers:

How to find a Word that is enclosed from Html Tags?

related questions