Help me rewrite this regex to not match tags with attributes? | ansaurus

tags:

views:

92

answers:

1

+1 Q:

Help me rewrite this regex to not match tags with attributes?

=========================================================================

EDIT: I'm using node.js, so I don't have access to the DOM, and parsing with an HTML parser is not an option (it's not efficient enough to justify parsing through such a small amount of text)

=========================================================================

First off, I know. HTML + Regex = fail. However, I just need it to remove all tags with attributes.

Here's what I have so far:

    exports.strip_tags = function(input, allowed) {
      // Strips HTML and PHP tags from a string
   allowed = (((allowed || "") + "")
     .toLowerCase()
     .match(/<[a-z][a-z0-9]*>/g) || [])
     .join('');
      var tags = /<\/?([a-z][a-z0-9]*)\b[^>]>/gi,
      commentsAndPhpTags = /<!--[\s\S]*?-->|<\?(?:php)?[\s\S]*?\?>/gi;
      return input.replace(commentsAndPhpTags, '').replace(tags, function($0, $1){
        return allowed.indexOf('<' + $1.toLowerCase() + '>') > -1 ? $0 : '';
      });
    }

Any chance someone know's how to change up one of these regex's to make this remove what I need it to?

To clarify: This function should remove all tags with attributes, keep only the tags that are allowed (without attributes), and output the result.

A:

Convert it to XHTML and then use xpath.

HTML->XHTML tools:

As you said.... HTML + Regex = fail

Abe Miessler 2010-09-14 22:47:11

related questions

What JavaScript patterns do you use most?

Javascript events

What style do you use for creating an "class" in JavaScript?

Graphing JavaScript Library

What's the difference in closure style

Getting the text from a drop-down box

How to specify javascript to run when ModalPopupExtender is shown

Length of Javascript Associative Array

How Do I Post and then redirect to an external URL from ASP.Net?

How to set up a CSS switcher in ASP.NET

Wrapping lists into columns

Javascript keyboard events primer? (or rather: help me with my custom dropdown)

Http Auth in a Firefox 3 bookmarklet

Best Debugging Tools for JavaScript/xulrunner Development

How can I turn a string of HTML into a DOM object in a Firefox extension?

Call ASP.NET Function From Javascript?

Javascript troubleshooting tools in IE

MAC addresses in JavaScript

Capturing TAB key in text box

CSS Background Color in Javascript

How can I make the browser see CSS and Javascript changes?

Triple Quotes? How do I delimit a databound Javascript string parameter in ASP.NET?

ASP .Net Custom Client-Side Validation

What JavaScript library would you choose for a new project and why?

Detecting font in JavaScript