ansaurus

Question

Replace only text on a webpage to create a link

Answer 1

A:

Walk the DOM recursively, skip over links (and their children) and process only nodes which nodeType is "text". You should have a Text node inside everything that has text, including <p>.

Edgar Bonet 2010-10-16 10:10:29

Answer 2

A:

Instead of doing a RegExp on the whole document I would try working with the DOM.

Using jQuery, I first select only nodes within I want to replace text nodes. That's important because otherwise you would also replace content in tags like <code> or event <script> which you really want to avoid. Within those get the text nodes and extract the matches. My example is very conservative about selecting which elements to consider to scan, YMMV.

// I used this code to execute on your stackoverflow question,
// thus I choose "this" . Try it in Firebug.
var re = /(this)/gi, split;
$('div,p').contents().each( function() {
    if (this.nodeType == Node.TEXT_NODE) {
        if (this.nodeValue.match( re ) ) {
            split = this.nodeValue.split( re );
            for (var i = 0, l = split.length; i < l; i++) {
                if (i % 2) {
                    // the re catches the match thus every
                    // odd index is the match
                    $('<a href="#destination">' + split[i] +
                        '</a>').insertBefore( this );
                } else {
                    $( document.createTextNode(split[i]) ).
                        insertBefore( this );
                }
            }
            $(this).remove();
        }
    }
});

Afaik using Node.TEXT_NODE is not that cross browser compatible, using the native value 3 if it's not available might be necessary. I've also read that String#split may not applicable everywhere too. In other words: careful testing is required.

mark 2010-10-16 10:46:15

Thank you. I used part of your solution and Edgars solution and came up with a (not 100% finished, I need to match the text in the text node and then create a link element and append it as the text nodes child, thus getting rid of the incorrect data.replace) probably less efficient solution. anyhow, here is what I have (and I pass in the body element) so far, I'll post again once I finish: (would not all fit in this comment) pastebin.com/RmHk3sKk I'd vote you and Edgar up, but I don't have enough reputation yet, and I'm not really sure who to select as the accepted solution since both are ri

Thomas Wolfe 2010-10-18 10:16:23

@Thomas: with your pastie-code there's the possibility you will work through non-web-text nodes because you use `hasChildNodes` which *could* find a `<script>` tag etc. If you just specify, from my example, `div, p, <other tags?>` you should be fine without going through the children (that's the idea of my sample). Or am I missing something? Is the set of elements you work on too small in my example?

mark 2010-10-18 13:04:53

@Mark: Thanks for catching that. I'll need to test a bit more (just now I tested on a page with a <script> tag and it returned a nodetype of 1. I'm not sure what the logic is for determining the value of nodetype (I googled nodetype and I am still not sure). I'm just a bit compulsive and really don't like to copy other peoples work. Nothing wrong with your solution, in fact I'd say it's better than what I came up with (I really like your sample, I might use your idea to only work on div, p, other? since my recursive function is not very efficient).

Thomas Wolfe 2010-10-20 07:40:24

Answer 3

A:

Just another idea: currently you first parse the whole document(which can take it's time). Then you transform the document, what may be critical(me as a user would'nt be glad to see some links removed)

so my idea: if you click somewhere inside a document, you can create a range, which may give you the TextNode you clicked on. So you should be able to parse the contents of the TextNode, see if it matches your pattern and forward the match to a function, which opens the URL. The bad part of this idea: you will not be able to label the Date somehow before the user has clicked on it.

But it's just an idea.

Dr.Molle 2010-10-17 03:05:58

ansaurus

tags:

views:

answers:

Replace only text on a webpage to create a link

related questions