ansaurus

Question

Replacing td tags with td and attributes

Answer 1

A:

just call this function from main: Note:this code will work for valid html i.e xhtml

 public static string TableFormat(string xhtml)
    {
        int start = 0, end = 0, trstart = 0, trend = 0;

        while (trstart != -1)
        {
            //start=end;
            trstart = xhtml.IndexOf("<tr>", end);
            if (trstart == -1)
                break;
            trend = xhtml.IndexOf("</tr>", trstart);
            start = xhtml.IndexOf("<td>", trstart);
            end = xhtml.IndexOf("</td>", start);
            while (end < trend)
            {
                //int trackTr = 0;
                start = xhtml.IndexOf("<td>", end);
                if (start > trend)
                    break;
                xhtml = xhtml.Insert(start + 3, " class=\"right\"");

                end = xhtml.IndexOf("</td>", start);

            }
        }
        return (xhtml);
    }

Smack 2010-09-23 10:49:21

Answer 2

A:

Have you stepped through this code and verified that it works as intended? HTML is very forgiving about things like tag case and whitespace, but your method is not; if the HTML isn't formatted very specifically, your method will likely fail. I'd take a look at that.

Also, you might want to build some more flexibility into it. It might work now (once you get the issue resolved), but if the source HTML ever changes, it may not in the future.

Mike Hofer 2010-09-23 10:53:08

seems like he wants it to be hardcore !! may be for some particular purpose as he said.

Sangram 2010-09-23 10:56:52

Okay. But what if, at some point down the road, the TD tag already contains a class attribute? Or what if the tag is written as "<TD> or "<td >" or "<Td >" or some other variant? He can control it now, but once it goes live and others get their hands on the code, all bets are off.

Mike Hofer 2010-09-23 11:00:20

@Mike: it will work only for valid xhtml as He specified earlier.

Sangram 2010-09-23 11:07:43

Not being argumentative, but the OP didn't specify "valid xhtml." And with that I'll let it drop.

Mike Hofer 2010-09-23 11:18:48

Answer 3

A:

if there is inside a tag then that also needs to be handled

Handling nested structures like that is not possible with regex.

Regex is an extraordinarily poor tool for manipulating HTML. Do yourself a favour and grab yourself a proper parser instead and your code will be simpler and more reliable. eg. with HTML Agility Pack:

HtmlDocument doc= new HtmlDocument();
doc.LoadHtml(html);
foreach (HtmlNode td in doc.DocumentElement.SelectNodes("//tr/td[position()>1]"]) {
    td.SetAttributeValue("class", "right");
}

bobince 2010-09-23 11:32:37

Answer 4

A:

Consider using a regular expression...

        string pattern = @"(?<!(<tr>\s*))<td>";
        string test = @"<tr> 
                          <td>1</td> 
                          <td>2</td> 
                          <td>3</td> 
                        </tr> ";
        string result = Regex.Replace(test, pattern, "<td class=\"right\">", RegexOptions.IgnoreCase | RegexOptions.Multiline);
        Console.WriteLine("{0}", result);

This works with upper or lower case and any amount of whitespace betweent the <tr> and the <td>. Anything other than whitespace would cause this to fail.

Les 2010-09-23 11:37:59

what about <tr> inside another <tr> tag ? iguess not possible !! ?

TERNA_staff 2010-09-27 04:12:50

it's possible, but would not be valid html. the example finds the first <td> in the <tr> ignoring only whitespace

Les 2010-09-27 16:30:05

ansaurus

tags:

views:

answers:

Replacing td tags with td and attributes

related questions