I am trying to get all the text between the following tags and it is just not workind
If Not String.IsNullOrEmpty(_html) Then
Dim regex As Regex = New Regex( _
".*<entry(?<link>.+)</entry>", _
RegexOptions.IgnoreCase _
Or RegexOptions.CultureInvariant _
Or RegexOptions.Multiline _
)
Dim ms As MatchCollection = regex.Matches(_html)
Dim url As String = String.Empty
For Each m As Match In ms
url = m.Groups("link").Value
urls.Add(url)
Next
Return urls
I have already wrote my fetch functions to get the html as string. I was looking at an example of the html agility pack and I dont have files saved as html docs
HtmlDocument doc = new HtmlDocument();
doc.Load("file.htm");
foreach(HtmlNode link in doc.DocumentElement.SelectNodes("//a[@href"])
{
HtmlAttribute att = link["href"];
att.Value = FixLink(att);
}
doc.Save("file.htm");