Getting Links from an internal web page

Name: development - Getting Links from an internal web page - SharePoint Stack Exchange

Question

I have an internal company web page and I need to extract the links from it. This is what I tried:

    using (var client = new System.Net.WebClient())
    {
        string pattern = @"(<a.*?>.*?</a>)";
        MatchCollection hreflist;
        string Url = client.DownloadString("https://collaborate.citi.net/docs/DOC-908807");
        hreflist = Regex.Matches(Url, pattern);
        Console.WriteLine("Total number of links in Url: " + hreflist.Count + "\n\n");
    }

But this code doesn't seem to work here.

mzonerz · Accepted Answer · 2021-07-26 04:44:40Z

You need to install HtmlAgilityPack (available as a NuGet package) and use it to parse the HTML instead of a regular expression:

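    using System.Collections.Generic;
    using System.Net;
    using HtmlAgilityPack;

    // Download the page source.
    WebClient wc = new WebClient();
    var sourceCode = wc.DownloadString("http://dota-trade.com/equipment?order=name");

    // Parse the HTML and select every <a> element.
    HtmlDocument doc = new HtmlDocument();
    doc.LoadHtml(sourceCode);
    var nodes = doc.DocumentNode.SelectNodes("//a");

    List<string> links = new List<string>();
    if (nodes != null) // SelectNodes returns null when nothing matches
    {
        foreach (var item in nodes)
        {
            // Skip anchors that have no href attribute.
            var link = item.GetAttributeValue("href", "");
            if (link == "") continue;
            // Prefix relative links with the site root.
            links.Add(link.StartsWith("http") ? link : "http://dota-trade.com" + link);
        }
    }

    // Write one link per line.
    System.IO.File.WriteAllLines(@"C:\Users\Public\WriteLines.txt", links);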

Can you please explain all the links that you have used in your code? I have only one URL, the one I mentioned in my question, and I am a bit confused by all the links you have used. — Tausif Khan, commented Jul 15, 2021 at 13:01
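
On the links in the answer's code: the first URL is simply the page being downloaded, and the second ("http://dota-trade.com") is only a prefix glued onto relative href values so they become full addresses. With a single page URL, the same effect can probably be had by resolving each href against that page's own address. The following is a minimal sketch of that idea, not the answerer's method: `LinkResolver` and `ToAbsolute` are hypothetical names, and it uses only `System.Uri` from the base class library.

```csharp
using System;
using System.Collections.Generic;

public static class LinkResolver
{
    // Hypothetical helper: converts raw href values into absolute URLs,
    // using the address of the page they came from as the base.
    public static List<string> ToAbsolute(string pageUrl, IEnumerable<string> hrefs)
    {
        var baseUri = new Uri(pageUrl);
        var result = new List<string>();

        foreach (var href in hrefs)
        {
            // Ignore missing or empty href values.
            if (string.IsNullOrWhiteSpace(href)) continue;

            // Resolves relative hrefs against the page; absolute hrefs pass through.
            if (!Uri.TryCreate(baseUri, href, out Uri absolute)) continue;

            // Keep web links only (drops mailto:, javascript:, and similar).
            if (absolute.Scheme == Uri.UriSchemeHttp || absolute.Scheme == Uri.UriSchemeHttps)
            {
                result.Add(absolute.AbsoluteUri);
            }
        }

        return result;
    }
}
```

In the answer's loop, the href values collected from the `<a>` nodes could be passed to `LinkResolver.ToAbsolute("https://collaborate.citi.net/docs/DOC-908807", hrefs)` in place of the hard-coded prefix, so the only URL that appears in the code is the one from the question.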
