Skip to content Skip to sidebar Skip to footer

Trouble Scraping Web Page With Malformed Content

I have written c# code which utilizes the HtmlAgilityPack library in order to scrape a page located at: World's Largest Urban Areas (Page 2). Unfortunately the page consists of ma

Solution 1:

Run the content through HTML Tidy before parsing it.

http://tidy.sourceforge.net/

Post a Comment for "Trouble Scraping Web Page With Malformed Content"