NuGet Package Html2Xhtml

Html2Xhtml is a .NET 4.0 library for converting HTML to XHTML licensed under GPLv2 or above.

I tested Html2Xhtml in the local reconstruction of a large online database of the European Union.

Tidy/Tidy.NET would not even produce valid output most of the time, Chilkat's HTML-to-XML was a bit slow and produced strange results (misplaced, missing, unexplainable elements). In attempt to find a free, fast and reliable conversion tool I created this library.

It converts 2 - 4x faster than all other libraries I tested.

Html2Xhtml, combined with the power of LINQ to XML, is an excellent tool for all large-scale data extraction and web crawling scenarios.

Info

Version: 1.1.2.4
Author(s): deemc
Last Update: Saturday, June 4, 2011
.NET Fiddle: Create the first Fiddle
Project Url: http://corsis.sourceforge.net/index.php/Html2Xhtml
NuGet Url: https://www.nuget.org/packages/Html2Xhtml


Install
Install-Package Html2Xhtml
dotnet add package Html2Xhtml
paket add Html2Xhtml
Html2Xhtml Download (Unzip the "nupkg" after downloading)



Tags



STATS

must-have-score

1.1

avg-downloads-per-day

2

days-since-last-release

2970