Html2Xhtml 1.1.2.4
Html2Xhtml is a .NET 4.0 library for converting HTML to XHTML, licensed under GPLv2 or later.
I tested Html2Xhtml on a local reconstruction of a large online database of the European Union. Tidy/Tidy.NET failed to produce valid output most of the time, and Chilkat's HTML-to-XML was somewhat slow and produced strange results (misplaced, missing, or unexplained elements). I created this library in an attempt to find a free, fast, and reliable conversion tool. In my tests it converted 2 to 4 times faster than every other library I tried.
Html2Xhtml, combined with the power of LINQ to XML, is well suited to large-scale data extraction and web crawling scenarios.
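To illustrate the LINQ to XML half of that combination, here is a minimal sketch of querying a well-formed XHTML string for link targets. It does not call Html2Xhtml itself, since the library's API is not described on this page. The sample string stands in for converter output, and the `XhtmlLinks` class and `ExtractHrefs` method are hypothetical names chosen for this example. It also assumes the XHTML declares the standard `http://www.w3.org/1999/xhtml` namespace; if the converted output carries no namespace, the element name would need to be queried unqualified instead.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using System.Xml.Linq;

// Hypothetical helper for this example; not part of Html2Xhtml.
public static class XhtmlLinks
{
    // Assumption: the XHTML uses the standard XHTML namespace,
    // so element names must be qualified with it in queries.
    private static readonly XNamespace Xhtml = "http://www.w3.org/1999/xhtml";

    // Returns the href value of every <a> element that has a non-empty href.
    public static List<string> ExtractHrefs(string xhtml)
    {
        XDocument doc = XDocument.Parse(xhtml);
        return doc.Descendants(Xhtml + "a")
                  .Select(a => (string)a.Attribute("href")) // null when href is absent
                  .Where(href => !string.IsNullOrEmpty(href))
                  .ToList();
    }

    public static void Main()
    {
        // Stand-in for converter output; written by hand for this sketch.
        string xhtml =
            "<html xmlns=\"http://www.w3.org/1999/xhtml\">" +
            "<head><title>Sample</title></head>" +
            "<body><p><a href=\"/first\">First</a> and " +
            "<a href=\"/second\">Second</a></p></body>" +
            "</html>";

        foreach (string href in ExtractHrefs(xhtml))
        {
            Console.WriteLine(href);
        }
    }
}
```

Because `XDocument.Parse` rejects markup that is not well-formed XML, this kind of query only works once the HTML has been converted to XHTML, which is the step a converter such as Html2Xhtml is meant to provide.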
Install-Package Html2Xhtml -Version 1.1.2.4
dotnet add package Html2Xhtml --version 1.1.2.4
<PackageReference Include="Html2Xhtml" Version="1.1.2.4" />
paket add Html2Xhtml --version 1.1.2.4
#r "nuget: Html2Xhtml, 1.1.2.4"
// Install Html2Xhtml as a Cake Addin
#addin nuget:?package=Html2Xhtml&version=1.1.2.4

// Install Html2Xhtml as a Cake Tool
#tool nuget:?package=Html2Xhtml&version=1.1.2.4
Dependencies
This package has no dependencies.
Used By
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.
Version History
Version | Downloads | Last updated |
---|---|---|
1.1.2.4 | 18,250 | 6/4/2011 |