Top 20 NuGet spider Packages
SQLite-based storage engine for SimpleSpider
See examples and documentation on the GitHub page
Helps you use HtmlAgilityPack (HAP) in an easier and more meaningful way via reflection.
It works somewhat like Entity Framework. See the wiki on the GitHub page for a tutorial:
https://github.com/parsalotfy/HtmlAgilityPack_Helper/wiki
Stateful programmatic web browsing, based on Python-Mechanize, which is based on Andy Lester’s Perl module WWW::Mechanize.
FSharp Web Crawler
This is an agile HTML parser that builds a read/write DOM and supports plain XPath or XSLT (you don't actually have to understand XPath or XSLT to use it, don't worry...). It is a .NET code library that allows you to parse "out of the web" HTML files. The parser is very tolerant of "real world" m...
Core code for a plugin-based crawler
A CEF-based downloader for the plugin-based crawler
An SDK client for making calls to a ScrapyRT HTTP endpoint.
Security Spider