NuGet Package TextDiscovery

Linear-progressive text discovery engine exposing functionality through simple service APIs.

Break plain text into a sequence of slices that can be reconstituted as annotated text. Generate metadata-rich tokens from a search expression and use them to annotate matches in the source text; noise-word detection, tokenization, and matching options are all configurable. Through a common adapter interface with interchangeable DOM libraries (HtmlAgilityPack, AngleSharp, etc.), you can mark search hits in the DOM, create HTML excerpts at a given word count with configurable element-breaking rules, and extract text content with selectively preserved formatting indicators. The engine is highly extensible through dependency injection.

Regex can be used in advanced configurations, but it is not required. See the project site for demos.
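
The slice-and-annotate idea described above can be illustrated with a small, language-neutral sketch. This is not the TextDiscovery API: the package targets .NET, and every name here (Slice, slice_text, tokenize_search, annotate, reconstitute, NOISE_WORDS) is hypothetical and chosen only for illustration. It assumes a simple word/gap split, a fixed noise-word list, and case-insensitive whole-word matching.

```python
import re
from dataclasses import dataclass

# Hypothetical noise-word list; a real engine would make this configurable.
NOISE_WORDS = {"a", "an", "and", "of", "the"}


@dataclass
class Slice:
    text: str             # exact source characters covered by this slice
    is_word: bool         # True for word runs, False for gaps (spaces, punctuation)
    is_hit: bool = False  # True when the slice matches a search token


def slice_text(text):
    """Split text into alternating word and gap slices that cover every character."""
    slices = []
    for m in re.finditer(r"(?P<word>\w+)|(?P<gap>\W+)", text):
        slices.append(Slice(m.group(0), m.group("word") is not None))
    return slices


def tokenize_search(expression, noise_words=NOISE_WORDS):
    """Turn a search expression into lowercase tokens, dropping noise words."""
    words = re.findall(r"\w+", expression.lower())
    return {w for w in words if w not in noise_words}


def annotate(slices, tokens):
    """Return new slices with is_hit set on word slices that match a token."""
    return [
        Slice(s.text, s.is_word, s.is_word and s.text.lower() in tokens)
        for s in slices
    ]


def reconstitute(slices, open_mark="[", close_mark="]"):
    """Join slices back into text, wrapping each hit in the given markers."""
    return "".join(
        open_mark + s.text + close_mark if s.is_hit else s.text for s in slices
    )


text = "The quick brown fox jumps over the lazy dog."
tokens = tokenize_search("the fox and the dog")
annotated = reconstitute(annotate(slice_text(text), tokens))
print(annotated)
```

Because the slices cover every character of the input, joining them without annotations reproduces the source text exactly; the markers are added only at reconstitution time.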

Info

Version: 1.0.3
Author(s): David West
Last Update: Monday, December 5, 2016
Project Url: https://github.com/davidwest/textdiscovery
NuGet Url: https://www.nuget.org/packages/TextDiscovery


Install
Package Manager Console: Install-Package TextDiscovery
.NET CLI: dotnet add package TextDiscovery
Paket CLI: paket add TextDiscovery

.NETFramework 4.5
  • No dependencies.
.NETStandard 1.0

2 packages depend on this package.


Stats

Must-have score: 1
Average downloads per day: 2
Days since last release: 931