NuGet Package Stanford.NLP.Segmenter

Tokenization of raw text is a standard pre-processing step for many NLP tasks.

For English, tokenization usually involves punctuation splitting and separation of some affixes like possessives. Other languages require more extensive token pre-processing, which is usually called segmentation.
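
As a rough illustration of the English case described above, the sketch below splits punctuation into separate tokens and detaches the possessive 's from the word it follows. It is a toy regular-expression tokenizer written for this page, not the Stanford tokenizer or segmenter; the function name simple_tokenize and the pattern are assumptions chosen for illustration.

```python
import re

# Illustrative pattern only (an assumption, not the Stanford rules):
#   's\b      -> possessive clitic, kept as its own token
#   \w+       -> a run of letters, digits, or underscores
#   [^\w\s]   -> any single punctuation character
_TOKEN_PATTERN = re.compile(r"'s\b|\w+|[^\w\s]")


def simple_tokenize(text):
    """Split text into word, possessive, and punctuation tokens."""
    return _TOKEN_PATTERN.findall(text)


if __name__ == "__main__":
    print(simple_tokenize("John's dog barked, loudly."))
```

A pattern like this only works because English words are separated by whitespace. Languages written without spaces between words give it nothing to split on, which is why they need a dedicated segmenter such as the one in this package.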

Info

Version: 3.9.2.0
Author(s): The Stanford Natural Language Processing Group
Last Update: Wednesday, May 1, 2019
Project Url: http://sergey-tihon.github.io/Stanford.NLP.NET/
NuGet Url: https://www.nuget.org/packages/Stanford.NLP.Segmenter


Install
Install-Package Stanford.NLP.Segmenter
dotnet add package Stanford.NLP.Segmenter
paket add Stanford.NLP.Segmenter
Manual download: Stanford.NLP.Segmenter (unzip the "nupkg" file after downloading)

Dependencies

  • IKVM (= 8.1.5717)


STATS

must-have-score: 3.8
avg-downloads-per-day: 4
days-since-last-release: 222