Stanford.NLP.Segmenter

Tokenization of raw text is a standard pre-processing step for many NLP tasks.

For English, tokenization usually involves punctuation splitting and separation of some affixes like possessives. Other languages require more extensive token pre-processing, which is usually called segmentation.
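
As a rough illustration of the English case described above, the sketch below splits punctuation into separate tokens and detaches the possessive "'s" from the word it follows. It is a minimal regex-based example in Python, not the Stanford tokenizer and not part of this package's API; the name simple_tokenize and the pattern are assumptions made for illustration only.

```python
import re

# Illustrative pattern only (not the Stanford tokenizer). Alternatives are tried in order:
#   's\b      a possessive clitic, kept as its own token
#   \w+       a run of word characters
#   [^\w\s]   any single punctuation character
_TOKEN = re.compile(r"'s\b|\w+|[^\w\s]")


def simple_tokenize(text):
    """Return the tokens of `text`, splitting off punctuation and possessive 's."""
    return _TOKEN.findall(text)


if __name__ == "__main__":
    print(simple_tokenize("John's dog barked, loudly."))
```

Languages written without spaces between words, such as Chinese, cannot be handled by a pattern like this, which is why they need a dedicated segmenter.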

Info

Version:	4.2.0.2
Author(s):	The Stanford Natural Language Processing Group
Last Update:	Thursday, August 25, 2022
Project Url:	http://sergey-tihon.github.io/Stanford.NLP.NET/
NuGet Url:	https://www.nuget.org/packages/Stanford.NLP.Segmenter

Install

NuGet:

Install-Package Stanford.NLP.Segmenter

.NET CLI:

dotnet add package Stanford.NLP.Segmenter

Paket CLI:

paket add Stanford.NLP.Segmenter

Direct Download:

Stanford.NLP.Segmenter Download (unzip the "nupkg" after downloading)

Dependencies

IKVM (>= 8.1.5717 && <= 8.1.5717), which pins the dependency to exactly version 8.1.5717
