Top 20 NuGet Tokenizer Packages
Tokenizer extracts structured information from blocks of text and reflects it onto .NET objects
NLTK Python library wrapper for .NET
Extension methods to integrate GParse with Tsu.StateMachines
Parsing and lexing utilities to create your own parser and lexer
The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks ...
C# expression parser and evaluator, inspired by the jokenizer project.
Powerful Brainfuck language interpreter with the ability to tokenize and validate Brainfuck code
A .NET class library that makes it easier to parse text. The library tracks the current position within the text, ensures your code never attempts to access a character at an invalid index, and includes many methods that make parsing easier. The library makes your text-parsing code more concise and ...
Trl.PegParser contains a tokenizer and a parser. The tokenizer uses regular expressions to define tokens, and exposes both matched and unmatched character ranges. The PEG parser uses parsing expression grammars with tokens produced by the tokenizer. Trl.PegParser is built on .NET Standard 2.1 for cr...
Tokenizer for social media posts and comments
A lightweight CSS tokenizer/parser with no dependencies.
NLQuery: a natural language query parser that recognizes entities in the context of structured sources (such as a tabular dataset). It can be used to build a natural language interface to a SQL database or OLAP cube, or to implement custom app-specific search.
The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks ...
A Farsi (Persian) language checking and NLP library. This package includes both the library and its required data files.
Required data files for the Virastyar library.
A Farsi (Persian) language checking and NLP library.
Stringe is a wrapper for the .NET String object that tracks line, column, offset, and other metadata for substrings.
Ccr.Parsing package for writing parsers, lexers, tokenizers, etc.
VBF.Compilers.Scanners is a scanner builder. It contains a regular-expression-to-DFA engine and can generate high-performance scanners for Unicode source text.
WQuery enables parsing and then editing HTML code through a fluent interface, much like the jQuery library. WQuery is part of the Wojdav Bootstrap Mvc package. HTML parsing is based on the WHtmlParser library. For now, WHtmlParser contains some limitation...