28 packages returned for Tags:"tokenizer"
- 43,092 total downloads
- last updated 2/10/2020
- Latest version: 1.1.1
C# Expression parser and evaluator, inspired from jokenizer project.
- 38,161 total downloads
- last updated 11/22/2021
- Latest version: 1.9.4
The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks...
More information
- 7,449 total downloads
- last updated 3/1/2015
- Latest version: 1.0.6
Tokenizador (Xamarin) para Conekta. Necesitas tener alguna libreria de servidor para usar el token.
- 4,734 total downloads
- last updated 10/24/2015
- Latest version: 1.0.5
VBF.Compilers.Scanners is a scanner builder. It contains a regular expression to DFA engine, can generate high performance scanners for unicode source text.
- 10,986 total downloads
- last updated 12/12/2021
- Latest version: 4.1.0
A .NET class library that makes it easier to parse text. The library tracks the current position within the text, ensures your code never attempts to access a character at an invalid index, and includes many methods that make parsing easier. The library makes your text-parsing code more concise and...
More information
owl
by:
SplittyDev
- 4,616 total downloads
- last updated 10/21/2014
- Latest version: 1.0.5407.37498
A html preprocessor which allows you to write html code using a beautiful syntax
- 2,380 total downloads
- last updated 11/26/2020
- Latest version: 1.3.0
Trl.PegParser contains a tokenizer and a parser. The tokenizer uses regular expressions to define tokens, and exposes both matched and unmatched character ranges. The PEG Parser uses parsing expression grammers with tokens produced by the tokenizer. Trl.PegParser is build on .NET Standard 2.1 for...
More information
- 3,388 total downloads
- last updated 2/26/2020
- Latest version: 1.9.1
The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks...
More information
- 1,197 total downloads
- last updated 3/19/2022
- Latest version: 0.2.9
Experimental code that might become part of Loretta.CodeAnalysis.Lua.
- 648 total downloads
- last updated 4/13/2022
- Latest version: 1.1.0
This package contains tokenizers for following models:
· BERT Base
· BERT Large
· BERT German
· BERT Multilingual
· BERT Base Uncased
· BERT Large Uncased
- 1,310 total downloads
- last updated 3/19/2022
- Latest version: 0.2.9
A shared package used by Loretta.
Do not install this package manually, it will be added as a prerequisite by other packages that require it.
XLemmatizer
by:
rchristen
- 374 total downloads
- last updated 5/29/2020
- Latest version: 0.1.0
Pre-release version. API might change later.
A lemma is the canonical form of the word. For example, the words "run", "runs", "ran" and "running" can be lemmatized to "run"
XLemmatizer tokenizes and lemmatizes English sentences. How to use:
1) Creates a new instance of Lemmatizer
2) Calls...
More information
- 1,304 total downloads
- last updated 3/19/2022
- Latest version: 0.2.9
A GLua/Lua lexer, parser, code analysis, transformation and generation library.
- 4,385 total downloads
- last updated 10/23/2019
- Latest version: 1.1.0
NLQuery: natural language query parser recognizes entities in context of structured sources (like tabular dataset). Can be used for building natural language interface to SQL database or OLAP cube, implementing custom app-specific search.
- 666 total downloads
- last updated 6/7/2021
- Latest version: 5.0.0-alpha09
Extension methods to integrate GParse with Tsu.StateMachines
- Previous
- Next