simplemma

Lemmatizer

Lemmatization tool for natural language processing

Simple multilingual lemmatizer for Python, especially useful for speed and efficiency

GitHub

146 stars
7 watching
12 forks
Language: Python
last commit: 28 days ago
Linked from 1 awesome list

corpus-toolslanguage-detectionlanguage-identificationlemmatiserlemmatizationlemmatizerlow-resource-nlpmorphological-analysisnlptokenizationtokenizerwordlist

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
yohasebe/lemmatizer A Ruby library that provides a lemmatizer for text in English. 108
kuhumcst/cstlemma A lemmatiser tool for multiple languages using affix rules and supervised learning from full-form dictionaries. 36
sorenlind/lemmy Lemmatizer for Danish and Swedish languages 76
adbar/german-nlp A curated collection of German language resources and tools for natural language processing 453
jpbarrette/moman A suite of tools for linguistic analysis and correction, including finite state automata manipulation and string correction algorithms. 28
leks-forever/nllb-tuning This is an experimental project for fine-tuning the NLB language model with a specific dataset and evaluating its performance on translation tasks. 7
c4n/pythonlexto A Python wrapper around the Thai word segmentator LexTo, allowing developers to easily integrate it into their applications. 1
mohamedadaly/labr A dataset of Arabic book reviews for natural language processing tasks 44
proycon/python-frog A Python binding to a C++ NLP tool for Dutch language processing tasks 47
sergioburdisso/pyss3 A Python package implementing an interpretable machine learning model for text classification with visualization tools 336
artificiai/multilingual-latent-dirichlet-allocation-lda An LDA-based text clustering pipeline for multiple languages 82
ixa-ehu/ixa-pipe-pos Provides tools for part of speech tagging and lemmatization across multiple languages using machine learning models. 18
neulab/compare-mt A tool for comparing the performance of different language generation systems. 467
alexrutherford/arabic_nlp Tools for normalizing and deriving sentiment from Arabic text 26
minibikini/paasaa Tools for detecting the language of unstructured text in Elixir applications 116