simplemma
Lemmatizer
Lemmatization tool for natural language processing
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
146 stars
7 watching
12 forks
Language: Python
last commit: 28 days ago
Linked from 1 awesome list
corpus-toolslanguage-detectionlanguage-identificationlemmatiserlemmatizationlemmatizerlow-resource-nlpmorphological-analysisnlptokenizationtokenizerwordlist
Related projects:
Repository | Description | Stars |
---|---|---|
yohasebe/lemmatizer | A Ruby library that provides a lemmatizer for text in English. | 108 |
kuhumcst/cstlemma | A lemmatiser tool for multiple languages using affix rules and supervised learning from full-form dictionaries. | 36 |
sorenlind/lemmy | Lemmatizer for Danish and Swedish languages | 76 |
adbar/german-nlp | A curated collection of German language resources and tools for natural language processing | 453 |
jpbarrette/moman | A suite of tools for linguistic analysis and correction, including finite state automata manipulation and string correction algorithms. | 28 |
leks-forever/nllb-tuning | This is an experimental project for fine-tuning the NLB language model with a specific dataset and evaluating its performance on translation tasks. | 7 |
c4n/pythonlexto | A Python wrapper around the Thai word segmentator LexTo, allowing developers to easily integrate it into their applications. | 1 |
mohamedadaly/labr | A dataset of Arabic book reviews for natural language processing tasks | 44 |
proycon/python-frog | A Python binding to a C++ NLP tool for Dutch language processing tasks | 47 |
sergioburdisso/pyss3 | A Python package implementing an interpretable machine learning model for text classification with visualization tools | 336 |
artificiai/multilingual-latent-dirichlet-allocation-lda | An LDA-based text clustering pipeline for multiple languages | 82 |
ixa-ehu/ixa-pipe-pos | Provides tools for part of speech tagging and lemmatization across multiple languages using machine learning models. | 18 |
neulab/compare-mt | A tool for comparing the performance of different language generation systems. | 467 |
alexrutherford/arabic_nlp | Tools for normalizing and deriving sentiment from Arabic text | 26 |
minibikini/paasaa | Tools for detecting the language of unstructured text in Elixir applications | 116 |