ixa-pipe-pos

Language tagger and lemmatizer

Provides tools for part of speech tagging and lemmatization across multiple languages using machine learning models.

IXA pipes Part of Speech tagger and Lemmatizer (http://ixa2.si.ehu.es/ixa-pipes)

GitHub

18 stars
9 watching
15 forks
Language: Java
last commit: about 2 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
arbox/treetagger-ruby A Ruby wrapper for a statistical language modeling tool for part-of-speech tagging and chunking 16
openai/finetune-transformer-lm This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. 2,167
yohasebe/engtagger A Ruby library for assigning part-of-speech tags to English text based on a lookup dictionary and probability values. 263
languagemachines/frog An integration of memory-based natural language processing modules for Dutch 75
adbar/simplemma Lemmatization tool for natural language processing 146
vhellendoorn/code-lms A guide to using pre-trained large language models in source code analysis and generation 1,786
lyogavin/anima An optimization technique for large language models allowing them to run on limited hardware resources without significant performance loss. 9
c4n/pythonlexto A Python wrapper around the Thai word segmentator LexTo, allowing developers to easily integrate it into their applications. 1
antoniogarrote/clj-ml A Clojure library for machine learning built on top of Weka and friends 148
pemistahl/lingua An accurate language detection library for Java and the JVM suitable for both short and long text inputs. 716
olivomarco/lc4j An open-source Java library implementing text categorization and language detection using N-grams. 5
lowresourcelanguages/hltdi-morphology Provides morphological analysis tools for various languages, including verb and noun generation, based on archived web pages. 5
hemangsk/capacitor-mlkit-language An Android and iOS plugin using ML Kit for language identification on device 3
kuhumcst/cstlemma A lemmatiser tool for multiple languages using affix rules and supervised learning from full-form dictionaries. 36
datquocnguyen/rdrpostagger A toolkit for part-of-speech tagging and morphological analysis 141