underthesea

Vietnamese NLP Toolkit

A comprehensive toolkit for processing and analyzing Vietnamese language texts

Underthesea - Vietnamese NLP Toolkit

GitHub

1k stars
81 watching
273 forks
Language: Python
last commit: 25 days ago
Linked from 1 awesome list

dependency-parserdependency-parsingnamed-entity-recognitionnatural-language-processingnernlpnlp-librarypos-taggingsentence-segmentationvietnamesevietnamese-nlpvietnamese-tokenizerword-segmenter

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
vncorenlp/vncorenlp A Vietnamese natural language processing toolkit providing annotation pipelines for key NLP components such as word segmentation and named entity recognition. 592
vinairesearch/phobert Pre-trained language models for Vietnamese NLP tasks 663
trungtv/pyvi A toolkit for processing Vietnamese text with tokenization, part-of-speech tagging, accents removal and addition capabilities. 245
phuonglh/vn.vitk A toolkit for processing and analyzing text data in Vietnamese, with tools for word segmentation, part-of-speech tagging, and dependency parsing. 214
roshan-research/hazm A Python library for natural language processing tasks on Persian text, providing tools for text normalization, tokenization, lemmatization, and more. 1,208
duydo/elasticsearch-analysis-vietnamese Provides Vietnamese language analysis functionality for Elasticsearch 510
nlp-uoregon/trankit A lightweight toolkit for multilingual natural language processing tasks using transformer-based architectures. 736
pythainlp/pythainlp A Python package for text processing and linguistic analysis focused on the Thai language. 987
goru001/inltk A comprehensive toolkit for Natural Language Processing tasks in Indic languages, providing pre-trained models and datasets. 822
proycon/python-frog A Python binding to a C++ NLP tool for Dutch language processing tasks 47
citiususc/linguakit A multilingual NLP toolkit providing various natural language processing tasks 65
cogcomp/cogcomp-nlp A collection of libraries and tools for Natural Language Processing 472
adobe/nlp-cube A framework providing a set of Natural Language Processing tasks such as tokenization, part-of-speech tagging, and dependency parsing for multiple languages. 554
sandeep42/anuvada This is an open source PyTorch library providing tools and models to explain the predictions of deep neural networks for natural language processing tasks. 19
cltk/cltk A Python library offering natural language processing capabilities for pre-modern languages 839