underthesea

Vietnamese NLP Toolkit

A comprehensive toolkit for processing and analyzing Vietnamese language texts

Underthesea - Vietnamese NLP Toolkit

GitHub

1k stars
81 watching
274 forks
Language: Python
last commit: 3 months ago
Linked from 1 awesome list

dependency-parserdependency-parsingnamed-entity-recognitionnatural-language-processingnernlpnlp-librarypos-taggingsentence-segmentationvietnamesevietnamese-nlpvietnamese-tokenizerword-segmenter

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
vncorenlp/vncorenlp A Vietnamese natural language processing toolkit providing annotation pipelines for key NLP components such as word segmentation and named entity recognition. 600
vinairesearch/phobert Pre-trained language models for Vietnamese NLP tasks 671
trungtv/pyvi A toolkit for processing Vietnamese text with tokenization, part-of-speech tagging, accents removal and addition capabilities. 248
phuonglh/vn.vitk A toolkit for processing and analyzing text data in Vietnamese, with tools for word segmentation, part-of-speech tagging, and dependency parsing. 214
roshan-research/hazm A Python library for natural language processing tasks on Persian text, providing tools for text normalization, tokenization, lemmatization, and more. 1,219
duydo/elasticsearch-analysis-vietnamese Provides Vietnamese language analysis functionality for Elasticsearch 512
nlp-uoregon/trankit A lightweight toolkit for multilingual natural language processing tasks using transformer-based architectures. 738
pythainlp/pythainlp A Python package for text processing and linguistic analysis focused on Thai language 993
goru001/inltk A comprehensive toolkit for Natural Language Processing tasks in Indic languages, providing pre-trained models and datasets. 825
proycon/python-frog A Python binding to a C++ NLP tool for Dutch language processing tasks 47
citiususc/linguakit A multilingual NLP toolkit providing various natural language processing tasks 65
cogcomp/cogcomp-nlp A collection of libraries and tools for Natural Language Processing 473
adobe/nlp-cube A framework providing a set of Natural Language Processing tasks such as tokenization, part-of-speech tagging, and dependency parsing for multiple languages. 555
sandeep42/anuvada This is an open source PyTorch library providing tools and models to explain the predictions of deep neural networks for natural language processing tasks. 19
cltk/cltk A Python library offering natural language processing capabilities for pre-modern languages 843