UD_Vietnamese-VTB

Vietnamese treebank

An annotated corpus of Vietnamese language structure

GitHub

36 stars
81 watching
9 forks
last commit: 9 days ago
Linked from 1 awesome list


Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| universaldependencies/ud_ukrainian-iu | A dataset of annotated text in Ukrainian with standardized formatting and annotation guidelines. | 28 |
| universaldependencies/ud_galician-treegal | A treebank for the Galician language with annotated syntactic and morphological features. | 6 |
| universaldependencies/ud_hungarian-szeged | A corpus of annotated Hungarian text data for machine learning and natural language processing tasks | 5 |
| universaldependencies/ud_galician-ctg | This is a collection of annotated text data for the Galician language. | 1 |
| vinairesearch/phobert | Pre-trained language models for Vietnamese NLP tasks | 663 |
| duydo/elasticsearch-analysis-vietnamese | Provides Vietnamese language analysis functionality for Elasticsearch | 510 |
| phuonglh/vn.vitk | A toolkit for processing and analyzing text data in Vietnamese, with tools for word segmentation, part-of-speech tagging, and dependency parsing. | 214 |
| famrashel/idn-treebank | A manually tagged Indonesian corpus consisting of parse-trees from sentences. | 36 |
| universaldependencies/docs | An online documentation repository providing detailed resources and guides for the Universal Dependencies project | 273 |
| phpvietnam/bookmarks | A collection of resources and links for learning and sharing knowledge about PHP programming in Vietnam. | 0 |
| qhungngo/evbcorpus | A large-scale bilingual corpus collection for language technology and NLP tasks, containing English-Vietnamese translations and bitexts. | 42 |
| nytud/hulu | A collection of linguistic datasets and benchmarks for natural language understanding tasks | 9 |
| undertheseanlp/underthesea | A comprehensive toolkit for processing and analyzing Vietnamese language texts | 1,414 |
| dbuenzli/uutf | A non-blocking streaming codec for Unicode encoding schemes | 32 |
| libp2p/js-libp2p-udt | A Node.js implementation of the UDT module used in peer-to-peer networking | 3 |