pyvi
VN Text Toolkit
A toolkit for processing Vietnamese text with tokenization, part-of-speech tagging, accents removal and addition capabilities.
Python Vietnamese Core NLP Toolkit
248 stars
12 watching
49 forks
Language: Jupyter Notebook
last commit: 5 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| A toolkit for processing and analyzing text data in Vietnamese, with tools for word segmentation, part-of-speech tagging, and dependency parsing. | 214 |
| A comprehensive toolkit for processing and analyzing Vietnamese language texts | 1,436 |
| A Vietnamese natural language processing toolkit providing annotation pipelines for key NLP components such as word segmentation and named entity recognition. | 600 |
| Pre-trained language models for Vietnamese NLP tasks | 671 |
| Custom tools to extract text from YouTube video transcripts | 63 |
| A lightweight toolkit for multilingual natural language processing tasks using transformer-based architectures. | 738 |
| Evaluates and benchmarks large language models' video understanding capabilities | 121 |
| A tool to simplify translating software source code and check existing translations. | 46 |
| A collection of tools and conversion utilities for the W3C Timed Text Markup Language (TTML) | 74 |
| Provides tools and data for training image classification models using the LSUN dataset. | 547 |
| A library for Finite-State Morphology and Constraint Grammar based NLP tasks, providing tools for tokenisation, normalisation, grammar-checking and correction. | 9 |
| Tools and techniques for improving machine translation in resource-constrained environments. | 3 |
| A collection of miscellaneous PyTorch implementations covering various machine learning concepts and techniques | 468 |
| Provides pre-trained word vectors for multiple languages to facilitate NLP tasks | 2,216 |
| A Python framework for training and applying neural networks to acoustic communication research | 78 |