prachathai-67k
News dataset
An article classification dataset created from news articles scraped from Prachathai.com with multiple benchmark models for multi-label classification
News Article Corpus from Prachathai.com
16 stars
5 watching
10 forks
Language: Jupyter Notebook
last commit: almost 4 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| A large Thai social media text sentiment dataset with annotated labels | 77 |
| A Thai language corpus and lexicon repository for natural language processing | 142 |
| A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919 |
| A Python package for text processing and linguistic analysis focused on Thai language | 993 |
| Detects whether a given text sequence is part of the training data used to train a large language model. | 23 |
| A Python package implementing an interpretable machine learning model for text classification with visualization tools | 336 |
| A PyTorch-based toolkit for creating customized multimedia datasets and handling heterogeneous data for training AI models. | 346 |
| An implementation of convolutional neural networks for text classification using PyTorch | 66 |
| A collection of natural language processing models and tools for collaboration on a joint project between BAAI and JDAI. | 254 |
| A PyTorch project for comparing image classification models and facilitating quick experiment setup | 366 |
| This is an open source PyTorch library providing tools and models to explain the predictions of deep neural networks for natural language processing tasks. | 19 |
| A collection of Urdu language datasets for various NLP tasks and applications | 71 |
| Provides scalable, performant data loading solutions and utilities to be shared by PyTorch domain libraries | 1,149 |
| A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. | 1,236 |