Urdu NLP datasets
A collection of Urdu language datasets for NLP tasks, including POS tagging, NER, sentiment analysis, and summarization.
71 stars
4 watching
19 forks
last commit: 3 months ago
Linked from 1 awesome list
Topics: machine-learning, ner, nlp, sentiment-analysis, spacy-models, summarization, urdu-language, urdu-model
Related projects:
Repository | Description | Stars |
---|---|---|
karthikncode/nlp-datasets | A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919 |
louisowen6/nlp_bahasa_resources | A curated collection of NLP datasets and resources for Bahasa Indonesia | 489 |
urduhack/urduhack | A Python library providing a suite of natural language processing tools and utilities for the Urdu language | 283 |
fido-ai/ua-datasets | Provides a collection of datasets for natural language processing in Ukrainian. | 55 |
lang-uk/ner-uk | A Ukrainian NER corpus and annotation dataset for training and evaluating named entity recognition models. | 90 |
dayyass/dayyass | A collection of libraries and tools for natural language processing and reinforcement learning. | 39 |
sdadas/polish-nlp-resources | Pre-trained models and resources for Natural Language Processing in Polish | 323 |
balavenkatesh3322/nlp-pretrained-model | A collection of pre-trained natural language processing models | 170 |
hanzhenlei767/nlp_learn | A comprehensive collection of NLP-related code snippets and notes on various models and techniques, including pre-trained language models and Chinese text processing methods. | 25 |
andrewt3000/dl4nlp | A collection of resources and notes on deep learning for natural language processing. | 2,198 |
01-ai/yi | A series of large language models trained from scratch to excel in multiple NLP tasks | 7,699 |
novakat/nytk-nerkor-cars-ontonotespp | A large annotated dataset of Hungarian text with over 30 entity types derived from various sources and formats. | 1 |
haniehp/persianner | This project provides tools and resources for named-entity recognition in Persian language, including annotated datasets and pre-trained word embeddings. | 56 |
ajdavidl/portuguese-nlp | A collection of resources and tools for Natural Language Processing in Portuguese. | 240 |
nnlp-il/hebrew-resources | A comprehensive collection of Hebrew NLP resources and tools. | 249 |