Urdu

NLP datasets

A collection of Urdu-language datasets for NLP tasks, including POS tagging, NER, sentiment analysis, and summarization.

71 stars
4 watching
19 forks
last commit: 3 months ago
Linked from 1 awesome list

machine-learning, ner, nlp, sentiment-analysis, spacy-models, summarization, urdu-language, urdu-model

Related projects:

Repository | Description | Stars
karthikncode/nlp-datasets | A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919
louisowen6/nlp_bahasa_resources | A curated collection of NLP datasets and resources for Bahasa Indonesia | 489
urduhack/urduhack | A Python library providing a suite of natural language processing tools and utilities for the Urdu language | 283
fido-ai/ua-datasets | Provides a collection of datasets for natural language processing in Ukrainian. | 55
lang-uk/ner-uk | A Ukrainian NER corpus and annotation dataset for training and evaluating named entity recognition models. | 90
dayyass/dayyass | A collection of libraries and tools for natural language processing and reinforcement learning. | 39
sdadas/polish-nlp-resources | Pre-trained models and resources for Natural Language Processing in Polish | 323
balavenkatesh3322/nlp-pretrained-model | A collection of pre-trained natural language processing models | 170
hanzhenlei767/nlp_learn | A comprehensive collection of NLP-related code snippets and notes on various models and techniques, including pre-trained language models and Chinese text processing methods. | 25
andrewt3000/dl4nlp | A collection of resources and notes on deep learning for natural language processing. | 2,198
01-ai/yi | A series of large language models trained from scratch to excel in multiple NLP tasks | 7,699
novakat/nytk-nerkor-cars-ontonotespp | A large annotated dataset of Hungarian text with over 30 entity types derived from various sources and formats. | 1
haniehp/persianner | This project provides tools and resources for named-entity recognition in Persian language, including annotated datasets and pre-trained word embeddings. | 56
ajdavidl/portuguese-nlp | A collection of resources and tools for Natural Language Processing in Portuguese. | 240
nnlp-il/hebrew-resources | A comprehensive collection of Hebrew NLP resources and tools. | 249