chazutsu

NLP dataset manager

A tool that simplifies preparing and manipulating natural language processing datasets.

The tool to make NLP datasets ready to use

243 stars
14 watching
33 forks
Language: Python
Last commit: about 2 years ago
Linked from 1 awesome list

dataset, machine-learning, natural-language-processing

Related projects:

Repository | Description | Stars
karthikncode/nlp-datasets | A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919
kimtaro/ve | A linguistic framework for natural language processing tasks. | 216
chartbeat-labs/textacy | A Python library providing a range of natural language processing tasks and utilities. | 2,217
alexa/massive | A collection of tools and modeling code for a large multilingual Natural Language Understanding dataset. | 538
languagemachines/luiginlp | A workflow management system for Natural Language Processing tasks. | 21
goru001/inltk | A comprehensive toolkit for Natural Language Processing tasks in Indic languages, providing pre-trained models and datasets. | 822
jd-aig/nlp_baai | A collection of natural language processing models and tools for collaboration on a joint project between BAAI and JDAI. | 252
radi-cho/datasetgpt | A command-line interface to generate textual datasets with Large Language Models. | 293
pks/zipf | A Ruby NLP library providing tools and data structures for natural language processing tasks. | 3
michael-wzhu/promptcblue | A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain. | 323
piskvorky/gensim-data | A repository of pre-trained NLP models and corpora for text processing. | 988
dayyass/dayyass | A collection of libraries and tools for natural language processing and reinforcement learning. | 39
dkogan/vnlog | A toolkit for manipulating tabular ASCII data with normal UNIX tools. | 160
zaibacu/rita-dsl | A DSL for building custom NLP patterns from manual language rules. | 65
leks-forever/nllb-tuning | An experimental project for fine-tuning the NLLB language model on a specific dataset and evaluating its performance on translation tasks. | 7