datasetGPT

LLM dataset generator

A command-line interface to generate textual and conversational datasets with LLMs.

293 stars
4 watching
19 forks
Language: Python
Last commit: about 1 year ago
Linked from 1 awesome list

Topics: cli, dataset-generation, large-language-models, python3

Related projects:

| Repository | Description | Stars |
|---|---|---|
| thu-coai/cdial-gpt | A large-scale Chinese conversation dataset and pre-trained dialog models for text generation | 1,782 |
| pratyushmaini/llm_dataset_inference | Detects whether a given text sequence is part of the training data used to train a large language model. | 23 |
| piskvorky/gensim-data | A repository of pre-trained NLP models and corpora for text processing. | 988 |
| gorilla-llm/gorilla-cli | An AI-powered command-line interface that generates potential commands based on user input and suggests the best course of action. | 1,297 |
| rodrigopivi/chatito | A tool for generating datasets for AI chatbots and natural language processing tasks using a simple domain-specific language. | 876 |
| chakki-works/chazutsu | A tool that simplifies the process of preparing and manipulating natural language processing datasets | 243 |
| samholt/l2mac | Automates large code generation and writing tasks using a large language model framework | 70 |
| nanbeige/nanbeige | Develops large language models for text understanding and generation tasks. | 85 |
| karthikncode/nlp-datasets | A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919 |
| melih-unsal/demogpt | A comprehensive toolset for building Large Language Model (LLM) based applications | 1,710 |
| r2d4/openlm | Library that provides a unified API to interact with various Large Language Models (LLMs) | 366 |
| damo-nlp-sg/videollama2 | An audio-visual language model designed to understand and generate video content | 871 |
| per9000/lorem | Generates random text in various styles and formats. | 81 |
| patterns-ai-core/langchainrb | A Ruby library providing an interface to Large Language Model (LLM) providers for text generation and embedding | 1,415 |
| bin123apple/autocoder | An AI model designed to generate and execute code automatically | 814 |