datasetGPT

LLM dataset generator

A command-line interface to generate textual datasets with Large Language Models

A command-line interface to generate textual and conversational datasets with LLMs.

GitHub

293 stars
4 watching
19 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list

clidataset-generationlarge-language-modelspython3

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
thu-coai/cdial-gpt A large-scale Chinese conversation dataset and pre-trained dialog models for text generation 1,799
pratyushmaini/llm_dataset_inference Detects whether a given text sequence is part of the training data used to train a large language model. 23
piskvorky/gensim-data A repository of pre-trained NLP models and corpora for text processing. 990
gorilla-llm/gorilla-cli An AI-powered command-line interface that generates potential commands based on user input and suggests the best course of action. 1,305
rodrigopivi/chatito A tool for generating datasets for AI chatbots and natural language processing tasks using a simple domain-specific language. 877
chakki-works/chazutsu A tool that simplifies the process of preparing and manipulating natural language processing datasets 243
samholt/l2mac Automates large code generation and writing tasks using a large language model framework 79
nanbeige/nanbeige Develops large language models for text understanding and generation tasks. 85
karthikncode/nlp-datasets A curated list of Natural Language Processing datasets used to train and evaluate NLP models. 919
melih-unsal/demogpt A comprehensive toolset for building Large Language Model (LLM) based applications 1,733
r2d4/openlm Library that provides a unified API to interact with various Large Language Models (LLMs) 367
damo-nlp-sg/videollama2 An audio-visual language model designed to advance spatial-temporal modeling and audio understanding in video processing. 957
per9000/lorem Generates random text in various styles and formats. 83
patterns-ai-core/langchainrb A Ruby library providing an interface to Large Language Model (LLM) providers for text generation and embedding 1,487
bin123apple/autocoder An AI model designed to generate and execute code automatically 816