datasetGPT
LLM dataset generator
A command-line interface to generate textual and conversational datasets with LLMs.
293 stars
4 watching
19 forks
Language: Python
last commit: about 1 year ago
Linked from 1 awesome list
cli, dataset-generation, large-language-models, python3
Related projects:
Repository | Description | Stars |
---|---|---|
thu-coai/cdial-gpt | A large-scale Chinese conversation dataset and pre-trained dialog models for text generation | 1,782 |
pratyushmaini/llm_dataset_inference | Detects whether a given text sequence is part of the training data used to train a large language model. | 23 |
piskvorky/gensim-data | A repository of pre-trained NLP models and corpora for text processing. | 988 |
gorilla-llm/gorilla-cli | An AI-powered command-line interface that generates potential commands based on user input and suggests the best course of action. | 1,297 |
rodrigopivi/chatito | A tool for generating datasets for AI chatbots and natural language processing tasks using a simple domain-specific language. | 876 |
chakki-works/chazutsu | A tool that simplifies the process of preparing and manipulating natural language processing datasets | 243 |
samholt/l2mac | Automates large code generation and writing tasks using a large language model framework | 70 |
nanbeige/nanbeige | Develops large language models for text understanding and generation tasks. | 85 |
karthikncode/nlp-datasets | A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919 |
melih-unsal/demogpt | A comprehensive toolset for building Large Language Model (LLM) based applications | 1,710 |
r2d4/openlm | Library that provides a unified API to interact with various Large Language Models (LLMs) | 366 |
damo-nlp-sg/videollama2 | An audio-visual language model designed to understand and generate video content | 871 |
per9000/lorem | Generates random text in various styles and formats. | 81 |
patterns-ai-core/langchainrb | A Ruby library providing an interface to Large Language Model (LLM) providers for text generation and embedding | 1,415 |
bin123apple/autocoder | An AI model designed to generate and execute code automatically | 814 |