datasetGPT
LLM dataset generator
A command-line interface to generate textual and conversational datasets with LLMs.
293 stars
4 watching
19 forks
Language: Python
Last commit: over 1 year ago
Linked from 1 awesome list
cli, dataset-generation, large-language-models, python3
Related projects:
Repository | Description | Stars |
---|---|---|
thu-coai/cdial-gpt | A large-scale Chinese conversation dataset and pre-trained dialog models for text generation | 1,799 |
pratyushmaini/llm_dataset_inference | Detects whether a given text sequence is part of the training data used to train a large language model. | 23 |
piskvorky/gensim-data | A repository of pre-trained NLP models and corpora for text processing. | 990 |
gorilla-llm/gorilla-cli | An AI-powered command-line interface that generates potential commands based on user input and suggests the best course of action. | 1,305 |
rodrigopivi/chatito | A tool for generating datasets for AI chatbots and natural language processing tasks using a simple domain-specific language. | 877 |
chakki-works/chazutsu | A tool that simplifies the process of preparing and manipulating natural language processing datasets | 243 |
samholt/l2mac | Automates large code generation and writing tasks using a large language model framework | 79 |
nanbeige/nanbeige | Develops large language models for text understanding and generation tasks. | 85 |
karthikncode/nlp-datasets | A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919 |
melih-unsal/demogpt | A comprehensive toolset for building Large Language Model (LLM) based applications | 1,733 |
r2d4/openlm | Library that provides a unified API to interact with various Large Language Models (LLMs) | 367 |
damo-nlp-sg/videollama2 | An audio-visual language model designed to advance spatial-temporal modeling and audio understanding in video processing. | 957 |
per9000/lorem | Generates random text in various styles and formats. | 83 |
patterns-ai-core/langchainrb | A Ruby library providing an interface to Large Language Model (LLM) providers for text generation and embedding | 1,487 |
bin123apple/autocoder | An AI model designed to generate and execute code automatically | 816 |