FlagInstruct

Instruction dataset

A collection of diverse instruction corpora for improving the development and tuning of Chinese Language Models

GitHub

173 stars
7 watching
3 forks
last commit: over 1 year ago

Related projects:

Repository Description Stars
xuefuzhao/instructionwild Creating a large-scale user-based instruction dataset for natural language processing research and development 453
x2fd/lvis-instruct4v A dataset of fine-grained visual instructions generated by prompting a large language model with images from another dataset 131
vt-nlp/multiinstruct A multimodal benchmark dataset designed to evaluate the performance of vision-language foundation models through instruction tuning. 133
transitive-bullshit/ffmpeg-cli-flags A comprehensive documentation repository of FFmpeg commandline flags. 47
orhonovich/unnatural-instructions A collection of automatically generated instructions for training language models. 175
opendatalab/vigc Autonomously generates high-quality image-text instruction fine-tuning datasets 90
liushulinle/events_in_framenet Provides data and mappings for leveraging FrameNet to improve automatic event detection in natural language processing 2
kyubyong/css10 A collection of speech datasets for 10 languages to support text-to-speech tasks 465
nytud/hulu A collection of linguistic datasets and benchmarks for natural language understanding tasks 9
yxuansu/pandagpt A foundation model capable of instruction-following data across multiple modalities without explicit supervision. 764
ncsoft/cap2qa A dataset and implementation of a method to generate instructions based on visual data 5
alessandrogianfelici/danish_reviews_dataset A dataset of Danish reviews scraped from the internet to train sentiment classification models 2
flagai-open/aquila2 Provides pre-trained language models and tools for fine-tuning and evaluation 437
fabasoad/setup-brainfuck-action Installs a Brainfuck compiler interpreter 2
mbzuai-nlp/bactrian-x A collection of multilingual language models trained on a dataset of instructions and responses in various languages. 94