FlagInstruct
Instruction dataset
A collection of diverse instruction corpora for improving the development and tuning of Chinese Language Models
173 stars
7 watching
3 forks
last commit: over 1 year ago Related projects:
Repository | Description | Stars |
---|---|---|
xuefuzhao/instructionwild | Creating a large-scale user-based instruction dataset for natural language processing research and development | 453 |
x2fd/lvis-instruct4v | A dataset of fine-grained visual instructions generated by prompting a large language model with images from another dataset | 131 |
vt-nlp/multiinstruct | A multimodal benchmark dataset designed to evaluate the performance of vision-language foundation models through instruction tuning. | 133 |
transitive-bullshit/ffmpeg-cli-flags | A comprehensive documentation repository of FFmpeg commandline flags. | 47 |
orhonovich/unnatural-instructions | A collection of automatically generated instructions for training language models. | 175 |
opendatalab/vigc | Autonomously generates high-quality image-text instruction fine-tuning datasets | 90 |
liushulinle/events_in_framenet | Provides data and mappings for leveraging FrameNet to improve automatic event detection in natural language processing | 2 |
kyubyong/css10 | A collection of speech datasets for 10 languages to support text-to-speech tasks | 465 |
nytud/hulu | A collection of linguistic datasets and benchmarks for natural language understanding tasks | 9 |
yxuansu/pandagpt | A foundation model capable of instruction-following data across multiple modalities without explicit supervision. | 764 |
ncsoft/cap2qa | A dataset and implementation of a method to generate instructions based on visual data | 5 |
alessandrogianfelici/danish_reviews_dataset | A dataset of Danish reviews scraped from the internet to train sentiment classification models | 2 |
flagai-open/aquila2 | Provides pre-trained language models and tools for fine-tuning and evaluation | 437 |
fabasoad/setup-brainfuck-action | Installs a Brainfuck compiler interpreter | 2 |
mbzuai-nlp/bactrian-x | A collection of multilingual language models trained on a dataset of instructions and responses in various languages. | 94 |