FlagInstruct
Instruction dataset
A collection of diverse instruction corpora for improving the development and tuning of Chinese Language Models
173 stars
7 watching
3 forks
last commit: almost 2 years ago Related projects:
Repository | Description | Stars |
---|---|---|
| Creating a large-scale user-based instruction dataset for natural language processing research and development | 455 |
| A dataset of fine-grained visual instructions generated by prompting a large language model with images from another dataset | 131 |
| A multimodal benchmark dataset designed to evaluate the performance of vision-language foundation models through instruction tuning. | 134 |
| A comprehensive documentation repository of FFmpeg commandline flags. | 49 |
| A collection of automatically generated instructions for training language models. | 176 |
| Autonomously generates high-quality image-text instruction fine-tuning datasets | 91 |
| Provides data and mappings for leveraging FrameNet to improve automatic event detection in natural language processing | 2 |
| A collection of speech datasets for 10 languages to support text-to-speech tasks | 467 |
| A collection of linguistic datasets and benchmarks for natural language understanding tasks | 8 |
| A foundation model capable of instruction-following data across multiple modalities without explicit supervision. | 772 |
| A dataset and implementation of a method to generate instructions based on visual data | 5 |
| A dataset of Danish reviews scraped from the internet to train sentiment classification models | 2 |
| Provides pre-trained language models and tools for fine-tuning and evaluation | 439 |
| Installs a Brainfuck compiler interpreter | 2 |
| A collection of multilingual language models trained on a dataset of instructions and responses in various languages. | 94 |