unnatural-instructions
Instruction dataset
A collection of automatically generated instructions for training language models.
176 stars
7 watching
10 forks
last commit: almost 2 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| Creating a large-scale user-based instruction dataset for natural language processing research and development | 455 |
| A dataset of fine-grained visual instructions generated by prompting a large language model with images from another dataset | 131 |
| Creating a large collection of tasks and their natural language definitions/instructions to support the development of NLP models with generalization capabilities | 963 |
| A collection of diverse instruction corpora for improving the development and tuning of Chinese Language Models | 173 |
| A multimodal benchmark dataset designed to evaluate the performance of vision-language foundation models through instruction tuning. | 134 |
| A comprehensive guide to optimizing chatbot responses using prompt engineering techniques | 989 |
| A dataset and tools package designed to support the training and evaluation of large language models for molecular biology tasks | 255 |
| Generates training data for intent parsing systems by creating pairs of sentences and grammar trees from a template file | 55 |
| Creating synthetic visual reasoning instructions to improve the performance of large language models on image-related tasks | 18 |
| A collection of command-line tools and utilities for various platforms. | 74 |
| An IDA plugin that removes unnecessary bytes from instructions | 14 |
| Training data for a handwritten recognition system | 21 |
| Analyzes the frequency of instructions in Game Boy code to provide insights into coding patterns and optimization opportunities | 2 |
| Provides labeled ELF binaries for research and testing purposes. | 87 |
| A collection of multilingual language models trained on a dataset of instructions and responses in various languages. | 94 |