InstructionWild
Instruction Dataset
Creating a large-scale user-based instruction dataset for natural language processing research and development
453 stars
9 watching
41 forks
last commit: 6 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
orhonovich/unnatural-instructions | A collection of automatically generated instructions for training language models. | 175 |
x2fd/lvis-instruct4v | A dataset of fine-grained visual instructions generated by prompting a large language model with images from another dataset | 131 |
flagopen/flaginstruct | A collection of diverse instruction corpora for improving the development and tuning of Chinese Language Models | 173 |
nytud/hulu | A collection of linguistic datasets and benchmarks for natural language understanding tasks | 9 |
justfollowus/natural-language-processing | Comprehensive resource for learning natural language processing (NLP) with a structured course outline and recommended readings. | 834 |
baai-wudao/model | A repository of pre-trained language models for various tasks and domains. | 121 |
michael-wzhu/promptcblue | A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain | 323 |
ffxsam/vue-typescript-cookbook | A cookbook and resource guide for developers learning Vue.js with TypeScript | 273 |
wavelets/thinkstats2 | Text and supporting code for a comprehensive statistical analysis book with accompanying software | 8 |
joelcoxokc/aurelia-interface | Provides a set of custom HTML elements and attributes to build cross-platform applications with platform-specific styles, themes, and behaviors. | 85 |
benjamintanweihao/elixir-cheatsheets | A collection of concise guides and reference materials for learning Elixir programming language and its ecosystem | 104 |
bingwen/free-programming-books | A curated list of resources for learning programming languages and software development | 49 |
mbzuai-nlp/bactrian-x | A collection of multilingual language models trained on a dataset of instructions and responses in various languages. | 94 |
zhuangbiaowei/open_source_analysis | An analysis of notable software projects | 21 |
zhuiyitechnology/pretrained-models | A collection of pre-trained language models for natural language processing tasks | 987 |