InstructionWild

Instruction Dataset

Creating a large-scale user-based instruction dataset for natural language processing research and development

GitHub

453 stars
9 watching
41 forks
last commit: 6 months ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
orhonovich/unnatural-instructions A collection of automatically generated instructions for training language models. 175
x2fd/lvis-instruct4v A dataset of fine-grained visual instructions generated by prompting a large language model with images from another dataset 131
flagopen/flaginstruct A collection of diverse instruction corpora for improving the development and tuning of Chinese Language Models 173
nytud/hulu A collection of linguistic datasets and benchmarks for natural language understanding tasks 9
justfollowus/natural-language-processing Comprehensive resource for learning natural language processing (NLP) with a structured course outline and recommended readings. 834
baai-wudao/model A repository of pre-trained language models for various tasks and domains. 121
michael-wzhu/promptcblue A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain 323
ffxsam/vue-typescript-cookbook A cookbook and resource guide for developers learning Vue.js with TypeScript 273
wavelets/thinkstats2 Text and supporting code for a comprehensive statistical analysis book with accompanying software 8
joelcoxokc/aurelia-interface Provides a set of custom HTML elements and attributes to build cross-platform applications with platform-specific styles, themes, and behaviors. 85
benjamintanweihao/elixir-cheatsheets A collection of concise guides and reference materials for learning Elixir programming language and its ecosystem 104
bingwen/free-programming-books A curated list of resources for learning programming languages and software development 49
mbzuai-nlp/bactrian-x A collection of multilingual language models trained on a dataset of instructions and responses in various languages. 94
zhuangbiaowei/open_source_analysis An analysis of notable software projects 21
zhuiyitechnology/pretrained-models A collection of pre-trained language models for natural language processing tasks 987