GraphQuestions
QA dataset
A characteristic-rich dataset for factoid question answering with explicit specification of question characteristics and logical forms, described in the paper "On Generating Characteristic-rich Question Sets for QA Evaluation" (EMNLP 2016).
92 stars
4 watching
16 forks
Language: ReScript
last commit: about 2 years ago
Linked from 1 awesome list
Related projects:
| Description | Stars |
|---|---|
| A dataset and benchmarking framework for evaluating the performance of large language models on multi-turn question answering tasks for scientific graphs. | 38 |
| A collection of data to train chatbots on COVID-19-related questions | 11 |
| Tools and codebase for training neural question answering models on multiple paragraphs of text data | 435 |
| A collection of mined question-code pairs from Stack Overflow used for training and testing AI models | 166 |
| Compiles and provides structured access to Maluuba's NewsQA dataset for natural language question answering research. | 253 |
| A dataset collection providing text documents with corresponding summaries and questions. | 463 |
| Developing a standardized data schema for Quantified Self data to enable interoperability and collaboration among users and researchers. | 1 |
| A VQA dataset with unanswerable questions designed to test the limits of large models' knowledge and reasoning abilities. | 3 |
| A JavaScript-based question classification system inspired by a research paper, designed to categorize questions into four categories. | 160 |
| A syntax extension for writing SQL queries in OCaml with type inference and syntax checking. | 138 |
| Provides a simple way to perform question answering using a pre-trained model in Node.js | 466 |
| A Python-based question answering system built on top of Notion's database and OpenAI's API for natural language processing. | 2,139 |
| An insurance industry conversation corpus with pre-processed data for natural language processing and question answering tasks. | 1,019 |
| A resource-efficient method for pretraining dense corpus indexes for open-domain QA and IR. | 43 |
| A large-scale medical question-and-answer dataset with over 26 million high-quality pairs, designed for natural language processing and machine learning applications in the medical field. | 226 |