semeval-2016_2017-task3-subtaskB-english
Q&A dataset converter
Converts XML datasets to JSON for community question answering task 3 subtask b
Converter from XML SemEval 2016 and 2017 English Task 3, Subtask B datasets to JSON
1 stars
3 watching
0 forks
Language: Python
last commit: almost 7 years ago Related projects:
Repository | Description | Stars |
---|---|---|
witiko/semeval-2016_2017-task3-subtaska-unannotated-english | Converts unannotated XML datasets to JSON format for use in natural language processing tasks | 0 |
wesbos/ama | Q © A live Q¦ session where the developer answers questions on various topics related to coding, work, and life. | 281 |
ukdtom/srt2utf-8.bundle | Converts sidecar subtitle files to UTF-8 encoding and renames them with language codes. | 125 |
eulerto/wal2json | Provides an output plugin to convert PostgreSQL's Logical Decoding changeset data into JSON format | 1,326 |
antoine77340/howto100m | Provides code and tools for learning joint text-video embeddings using the HowTo100M dataset | 250 |
rnons/ted2srt | A website that downloads bilingual subtitles for TED talks. | 143 |
sublee/hangulize | Automates conversion of words from non-Korean languages to Korean using standardized rules. | 213 |
pharo-contributions/xml-xmlparserstax | A Smalltalk implementation of a streaming XML parser with DOM support | 0 |
maluuba/newsqa | Compiles and provides structured access to Maluuba's NewsQA dataset for natural language question answering research. | 253 |
jcgood/desmeme | A software framework for analyzing and visualizing linguistic templates using graphs and similarity measures. | 4 |
untra/polyglot | A plugin for Jekyll blogs that enables support for multiple languages and internationalization. | 417 |
littleyuyu/stackoverflow-question-code-dataset | A collection of mined question-code pairs from Stack Overflow used for training and testing AI models | 165 |
osuked/elexondataportal | A Python wrapper for retrieving data from the Elexon/BMRS API | 52 |
helsinki-nlp/tatoeba-challenge | A collection of machine translation datasets and tools to support real-world low-resource scenarios | 803 |