PWS

Winograd datasets

A collection of parallel corpora of Winograd schemata in multiple languages

GitHub

0 stars
3 watching
0 forks
Language: TeX
last commit: almost 2 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
nytud/huws A dataset of manually curated Hungarian sentences with ambiguous wordings that require world knowledge and reasoning for resolution. 1
nytud/husst A dataset of annotated sentences for training and evaluating sentiment analysis models in the Hungarian language. 1
nytud/hulu A collection of linguistic datasets and benchmarks for natural language understanding tasks 8
nytud/happ A dataset of Hungarian translations of human-language examples to test anaphora resolution algorithms 1
nytud/panmorph Harmonized tagset and annotation scheme for Hungarian morphological analysers 4
nytud/emtsv A text processing system designed to handle various tasks in Hungarian language processing using Python and TSV-based data exchange. 28
nytud/hadifogoly-adatbazis An attempt to transcribe Cyrillic text into Hungarian script for a large dataset of WWII prisoner-of-war records 23
nytud/hucola A collection of 9,076 annotated sentences in Hungarian to evaluate linguistic acceptability and grammaticality 1
novakat/nytk-nerkor-cars-ontonotespp A large annotated dataset of Hungarian text with over 30 entity types derived from various sources and formats. 1
nytud/hunlp-gate A collection of Hungarian NLP tools integrated as GATE processing resources 8
pythainlp/prachathai-67k An article classification dataset created from news articles scraped from Prachathai.com with multiple benchmark models for multi-label classification 16
svenkreiss/pysparkling A lightweight Python implementation of Spark's RDD and DStream interfaces for improved performance on small datasets 262
universaldependencies/ud_hungarian-szeged This repository provides a Hungarian language treebank dataset in the Universal Dependencies format. 5
nytud/machine-translation Provides machine translation models and a demo site for Hungarian language translations 5
gsoh/ved A large-scale dataset capturing vehicle energy consumption and usage patterns in real-world driving scenarios. 94