Danish-Similarity-Dataset
Danish word similarity dataset
A dataset for evaluating how well Danish language models capture word similarity.
A gold-standard resource for evaluating Danish word embedding models.
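Word similarity datasets of this kind are commonly used by comparing a model's cosine similarity for each word pair against the human score, then reporting the Spearman rank correlation. The sketch below illustrates that general procedure only. The `(word1, word2, score)` triple layout, the function names, and the toy vectors and scores are all assumptions made for illustration, since this page does not describe the dataset's actual file format or rating scale.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def evaluate(embeddings, gold_pairs):
    """Return (Spearman rho, number of gold pairs the model covers)."""
    model_scores, human_scores = [], []
    for w1, w2, score in gold_pairs:
        # Pairs with an out-of-vocabulary word are skipped, not scored as 0.
        if w1 in embeddings and w2 in embeddings:
            model_scores.append(cosine(embeddings[w1], embeddings[w2]))
            human_scores.append(score)
    if len(human_scores) < 2:
        raise ValueError("need at least two covered pairs")
    return spearman(model_scores, human_scores), len(human_scores)


# Toy vectors and scores, invented for illustration (not from the dataset).
toy_embeddings = {
    "hund": [1.0, 0.0],
    "kat": [0.9, 0.1],
    "bil": [0.0, 1.0],
    "vogn": [0.2, 0.8],
}
toy_gold = [
    ("hund", "kat", 8.0),
    ("bil", "vogn", 7.5),
    ("hund", "bil", 1.0),
    ("kat", "ukendt", 5.0),  # "ukendt" has no vector, so this pair is skipped
]

rho, covered = evaluate(toy_embeddings, toy_gold)
print(f"Spearman rho = {rho:.3f} over {covered} of {len(toy_gold)} pairs")
```

Reporting the number of covered pairs alongside the correlation matters in practice, because a model with a small vocabulary may only be scored on a subset of the gold standard.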
8 stars
5 watching
0 forks
last commit: almost 5 years ago
Linked from 1 awesome list
danish, embedding-evaluation, manual-annotations, semantic-similarity
Related projects:
| Description | Stars |
|---|---|
| A dataset of Danish reviews scraped from the internet to train sentiment classification models | 2 |
| This repository provides code and data for a named entity recognition system for the Danish language, including tools for lexical normalization. | 5 |
| A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919 |
| An open-source collection of Danish language models for natural language processing tasks | 30 |
| Extracts, prepares and publishes football dataset from Transfermarkt website | 249 |
| Library for comparing sequences of characters with various distance metrics. | 117 |
| A comprehensive lexicon of Danish words with sentiment polarity annotations | 8 |
| Calculates similarity between pieces of text using TF-IDF weights | 115 |
| A dataset of annotated sentences for training and evaluating sentiment analysis models in the Hungarian language. | 1 |
| Facilities to calculate the distance and similarity between strings using various algorithms | 61 |
| This project provides annotated data and guidelines for fine-grained sentiment analysis on Danish social media comments. | 7 |
| A collection of pre-processed machine learning datasets for use with the Torch7 deep learning framework. | 37 |
| Provides methods for evaluating word embeddings on various benchmarks | 437 |
| Provides efficient algorithms to calculate string similarity metrics | 22 |
| Training data for a handwritten recognition system | 21 |