Danish-Similarity-Dataset
Danish word similarity dataset
A dataset for evaluating how well Danish language models capture the similarity between words.
Gold standard resource for evaluation of Danish word embedding models.
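A gold-standard similarity resource is commonly used by scoring each word pair with the model (for example, cosine similarity between embeddings) and computing the Spearman rank correlation against the human ratings. The following is a minimal sketch of that general procedure, not this repository's own evaluation code. The function names, the toy embeddings, and the word pairs with their scores are all made up for illustration and are not taken from the dataset; its actual file format and rating scale are not described here.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def evaluate(embeddings, gold_pairs):
    """Return (Spearman rho, number of pairs skipped as out-of-vocabulary)."""
    model_scores, gold_scores, skipped = [], [], 0
    for word1, word2, gold in gold_pairs:
        if word1 in embeddings and word2 in embeddings:
            model_scores.append(cosine(embeddings[word1], embeddings[word2]))
            gold_scores.append(gold)
        else:
            skipped += 1
    return spearman(model_scores, gold_scores), skipped


# Hypothetical toy data, NOT from the dataset.
toy_embeddings = {
    "hund": [1.0, 0.1, 0.0],
    "kat": [0.9, 0.2, 0.0],
    "bil": [0.0, 1.0, 0.1],
    "tog": [0.1, 0.9, 0.3],
    "æble": [0.0, 0.1, 1.0],
}
toy_gold_pairs = [
    ("hund", "kat", 8.5),
    ("bil", "tog", 7.0),
    ("hund", "bil", 2.0),
    ("kat", "æble", 1.0),
    ("hund", "pære", 1.5),  # "pære" has no embedding, so this pair is skipped
]

rho, skipped = evaluate(toy_embeddings, toy_gold_pairs)
print(f"Spearman rho: {rho:.3f}, skipped pairs: {skipped}")
```

Rank correlation is a common choice here because it only compares the ordering of pairs, so the model's similarity scale does not need to match the annotators' rating scale. Reporting the number of skipped pairs alongside the correlation makes vocabulary coverage visible.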
8 stars
5 watching
0 forks
last commit: over 5 years ago
Linked from 1 awesome list
danish, embedding-evaluation, manual-annotations, semantic-similarity
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A dataset of Danish reviews scraped from the internet to train sentiment classification models | 2 |
| | This repository provides code and data for a named entity recognition system for the Danish language, including tools for lexical normalization. | 5 |
| | A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919 |
| | An open-source collection of Danish language models for natural language processing tasks | 30 |
| | Extracts, prepares and publishes football dataset from Transfermarkt website | 249 |
| | Library for comparing sequences of characters with various distance metrics. | 117 |
| | A comprehensive lexicon of Danish words with sentiment polarity annotations | 8 |
| | Calculates similarity between pieces of text using TF-IDF weights | 115 |
| | A dataset of annotated sentences for training and evaluating sentiment analysis models in the Hungarian language. | 1 |
| | Facilities to calculate the distance and similarity between strings using various algorithms | 61 |
| | This project provides annotated data and guidelines for fine-grained sentiment analysis on Danish social media comments. | 7 |
| | A collection of pre-processed machine learning datasets for use with the Torch7 deep learning framework. | 37 |
| | Provides methods for evaluating word embeddings on various benchmarks | 437 |
| | Provides efficient algorithms to calculate string similarity metrics | 22 |
| | Training data for a handwritten recognition system | 21 |