magyarlanc_spark

Hungarian NLP tool

A Spark-based tool for processing Hungarian text with the magyarlanc language processing toolkit, with optional Elasticsearch integration.

Analyze Hungarian texts with magyarlanc on Apache Spark

GitHub

4 stars
2 watching
0 forks
Language: Kotlin
Last commit: over 7 years ago
Linked from 1 awesome list



Related projects:

Repository | Description | Stars
damesek/eszterlanc | Clojure interface to a Hungarian linguistic processing toolkit | 4
nytud/emtsv | A text processing system for Hungarian language processing tasks, using Python and TSV-based data exchange | 28
kotlin/kotlin-spark-api | Compatibility layer and extensions between Kotlin and Apache Spark for big data processing | 463
sedthh/lara-hungarian-nlp | A lightweight Python library for natural language processing in Hungarian | 29
huspacy/huspacy | An industrial-strength natural language processing library for Hungarian text analysis | 158
nytud/panmorph | Harmonized tagset and annotation scheme for Hungarian morphological analysers | 4
mmihaltz/trendminer-hunlp | A suite of scripts performing NLP processing steps tailored to social media text | 5
mmihaltz/huwn | A WordNet lexicon for the Hungarian language | 11
languagemachines/frog | An integration of memory-based natural language processing modules for Dutch | 75
flint-bot/sparky | A NodeJS API for interacting with the Cisco Spark platform | 16
novakat/nytk-nerkor-cars-ontonotespp | A large annotated dataset of Hungarian text with over 30 entity types, derived from various sources and formats | 1
dmmiller612/sparktorch | A PyTorch implementation on Apache Spark for distributed deep learning model training and inference | 339
rocher/ob-ada-spark | Emacs org-babel support for the Ada and SPARK programming languages: compiling, running, and formally verifying code | 8
nytud/hunlp-gate | A collection of Hungarian NLP tools integrated as GATE processing resources | 8
sparklyr/sparklyr | An R interface to Apache Spark for distributed data analysis and machine learning | 955