varaha

NLP tools

A set of Apache Pig scripts and UDFs for machine learning and natural language processing

Machine learning and natural language processing with Apache Pig


53 stars
9 watching
15 forks
Language: Java
Last commit: almost 11 years ago
Linked from 1 awesome list



Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| sandeep42/anuvada | This is an open source PyTorch library providing tools and models to explain the predictions of deep neural networks for natural language processing tasks. | 19 |
| apache/pig | Enables data processing and transformation in large files using a high-level language with compile-time optimizations for efficient execution on distributed computing frameworks. | 681 |
| languagemachines/frog | An integration of memory-based natural language processing modules for Dutch | 75 |
| sedthh/lara-hungarian-nlp | A lightweight Python library for natural language processing in Hungarian | 29 |
| robinhad/kruk | A collection of Ukrainian language models and datasets for natural language processing tasks. | 84 |
| anoopkunchukuttan/indic_nlp_library | A Python-based library providing common text processing and Natural Language Processing tools for Indian languages | 556 |
| kangfend/bahasa | A natural language processing toolkit for the Indonesian language. | 19 |
| apache/opennlp-sandbox | A Java-based toolkit for natural language text processing tasks | 42 |
| johanwk/elot | Tools and functions to help create, manage, and query ontologies in a readable and machine-readable format. | 8 |
| dayyass/dayyass | A collection of libraries and tools for natural language processing and reinforcement learning. | 39 |
| hck/open_nlp | A Ruby wrapper around Apache OpenNLP's natural language processing tools | 11 |
| apache/opennlp-models | Distributes pre-trained models for natural language text processing tasks in various languages | 4 |
| jd-aig/nlp_baai | A collection of natural language processing models and tools for collaboration on a joint project between BAAI and JDAI. | 252 |
| proycon/python-frog | A Python binding to a C++ NLP tool for Dutch language processing tasks | 47 |
| goru001/inltk | A comprehensive toolkit for Natural Language Processing tasks in Indic languages, providing pre-trained models and datasets. | 822 |