indonlu

NLP toolkit

A comprehensive collection of natural language understanding resources and pre-trained models for Indonesian language.

The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)

GitHub

564 stars
18 watching
196 forks
Language: Jupyter Notebook
last commit: 2 months ago
Linked from 1 awesome list

aaclbahasabenchmarkbertdatasetsindo4bindobertindobert-liteindobert-modelsindonesianindonlpindonlunlpnlu

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
kangfend/bahasa A natural language processing toolkit for the Indonesian language. 19
kata-ai/indosum Provides a benchmark dataset and tools for training text summarization models in the Indonesian language. 77
anoopkunchukuttan/indic_nlp_library A Python-based library providing common text processing and Natural Language Processing tools for Indian languages 561
louisowen6/nlp_bahasa_resources A curated collection of NLP datasets and resources for Bahasa Indonesia 496
apache/opennlp-models Provides pre-trained binary models for natural language text processing across multiple languages 4
apache/opennlp-sandbox A Java-based toolkit for natural language text processing tasks 42
goru001/inltk A comprehensive toolkit for Natural Language Processing tasks in Indic languages, providing pre-trained models and datasets. 825
utcompling/texnlp Tools and libraries for natural language processing using Hidden Markov Models and Maximum Entropy Markov Models 14
apache/opennlp Provides a toolkit for natural language text processing tasks using machine learning algorithms in Java. 1,449
sastrawi/nlp-bahasa-indonesia A collection of NLP papers and resources for Bahasa Indonesia, including tools and software for text processing tasks such as summarization, parsing, part-of-speech tagging, stemming, and word sense disambiguation. 186
emorynlp/nlp4j Provides tools and APIs for text processing and analysis on Java-based platforms. 148
nnlp-il/hebrew-resources A comprehensive collection of Hebrew NLP resources and tools. 255
balavenkatesh3322/nlp-pretrained-model A collection of pre-trained natural language processing models 170
jd-aig/nlp_baai A collection of natural language processing models and tools for collaboration on a joint project between BAAI and JDAI. 254