indonlu

NLP toolkit

A comprehensive collection of natural language understanding resources and pre-trained models for Indonesian language.

The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)

GitHub

556 stars
18 watching
193 forks
Language: Jupyter Notebook
last commit: 5 days ago
Linked from 1 awesome list

aaclbahasabenchmarkbertdatasetsindo4bindobertindobert-liteindobert-modelsindonesianindonlpindonlunlpnlu

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
kangfend/bahasa A natural language processing toolkit for the Indonesian language. 19
kata-ai/indosum Provides a benchmark dataset and tools for training text summarization models in the Indonesian language. 76
anoopkunchukuttan/indic_nlp_library A Python-based library providing common text processing and Natural Language Processing tools for Indian languages 556
louisowen6/nlp_bahasa_resources A curated collection of NLP datasets and resources for Bahasa Indonesia 489
apache/opennlp-models Distributes pre-trained models for natural language text processing tasks in various languages 4
apache/opennlp-sandbox A Java-based toolkit for natural language text processing tasks 42
goru001/inltk A comprehensive toolkit for Natural Language Processing tasks in Indic languages, providing pre-trained models and datasets. 822
hanzhenlei767/nlp_learn A comprehensive collection of NLP-related code snippets and notes on various models and techniques, including pre-trained language models and Chinese text processing methods. 25
utcompling/texnlp Develops tools and algorithms for natural language processing tasks using Hidden Markov Models and Maximum Entropy Markov Models. 14
apache/opennlp A machine learning-based toolkit for text processing and analysis 1,447
sastrawi/nlp-bahasa-indonesia A collection of NLP papers and resources for Bahasa Indonesia, including tools and software for text processing tasks such as summarization, parsing, part-of-speech tagging, stemming, and word sense disambiguation. 186
emorynlp/nlp4j Provides tools and APIs for text processing and analysis on Java-based platforms. 148
nnlp-il/hebrew-resources A comprehensive collection of Hebrew NLP resources and tools. 249
balavenkatesh3322/nlp-pretrained-model A collection of pre-trained natural language processing models 170
jd-aig/nlp_baai A collection of natural language processing models and tools for collaboration on a joint project between BAAI and JDAI. 252