bahasa

Indonesian NLP Toolkit

A natural language processing toolkit for the Indonesian language (Bahasa Indonesia).

Stars: 19
Watching: 2
Forks: 10
Language: Python
Last commit: 6 months ago
Linked from 1 awesome list

Topics: bahasa, indonesia, natural-language-processing, nlp, nlp-python, python, sastrawi, stemmer, stemming

Related projects:

| Repository | Description | Stars |
|---|---|---|
| sastrawi/nlp-bahasa-indonesia | A collection of NLP papers and resources for Bahasa Indonesia, including tools and software for text processing tasks such as summarization, parsing, part-of-speech tagging, stemming, and word sense disambiguation. | 186 |
| adobe/nlp-cube | A framework providing a set of natural language processing tasks such as tokenization, part-of-speech tagging, and dependency parsing for multiple languages. | 554 |
| louisowen6/nlp_bahasa_resources | A curated collection of NLP datasets and resources for Bahasa Indonesia. | 489 |
| indonlp/indonlu | A comprehensive collection of natural language understanding resources and pre-trained models for the Indonesian language. | 556 |
| goru001/inltk | A comprehensive toolkit for natural language processing tasks in Indic languages, providing pre-trained models and datasets. | 822 |
| dayyass/dayyass | A collection of libraries and tools for natural language processing and reinforcement learning. | 39 |
| kimtaro/ve | A linguistic framework for natural language processing tasks. | 216 |
| cltk/cltk | A Python library offering natural language processing capabilities for pre-modern languages. | 839 |
| apache/opennlp-sandbox | A Java-based toolkit for natural language text processing tasks. | 42 |
| nnlp-il/hebrew-resources | A comprehensive collection of Hebrew NLP resources and tools. | 249 |
| cleartk/cleartk | A framework for building statistical natural language processing components in Java using Apache UIMA. | 129 |
| othmanela/nlp_arabic | Tools for natural language processing in Arabic. | 11 |
| jd-aig/nlp_baai | A collection of natural language processing models and tools for collaboration on a joint project between BAAI and JDAI. | 252 |
| galuhsahid/indonesian-word-embedding | Demonstrates word embeddings for the Indonesian language using pre-trained Word2vec models. | 20 |
| roshan-research/hazm | A Python library for natural language processing tasks on Persian text, providing tools for text normalization, tokenization, lemmatization, and more. | 1,208 |