PySastrawi

Indonesian Stemmer

A Python port of an Indonesian stemmer library, reducing inflected words to their base form.

Indonesian stemmer. Python port of PHP Sastrawi project.

GitHub

336 stars
16 watching
111 forks
Language: Python
last commit: over 3 years ago
Linked from 1 awesome list

nlp-stemmingsastrawi-python

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
sastrawi/sastrawi A PHP library for stemming Indonesian words to their base form 422
meisyal/sastrawi-ruby Ruby bindings for stemming words in Bahasa Indonesia 6
sastrawi/nlp-bahasa-indonesia A collection of NLP papers and resources for Bahasa Indonesia, including tools and software for text processing tasks such as summarization, parsing, part-of-speech tagging, stemming, and word sense disambiguation. 186
nikolamilosevic86/serbianstemmer A stemmer for Serbian language improved from Keselj and Šipla's original implementation. 16
adelra/kurmanji-stemmer A stemmer algorithm for Kurmanji Kurdish, compatible with the NLTK library 1
assem-ch/arabicstemmer An Arabic language stemming algorithm that normalizes and refines text input using a snowball-based framework 145
words/stemmer A stemming algorithm package that converts words into shorter forms to facilitate comparison and analysis of natural language text 129
fredwu/stemmer An English stemming implementation in Elixir 150
dchest/stemmer Provides an interface for implementing various stemming algorithms in the Go programming language. 53
skroutz/turkish_stemmer A Ruby library implementing a stemming algorithm for the Turkish language based on its unique morphological structure and suffix rules. 42
amakukha/stemmers_ukrainian A novel stemmer for the Ukrainian language trained with AI 28
kangfend/bahasa A natural language processing toolkit for the Indonesian language. 19
ejtaal/jsastem A JavaScript implementation of an Arabic stemmer algorithm to normalize and analyze Arabic words. 26
jedp/porter-stemmer An implementation of Martin Porter's stemming algorithm in JavaScript for removing morphological endings from English words. 100
kata-ai/indosum Provides a benchmark dataset and tools for training text summarization models in the Indonesian language. 76