arabic-stop-words
Stop words list
A collection of pre-identified words to be excluded from text analysis in Arabic language processing.
Largest list of Arabic stop words on Github. أكبر قائمة لمستبعدات الفهرسة العربية على جيت هاب
308 stars
12 watching
148 forks
last commit: 8 months ago
Linked from 2 awesome lists
arabic-languagearabic-nlpstopwords
Related projects:
Repository | Description | Stars |
---|---|---|
6/stopwords-json | A collection of stopword lists in JSON format for various languages. | 424 |
brez/stopwords | A collection of pre-defined words commonly ignored in text processing | 12 |
stopwords-iso/stopwords-gl | A collection of Galician stopwords in JSON and text formats for use in natural language processing applications. | 1 |
mohamedadaly/labr | A dataset of Arabic book reviews for natural language processing tasks | 44 |
mznmel/aln9 | A markup language designed to simplify the formatting of Arabic text in HTML documents. | 11 |
brenes/stopwords-filter | A Ruby library that removes common words from text before processing | 77 |
alexrutherford/arabic_nlp | Tools for normalizing and deriving sentiment from Arabic text | 26 |
mpcabd/python-arabic-reshaper | Reconstructs Arabic text to be used in applications with limited support | 406 |
othmanela/nlp_arabic | Provides tools for Natural Language Processing in Arabic | 11 |
semanticfrontiers/arabicnlp | A collection of Python scripts and utilities for processing Arabic text | 55 |
lantip/baku-tidak-baku | A repository of linguistic data for Indonesian words categorized as either standard or non-standard | 29 |
01walid/goarabic | A package of Go functions to process and manipulate Arabic text | 108 |
galuhsahid/indonesian-word-embedding | Demonstrates word embedding in Indonesian language using pre-trained Word2vec models | 20 |
mojtaba-khallash/nhazm | A C# library for digesting Persian text using natural language processing techniques. | 38 |
bluemix/numbertoarabicwords | Converts Arabic numbers to written words in a human-readable format. | 53 |