arabic-stop-words

Stop words list

A collection of pre-identified words to be excluded from text analysis in Arabic language processing.

Largest list of Arabic stop words on GitHub.
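As a rough illustration of what "excluded from text analysis" means in practice, the sketch below filters stop words out of a whitespace-tokenized Arabic sentence. The six-word sample set and the helper names `load_stop_words` and `remove_stop_words` are assumptions made for this example; they are not taken from the repository, whose file names and layout are not described here.

```python
# Hypothetical sample for illustration only; a real list would be far larger
# and would typically be read from a text file with one word per line.
SAMPLE_STOP_WORDS = {"في", "من", "على", "إلى", "عن", "هذا"}


def load_stop_words(lines):
    """Build a set of stop words from an iterable of lines, skipping blanks."""
    return {line.strip() for line in lines if line.strip()}


def remove_stop_words(text, stop_words):
    """Return the tokens of `text` that are not in `stop_words`.

    Tokenization is a naive whitespace split; punctuation and diacritics
    are not handled and would need normalization in real use.
    """
    return [token for token in text.split() if token not in stop_words]


if __name__ == "__main__":
    sentence = "ذهب الولد إلى المدرسة في الصباح"
    print(remove_stop_words(sentence, SAMPLE_STOP_WORDS))
```

Storing the list as a set keeps each membership check constant-time, which matters once the list grows to thousands of entries.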


308 stars
12 watching
148 forks
last commit: 8 months ago
Linked from 2 awesome lists

Topics: arabic-language, arabic-nlp, stopwords


Related projects:

Repository | Description | Stars
6/stopwords-json | A collection of stopword lists in JSON format for various languages | 424
brez/stopwords | A collection of pre-defined words commonly ignored in text processing | 12
stopwords-iso/stopwords-gl | A collection of Galician stopwords in JSON and text formats for use in natural language processing applications | 1
mohamedadaly/labr | A dataset of Arabic book reviews for natural language processing tasks | 44
mznmel/aln9 | A markup language designed to simplify the formatting of Arabic text in HTML documents | 11
brenes/stopwords-filter | A Ruby library that removes common words from text before processing | 77
alexrutherford/arabic_nlp | Tools for normalizing and deriving sentiment from Arabic text | 26
mpcabd/python-arabic-reshaper | Reconstructs Arabic text to be used in applications with limited support | 406
othmanela/nlp_arabic | Provides tools for Natural Language Processing in Arabic | 11
semanticfrontiers/arabicnlp | A collection of Python scripts and utilities for processing Arabic text | 55
lantip/baku-tidak-baku | A repository of linguistic data for Indonesian words categorized as either standard or non-standard | 29
01walid/goarabic | A package of Go functions to process and manipulate Arabic text | 108
galuhsahid/indonesian-word-embedding | Demonstrates word embedding in Indonesian language using pre-trained Word2vec models | 20
mojtaba-khallash/nhazm | A C# library for digesting Persian text using natural language processing techniques | 38
bluemix/numbertoarabicwords | Converts Arabic numbers to written words in a human-readable format | 53