spanlp
Profanity detector library
A Python library to detect and filter profanity in Spanish text, suitable for various applications such as censorship, content moderation, and data cleaning.
spanlp: NLP applied to Spanish vulgarity. A fast, robust Python library that checks Spanish strings for profanity or offensive language. Its word list covers rude words from across Spanish-speaking countries.
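To make the idea concrete, here is a minimal sketch of word-list profanity detection and masking using only the Python standard library. It does not use spanlp and is not its API: the `BLOCKLIST` contents and the `normalize`, `contains_profanity`, and `censor` names are hypothetical and chosen purely for illustration.

```python
import re
import unicodedata

# Hypothetical, deliberately mild word list; a real library ships a far larger one.
BLOCKLIST = {"tonto", "idiota", "imbecil"}


def normalize(word: str) -> str:
    """Lowercase and strip combining accents so 'Imbécil' matches 'imbecil'.

    Caveat: this also folds 'ñ' into 'n', which a real Spanish filter
    would probably want to avoid.
    """
    decomposed = unicodedata.normalize("NFD", word.lower())
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")


def contains_profanity(text: str) -> bool:
    """Return True if any word in the text is in the blocklist."""
    return any(normalize(w) in BLOCKLIST for w in re.findall(r"\w+", text))


def censor(text: str, mask: str = "*") -> str:
    """Replace each blocklisted word with mask characters of the same length."""
    def replace(match: re.Match) -> str:
        word = match.group(0)
        return mask * len(word) if normalize(word) in BLOCKLIST else word

    return re.sub(r"\w+", replace, text)
```

An exact-match list like this misses misspellings and obfuscated variants. The project's `distance-metric` tag suggests it handles those with string-similarity measures, which this sketch leaves out.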
36 stars
2 watching
7 forks
Language: Python
Last commit: 5 months ago
Linked from 1 awesome list
distance-metric, nlp, nlp-library, profanity-library, science-research, scientific-computing
Related projects:
Repository | Description | Stars |
---|---|---|
facebookresearch/spanbert | This repository provides pre-trained models and code for using SpanBERT in various natural language processing tasks. | 891 |
gugarosa/nalp | A Python library for natural language processing with adversarial learning capabilities | 23 |
pemistahl/lingua | An accurate language detection library for Java and the JVM suitable for both short and long text inputs. | 707 |
proycon/pynlpl | A Python library for natural language processing tasks, including text manipulation and analysis. | 479 |
hashwin/scylla | A Ruby-based language detection tool that uses N-Gram based text categorization to identify the language of given text. | 36 |
cslu-nlp/detectormorse | A tool for automatically detecting sentence boundaries in natural language text using machine learning and handcrafted features. | 90 |
spandanb/learndb-py | This is an open-source relational database management system implemented from scratch in Python. | 1,287 |
sergey-tihon/stanford.nlp.net | A .NET wrapper around Stanford NLP's Java API, providing access to natural language processing capabilities. | 598 |
sedthh/lara-hungarian-nlp | A lightweight Python library for natural language processing in Hungarian | 29 |
pld-linux/aspell-gl | A Galician language dictionary for use in spell-checking software | 1 |
vseloved/wiki-lang-detect | Uses Wikipedia data to identify the language of unstructured text | 31 |
internetlab-br/twitter-bots | An analysis tool to identify Twitter bots following presidential candidates. | 48 |
pemistahl/lingua-go | A library that accurately detects the language of short to long text inputs without requiring external APIs or configuration. | 1,190 |
cidles/pyannotation | A Python library to access and manipulate linguistically annotated corpus files in various formats. | 16 |
outcastofmusic/quick-nlp | A Python NLP library for training and running sequence-to-sequence models similar to the fast.ai library. | 283 |