spanlp

Profanity detector library

A Python library to detect and filter profanity in Spanish text, suitable for various applications such as censorship, content moderation, and data cleaning.

spanlp: nlp applied for spanish vulgarity. A fast, robust Python library to check for profanity or offensive language in Spanish strings. It contains all the rude words of Spanish-speaking countries.

GitHub

36 stars
2 watching
7 forks
Language: Python
last commit: 5 months ago
Linked from 1 awesome list

distance-metricnlpnlp-libraryprofanity-libraryscience-researchscientific-computing

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
facebookresearch/spanbert This repository provides pre-trained models and code for using SpanBERT in various natural language processing tasks. 891
gugarosa/nalp A Python library for natural language processing with adversarial learning capabilities 23
pemistahl/lingua An accurate language detection library for Java and the JVM suitable for both short and long text inputs. 707
proycon/pynlpl A Python library for natural language processing tasks, including text manipulation and analysis. 479
hashwin/scylla A Ruby-based language detection tool that uses N-Gram based text categorization to identify the language of given text. 36
cslu-nlp/detectormorse A tool for automatically detecting sentence boundaries in natural language text using machine learning and handcrafted features. 90
spandanb/learndb-py This is an open-source relational database management system implemented from scratch in Python. 1,287
sergey-tihon/stanford.nlp.net A .NET wrapper around Stanford NLP's Java API, providing access to natural language processing capabilities. 598
sedthh/lara-hungarian-nlp A lightweight Python library for natural language processing in Hungarian 29
pld-linux/aspell-gl A Galician language dictionary for use in spell-checking software 1
vseloved/wiki-lang-detect Uses Wikipedia data to identify the language of unstructured text 31
internetlab-br/twitter-bots An analysis tool to identify Twitter bots following presidential candidates. 48
pemistahl/lingua-go A library that accurately detects the language of short to long text inputs without requiring external APIs or configuration. 1,190
cidles/pyannotation A Python library to access and manipulate linguistically annotated corpus files in various formats. 16
outcastofmusic/quick-nlp A Python NLP library for training and running sequence-to-sequence models similar to the fast.ai library. 283