spanlp
Profanity detector library
A Python library to detect and filter profanity in Spanish text, suitable for various applications such as censorship, content moderation, and data cleaning.
spanlp: NLP applied to Spanish vulgarity. A fast, robust Python library for checking Spanish strings for profanity or offensive language. Its word list aims to cover the rude words used across Spanish-speaking countries.
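The detect-and-filter behaviour described above can be illustrated with a minimal word-list sketch. This is not spanlp's actual API: the names `BLOCKLIST`, `contains_profanity`, and `censor` are hypothetical, the word list is a deliberately mild placeholder, and matching is exact per token rather than whatever method the library itself uses.

```python
import re

# Hypothetical, deliberately mild placeholder list; a real list would be
# far larger and cover regional variants.
BLOCKLIST = {"tonto", "idiota", "bobo"}

# \w+ matches Unicode word characters in Python 3, so accented Spanish
# words are kept as single tokens.
_WORD = re.compile(r"\w+")


def contains_profanity(text: str) -> bool:
    """Return True if any token of `text` (case-insensitive) is blocklisted."""
    return any(m.group().lower() in BLOCKLIST for m in _WORD.finditer(text))


def censor(text: str, mask: str = "*") -> str:
    """Replace each blocklisted token with `mask` repeated to the token's length."""
    def _mask(match: re.Match) -> str:
        word = match.group()
        return mask * len(word) if word.lower() in BLOCKLIST else word

    return _WORD.sub(_mask, text)


if __name__ == "__main__":
    print(contains_profanity("Eres un tonto"))
    print(censor("No seas Idiota, hombre"))
```

Exact token matching like this misses misspellings and obfuscated forms; handling those would need fuzzy comparison, which is outside this sketch.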
36 stars
2 watching
7 forks
Language: Python
last commit: 8 months ago
Linked from 1 awesome list
Tags: distance-metric, nlp, nlp-library, profanity-library, science-research, scientific-computing
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | This repository provides pre-trained models and code for using SpanBERT in various natural language processing tasks. | 893 |
| | A Python library for natural language processing with adversarial learning capabilities. | 23 |
| | An accurate language detection library for Java and the JVM suitable for both short and long text inputs. | 716 |
| | A Python library for natural language processing tasks, including text manipulation and analysis. | 479 |
| | A Ruby-based language detection tool that uses N-gram based text categorization to identify the language of given text. | 36 |
| | A tool for automatically detecting sentence boundaries in natural language text using machine learning and handcrafted features. | 90 |
| | An open-source relational database management system implemented from scratch in Python. | 1,290 |
| | A .NET wrapper around Stanford NLP's Java API, providing access to natural language processing capabilities. | 598 |
| | A lightweight Python library for natural language processing in Hungarian. | 29 |
| | A Galician language dictionary for use in spell-checking software. | 1 |
| | Uses Wikipedia data to identify the language of unstructured text. | 31 |
| | An analysis tool to identify Twitter bots following presidential candidates. | 48 |
| | A library that accurately detects the language of short to long text inputs without requiring external APIs or configuration. | 1,192 |
| | A Python library to access and manipulate linguistically annotated corpus files in various formats. | 16 |
| | A Python NLP library for training and running sequence-to-sequence models similar to the fast.ai library. | 283 |