anonymizer_hu
Named Entity Anonymizer
Anonymizes text data by masking or replacing named entities using machine learning models.
The Hungarian anonymization tool for CURLICAT
1 stars
3 watching
0 forks
Language: Python
last commit: about 3 years ago
Linked from 1 awesome list
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Tool to anonymize databases by replacing all data with anonymized random data | 358 |
| | A system that anonymizes GitHub repositories to support double-anonymous peer-review and open-science research. | 1,509 |
| | A PyTorch implementation of named entity recognition using the Google AI's pre-trained BERT model for Chinese text data. | 442 |
| | A run-anywhere, dependency-less database anonymizer that protects sensitive data by replacing personal identifiers with fake values. | 58 |
| | An Elixir library for anonymizing data in Ecto models by applying custom masking or redaction rules to sensitive fields. | 89 |
| | A collection of 9,076 annotated sentences in Hungarian to evaluate linguistic acceptability and grammaticality | 1 |
| | Generates anonymized data dumps from various databases. | 524 |
| | A dataset and toolset for Hungarian anaphora resolution in natural language inference tasks | 0 |
| | A dataset of Hungarian translations of human-language examples to test anaphora resolution algorithms | 1 |
| | A large annotated dataset of Hungarian text with over 30 entity types derived from various sources and formats. | 1 |
| | A system for efficiently training and deploying neural networks for named entity recognition tasks with context-dependent representations. | 147 |
| | A plugin to anonymize person-related information in a Shopware production database | 9 |
| | Harmonized tagset and annotation scheme for Hungarian morphological analysers | 4 |
| | This code implements a PyTorch-based makeup transfer system using semantic-aware correspondence | 34 |
| | A tool for annotating named entities in text data using Spacy, allowing users to train and fine-tune the model on their own custom datasets. | 182 |