neural-punctuator

Punctuation restorer

An implementation of BERT-based models for automatic punctuation restoration in English and Hungarian texts.

Complementary code for our paper "Automatic punctuation restoration with BERT models".
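Punctuation restoration with BERT-style models is commonly framed as per-word labeling: the model predicts, for each word, which punctuation mark (if any) follows it. The sketch below shows only the final step, turning such labels back into punctuated text. The label names and the `restore` function are hypothetical and chosen for illustration; they are not taken from this repository, and the labels here are written by hand rather than predicted by a model.

```python
# Hypothetical label set (an assumption, not the repository's actual labels).
PUNCT = {"O": "", "COMMA": ",", "PERIOD": ".", "QUESTION": "?"}


def restore(words, labels):
    """Append the punctuation mark named by each label to its word."""
    if len(words) != len(labels):
        raise ValueError("words and labels must have the same length")
    return " ".join(word + PUNCT[label] for word, label in zip(words, labels))


# Hand-written labels standing in for model predictions.
words = "how are you i am fine thanks".split()
labels = ["O", "O", "QUESTION", "O", "O", "COMMA", "PERIOD"]
print(restore(words, labels))
```

In a real pipeline the labels would come from a token-classification head, and subword predictions would first need to be aligned back to whole words.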


48 stars
4 watching
7 forks
Language: Jupyter Notebook
Last commit: about 1 year ago
Linked from 1 awesome list

bert, punctuation-restoration, transformer


Related projects:

Repository | Description | Stars
geyang/deep-auto-punctuation | This project trains an AI model to automatically punctuate text by reading and learning from a specific dataset of English articles. | 141
alvenirai/punctfix | A Python library that adds punctuation and capitalization to text without punctuation. | 22
juditacs/hunaccent | A C++ library that uses machine learning to restore diacritics in Hungarian text. | 15
aielte-research/diacritics_restoration | A Python-based framework for training and evaluating diacritics restoration models using lightweight 1D convolutional neural networks. | 4
deeppavlov/slavic-bert-ner | A shared BERT model for NER tasks in Slavic languages, pre-trained on Bulgarian, Czech, Polish, and Russian texts. | 73
zhendongwang6/uformer | An implementation of a deep learning model for restoring images in various conditions. | 806
swz30/restormer | Proposes an efficient neural architecture model for high-resolution image restoration tasks. | 1,805
tonianelope/multilingual-bert | Investigating multilingual language models for Named Entity Recognition in German and English. | 14
allegro/herbert | A BERT-based language model pre-trained on Polish corpora for understanding Polish language. | 65
dbmdz/berts | Provides pre-trained language models for natural language processing tasks. | 155
laurentmazare/ocaml-bert | Implementing BERT-like NLP models in OCaml using PyTorch bindings and pre-trained weights from popular sources. | 23
yxuansu/tacl | Improves pre-trained language models by encouraging an isotropic and discriminative distribution of token representations. | 92
algolzw/daclip-uir | This project controls vision-language models to restore degraded images in various environments and conditions. | 668
sinovation/zen | A pre-trained BERT-based Chinese text encoder with enhanced N-gram representations. | 643
psharanda/atributika | A library that converts HTML-like text into NSAttributedString with various styles and tags. | 1,450