PoCoTo

Text correction tool

A Java-based tool for correcting errors in OCR'd historical documents

The CIS OCR PostCorrectionTool

GitHub

40 stars
5 watching
4 forks
Language: Java
last commit: about 2 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
cisocrgroup/resources Resources and data for developing a language-aware OCR document error profiler and PoCoTo tools. 15
juditacs/hunaccent A C++ library that uses machine learning to restore diacritics in Hungarian text 15
cpitclaudel/alectryon A tool for processing Coq and Lean 4 code embedded in text documents 237
alvenirai/punctfix A Python library that adds punctuation and capitalization to text without punctuation. 23
impactcentre/ocrevaluation A tool for comparing two text files to evaluate the accuracy of OCR engines. 67
aslez/concor A software package for concordance analysis in R 9
antoniogarrote/clj-tesseract A Clojure wrapper for the Tesseract OCR software, allowing developers to easily integrate optical character recognition capabilities into their applications. 54
tcrouch/edits.cr A collection of edit distance algorithms and similarity measures for text sequences 16
mstksg/advent-of-code-ocr A tool for parsing ASCII art word solutions from Advent of Code puzzles 5
jpgattuso/seacarb-git A package to calculate various parameters of the carbonate system in seawater 8
xiadingz/video-caption.pytorch PyTorch implementation of video captioning, combining deep learning and computer vision techniques. 401
povilasjurcys/alignment A Ruby library implementing an alignment algorithm for corpus linguistics 1
pyjarrett/septum A tool to search and filter code text based on contextual lines 380
vision-cair/chatcaptioner Enables automatic generation of descriptive text from images and videos based on user input. 454
tshatrov/ichiran A collection of tools for working with text in Japanese language 311