PoCoTo
Text correction tool
A Java-based tool for correcting errors in OCR'd historical documents
The CIS OCR PostCorrectionTool
40 stars
5 watching
4 forks
Language: Java
last commit: about 2 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
cisocrgroup/resources | Resources and data for developing a language-aware OCR document error profiler and PoCoTo tools. | 15 |
juditacs/hunaccent | A C++ library that uses machine learning to restore diacritics in Hungarian text | 15 |
cpitclaudel/alectryon | A tool for processing Coq and Lean 4 code embedded in text documents | 237 |
alvenirai/punctfix | A Python library that adds punctuation and capitalization to text without punctuation. | 23 |
impactcentre/ocrevaluation | A tool for comparing two text files to evaluate the accuracy of OCR engines. | 67 |
aslez/concor | A software package for concordance analysis in R | 9 |
antoniogarrote/clj-tesseract | A Clojure wrapper for the Tesseract OCR software, allowing developers to easily integrate optical character recognition capabilities into their applications. | 54 |
tcrouch/edits.cr | A collection of edit distance algorithms and similarity measures for text sequences | 16 |
mstksg/advent-of-code-ocr | A tool for parsing ASCII art word solutions from Advent of Code puzzles | 5 |
jpgattuso/seacarb-git | A package to calculate various parameters of the carbonate system in seawater | 8 |
xiadingz/video-caption.pytorch | PyTorch implementation of video captioning, combining deep learning and computer vision techniques. | 401 |
povilasjurcys/alignment | A Ruby library implementing an alignment algorithm for corpus linguistics | 1 |
pyjarrett/septum | A tool to search and filter code text based on contextual lines | 380 |
vision-cair/chatcaptioner | Enables automatic generation of descriptive text from images and videos based on user input. | 454 |
tshatrov/ichiran | A collection of tools for working with text in Japanese language | 311 |