PoCoTo
Text correction tool
A Java-based tool for correcting errors in OCR'd historical documents
The CIS OCR PostCorrectionTool
40 stars
5 watching
4 forks
Language: Java
last commit: over 2 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| Resources and data for developing a language-aware OCR document error profiler and PoCoTo tools. | 15 |
| A C++ library that uses machine learning to restore diacritics in Hungarian text | 15 |
| A tool for processing Coq and Lean 4 code embedded in text documents | 237 |
| A Python library that adds punctuation and capitalization to text without punctuation. | 23 |
| A tool for comparing two text files to evaluate the accuracy of OCR engines. | 67 |
| A software package for concordance analysis in R | 9 |
| A Clojure wrapper for the Tesseract OCR software, allowing developers to easily integrate optical character recognition capabilities into their applications. | 54 |
| A collection of edit distance algorithms and similarity measures for text sequences | 16 |
| A tool for parsing ASCII art word solutions from Advent of Code puzzles | 5 |
| A package to calculate various parameters of the carbonate system in seawater | 8 |
| PyTorch implementation of video captioning, combining deep learning and computer vision techniques. | 402 |
| A Ruby library implementing an alignment algorithm for corpus linguistics | 1 |
| A tool to find specific lines or patterns in codebases by searching for matching contexts of contiguous lines. | 381 |
| Enables automatic generation of descriptive text from images and videos based on user input. | 457 |
| A collection of tools for working with text in Japanese language | 312 |