PoCoTo
Text correction tool
A Java-based tool for correcting errors in OCR'd historical documents
The CIS OCR PostCorrectionTool
40 stars
5 watching
4 forks
Language: Java
last commit: about 3 years ago
Linked from 1 awesome list
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Resources and data for developing a language-aware OCR document error profiler and PoCoTo tools. | 15 |
| | A C++ library that uses machine learning to restore diacritics in Hungarian text | 15 |
| | A tool for processing Coq and Lean 4 code embedded in text documents | 237 |
| | A Python library that adds punctuation and capitalization to text without punctuation. | 23 |
| | A tool for comparing two text files to evaluate the accuracy of OCR engines. | 67 |
| | A software package for concordance analysis in R | 9 |
| | A Clojure wrapper for the Tesseract OCR software, allowing developers to easily integrate optical character recognition capabilities into their applications. | 54 |
| | A collection of edit distance algorithms and similarity measures for text sequences | 16 |
| | A tool for parsing ASCII art word solutions from Advent of Code puzzles | 5 |
| | A package to calculate various parameters of the carbonate system in seawater | 8 |
| | PyTorch implementation of video captioning, combining deep learning and computer vision techniques. | 402 |
| | A Ruby library implementing an alignment algorithm for corpus linguistics | 1 |
| | A tool to find specific lines or patterns in codebases by searching for matching contexts of contiguous lines. | 381 |
| | Enables automatic generation of descriptive text from images and videos based on user input. | 457 |
| | A collection of tools for working with text in Japanese language | 312 |