TCP-ECCO-texts

Text converter

Making TCP transcribed ECCO documents searchable and accessible through automated optical recognition and text processing

Document level full-text of TCP transcribed ECCO docs (2188)

GitHub

3 stars
5 watching
3 forks
last commit: almost 9 years ago

Related projects:

Repository Description Stars
chreul/ocr_testdata_earlyprintedbooks Provides test data and models for training Optical Character Recognition (OCR) systems on historical printed books. 10
cneud/ocr-conversion A collection of scripts and stylesheets for converting data between different OCR formats. 71
tianzhi0549/ctpn Detects text in images using a neural network architecture 1,283
cpitclaudel/alectryon A tool for processing Coq and Lean 4 code embedded in text documents 237
token-economy-book/englishoriginal An English-language adaptation of a book about token economies and their role in Web3 164
cisocrgroup/resources Resources and data for developing a language-aware OCR document error profiler and PoCoTo tools. 15
wcz-txp/unicode-url-for-textpattern Automatically converts non-ASCII characters in text links to UTF-8 URLs for improved SEO and readability 4
cppcon/cppcon2022 Repository of presentation materials and code from CppCon 2022 528
yesco/esp-lisp A small, fast Lisp interpreter for the ESP8266 microcontroller 257
av1ctor/libsecp256k1.mo A Motoko port of the widely-used secp256k1 cryptographic library 1
openphilology/tei-ocr Customizes TEI XML for metadata from OCR processes to capture detailed layout and content information 1
cppcon/cppcon2023 A collection of presentation materials and code from CppCon 2023 290
bradymholt/cron-expression-descriptor Converts cron expressions into human-readable descriptions 1,014
goodsign/icu Provides a Cgo binding to detect and convert text encoding in a Unicode-based C library 21
yvescoding/rcpress A React-based documentation generator that provides static site and server-side rendering capabilities. 193