convertextract

Text processor

A library that provides text extraction and replacement functionality based on arbitrary correspondences.

Extract and find/replace text based on arbitrary correspondences while preserving original file formatting. This library is a fork from the Textract library by Dean Malmgren.

GitHub

11 stars
4 watching
3 forks
Language: HTML
last commit: about 1 year ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ezrosent/frawk A small programming language for processing textual data with improved performance compared to AWK. 1,256
psharanda/atributika A library that converts HTML-like text into NSAttributedString with various styles and tags 1,450
senselogic/pendown A text-to-HTML conversion tool with integrated styling and tag customization 49
juliasilge/tidytext Provides tools and data to convert text into tidy data formats for natural language processing tasks 1,180
gagolews/stringi A package providing a fast and portable way to process character strings with Unicode support 304
talyssonoc/commonregexruby Extracts common information from text strings in various formats 79
jtalbot/riposte A fast interpreter and JIT for the R programming language 90
emorynlp/nlp4j Provides tools and APIs for text processing and analysis on Java-based platforms. 148
robhz786/strf A C++ library for formatting and transcoding text 70
geemus/formatador A library for formatting text with various options and capabilities for displaying tables, progress bars, and other formatted output. 451
farism/rgbeef A color manipulation and conversion library 2
zix99/rare A tool that provides fast and efficient text analysis and visualization capabilities 274
tin2tin/trim-whitespace-change-case-and-split-join-lines A tool to process text in the Blender Text Editor by removing whitespace, changing case, and splitting/joining lines. 5
neukg/techgpt A generative transformer model designed to process and generate text in various vertical domains, including computer science, finance, and more. 212
turbopape/postagga A Clojure-based natural language processing library for parsing and structuring text input into meaningful data. 159