convertextract
Text processor
A library that provides text extraction and replacement functionality based on arbitrary correspondences.
Extract and find/replace text based on arbitrary correspondences while preserving original file formatting. This library is a fork from the Textract library by Dean Malmgren.
11 stars
4 watching
3 forks
Language: HTML
last commit: about 1 year ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
ezrosent/frawk | A small programming language for processing textual data with improved performance compared to AWK. | 1,256 |
psharanda/atributika | A library that converts HTML-like text into NSAttributedString with various styles and tags | 1,450 |
senselogic/pendown | A text-to-HTML conversion tool with integrated styling and tag customization | 49 |
juliasilge/tidytext | Provides tools and data to convert text into tidy data formats for natural language processing tasks | 1,180 |
gagolews/stringi | A package providing a fast and portable way to process character strings with Unicode support | 304 |
talyssonoc/commonregexruby | Extracts common information from text strings in various formats | 79 |
jtalbot/riposte | A fast interpreter and JIT for the R programming language | 90 |
emorynlp/nlp4j | Provides tools and APIs for text processing and analysis on Java-based platforms. | 148 |
robhz786/strf | A C++ library for formatting and transcoding text | 70 |
geemus/formatador | A library for formatting text with various options and capabilities for displaying tables, progress bars, and other formatted output. | 451 |
farism/rgbeef | A color manipulation and conversion library | 2 |
zix99/rare | A tool that provides fast and efficient text analysis and visualization capabilities | 274 |
tin2tin/trim-whitespace-change-case-and-split-join-lines | A tool to process text in the Blender Text Editor by removing whitespace, changing case, and splitting/joining lines. | 5 |
neukg/techgpt | A generative transformer model designed to process and generate text in various vertical domains, including computer science, finance, and more. | 212 |
turbopape/postagga | A Clojure-based natural language processing library for parsing and structuring text input into meaningful data. | 159 |