squeeze

Extractor

A tool to extract relevant information from text

Extract rich information from any text (urls, todos, etc)

17 stars

2 watching

0 forks

Language: Rust

last commit: over 4 years ago

Linked from 1 awesome list

Backlinks from these awesome lists:

learn-anything/command-line-tools

Related projects:

Repository	Description	Stars
fourdigits/wagtail_textract	A Django package that enhances Wagtail's document search with text extraction capabilities using Tesseract and Textract libraries.	33
knowitall/reverb	Extracts binary relationships from English sentences at scale	543
deviantech/rack-referrals	Extracts information about referring search engines from HTTP requests.	17
coleifer/micawber	A library for extracting metadata and content from URLs	635
gmarty/xgettext	Tools for extracting translatable strings from source code written in template languages.	77
cantino/ruby-readability	A Ruby port of a readability tool that extracts primary content from web pages.	927
steelthread/mimeograph	A CoffeeScript library for extracting text from PDF files and creating searchable documents with OCR capabilities	28
referefref/aiocrioc	Automates the extraction of indicators of compromise from text-based reports	31
emersonelectricco/boomerang	A tool designed to safely capture off-network web resources for network defense and security analysis	38
yaa110/rake-rs	A Rust library implementing a keyword extraction algorithm to automatically identify relevant words in text	33
anonyfox/elixir-scrape	A tool for extracting structured data from web resources using information-retrieval techniques.	328
recrm/archivetools	A collection of tools for extracting and analyzing data from web archives	71
iseahound/vis2	An automated OCR tool using computer vision for image text extraction	160
cocacola-lab/chatie	A framework for extracting information from unannotated text using large language models	795
feichao93/temme	A lightweight, CSS-based selector for extracting structured data from HTML documents.	273