squeeze
Extractor
A tool to extract relevant information from text
Extract rich information from any text (urls, todos, etc)
17 stars
2 watching
0 forks
Language: Rust
last commit: almost 4 years ago
Linked from 1 awesome list
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A Django package that enhances Wagtail's document search with text extraction capabilities using Tesseract and Textract libraries. | 33 |
| | Extracts binary relationships from English sentences at scale | 543 |
| | Extracts information about referring search engines from HTTP requests. | 17 |
| | A library for extracting metadata and content from URLs | 635 |
| | Tools for extracting translatable strings from source code written in template languages. | 77 |
| | A Ruby port of a readability tool that extracts primary content from web pages. | 927 |
| | A CoffeeScript library for extracting text from PDF files and creating searchable documents with OCR capabilities | 28 |
| | Automates the extraction of indicators of compromise from text-based reports | 31 |
| | A tool designed to safely capture off-network web resources for network defense and security analysis | 38 |
| | A Rust library implementing a keyword extraction algorithm to automatically identify relevant words in text | 33 |
| | A tool for extracting structured data from web resources using information-retrieval techniques. | 328 |
| | A collection of tools for extracting and analyzing data from web archives | 71 |
| | An automated OCR tool using computer vision for image text extraction | 160 |
| | A framework for extracting information from unannotated text using large language models | 795 |
| | A lightweight, CSS-based selector for extracting structured data from HTML documents. | 273 |