bulk_extractor
Data extractor
Extracts structured information from digital data without parsing file systems
This is the development tree. Production downloads are at:
1k stars
76 watching
187 forks
Language: C++
last commit: 7 months ago
Linked from 4 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
eyurtsev/kor | Extracts structured data from unstructured text using large language models | 1,629 |
sblom/regextract | A tool that enables easy and efficient data extraction from text using regular expressions in C#. | 697 |
cmu-sei/cyobstract | Extracts structured cyber information from incident reports. | 78 |
bromiumlabs/packerattacker | An application designed to detect and extract hidden code from malicious Windows executables. | 268 |
idea-fasoc/datasheet-scrubber | Automates extraction of key circuit information from PDF datasheets/documents to build a database of commercial off-the-shelf IP. | 51 |
suse/clang-extract | A tool to extract code content from source files using the clang and LLVM infrastructure. | 14 |
siguza/imobax | Extracts and processes iOS mobile backups | 182 |
gskril/farcaster-indexer | An indexer tool for extracting data from the Farcaster protocol and storing it in a Postgres database | 149 |
51j0/android-storage-extractor | A tool to extract local data storage of an Android application in one click. | 16 |
nissl-lab/toxy | A .NET framework for extracting text from various document formats across multiple platforms. | 359 |
syntax-tree/hast-util-to-text | Utility function to extract plain text from HTML-like data structures | 19 |
anssi-fr/bits_parser | Extracts and stores BITS job data from QMGR queues as CSV records. | 74 |
fox-it/dissect.target | Provides a programming API and command line tools to access various data sources inside disk images or file collections. | 44 |
egorbo/simdjsonsharp | Library for fast JSON parsing and minification using SIMD instructions | 646 |
recrm/archivetools | A collection of tools for extracting and analyzing data from web archives | 69 |