datasheet-scrubber

Datasheet extractor

Automates extraction of key circuit information from PDF datasheets/documents to build a database of commercial off-the-shelf IP.

GitHub

51 stars
4 watching
9 forks
Language: Python
Last commit: 6 months ago
Linked from 1 awesome list


Related projects:

Repository | Description | Stars
robotips/uconfig | Automates pinout extraction and schematic creation from PDF datasheets | 521
unkl4b/gitminer | Automated tool for gathering code information from GitHub repositories | 2,093
philips-labs/tabia | Analyzes codebases to extract characteristics and provides insights on their properties | 12
nlgranger/seqtools | A Python library to manipulate and transform indexable data | 49
msamogh/nonechucks | Library that provides dynamic data cleaning and filtering capabilities for PyTorch datasets and samplers | 378
karlicoss/kobuddy | Extracts data from Kobo eReader databases for analysis and backup | 152
simsong/bulk_extractor | Extracts structured information from digital data without parsing file systems | 1,129
cgarciae/phi | A Python library for functional programming that provides a unified API and operator overloading for common data transformations and operations | 134
python-bonobo/bonobo | A Python framework for parallelizing data transformations and processing | 1,589
databricks/lilac | A tool to improve data quality and efficiency for large language models | 987
aymericbeaumet/squeeze | A tool to extract relevant information from text | 17
pycqa/autoflake | A tool to automatically remove unused imports and variables from Python code based on pyflakes analysis | 908
geeks-of-data/knowledge-gpt | Extracts and stores information from various sources using AI models to generate answers | 283
eyurtsev/kor | An open-source wrapper around LLMs to extract structured data from text | 1,638
recrm/archivetools | A collection of tools for extracting and analyzing data from web archives | 71