pdfplumber
PDF parser
A tool for extracting detailed information from PDFs
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
7k stars
93 watching
687 forks
Language: Python
last commit: 2 months ago
Topics: pdf, pdf-parsing, table-extraction
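The description above mentions per-object detail (chars, rectangles, lines) alongside text and table extraction. A minimal sketch of how that might look in practice follows. It assumes pdfplumber is installed (`pip install pdfplumber`). The helper names `summarize_page` and `summarize_pdf` are hypothetical and not part of the library. Only `pdfplumber.open`, `pdf.pages`, `page.chars`, `page.rects`, `page.lines`, `page.extract_text()`, and `page.extract_tables()` are library calls.

```python
import sys


def summarize_page(page):
    """Collect object counts, text, and tables from one page.

    `page` is expected to behave like a pdfplumber page: it exposes
    `chars`, `rects`, and `lines` as lists, plus `extract_text()` and
    `extract_tables()`.
    """
    return {
        "n_chars": len(page.chars),
        "n_rects": len(page.rects),
        "n_lines": len(page.lines),
        # Fall back to "" in case a page with no text yields None.
        "text": page.extract_text() or "",
        # Each table is a list of rows; each row is a list of cell values.
        "tables": page.extract_tables(),
    }


def summarize_pdf(path):
    """Open the PDF at `path` and summarize every page."""
    import pdfplumber  # third-party dependency, imported lazily

    with pdfplumber.open(path) as pdf:
        return [summarize_page(page) for page in pdf.pages]


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage (hypothetical file name): python summarize.py report.pdf
    for number, info in enumerate(summarize_pdf(sys.argv[1]), start=1):
        print(f"page {number}: {info['n_chars']} chars, "
              f"{info['n_rects']} rects, {info['n_lines']} lines, "
              f"{len(info['tables'])} tables")
```

Keeping the per-page logic in its own function means it can be exercised with any page-like object, so the summary step can be checked without a real PDF on hand.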
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A Python-based tool for extracting information from PDF documents. | 6,046 |
| | A tool that adds OCR text to scanned PDF files, making them searchable and copy-pastable. | 14,363 |
| | A Python library for manipulating and extracting data from PDF files. | 8,524 |
| | A Python library for creating and manipulating PDF documents in a JSON-like data structure. | 3,413 |
| | An application that lets users manipulate PDF documents by merging, splitting, and rearranging pages. | 3,653 |
| | A comprehensive guide to getting started with Python's pandas library using real-world data examples. | 6,697 |
| | A JavaScript library for creating and modifying PDF documents in any environment. | 7,089 |
| | An online guide and set of executable Jupyter notebooks providing an introduction to core libraries for data science in Python. | 43,422 |
| | A general-purpose PDF viewer built with HTML5 that parses and renders Portable Document Format files. | 49,009 |
| | A C# library for extracting and analyzing text from PDF files. | 1,794 |
| | Converts PDF documents to text formats with high accuracy and support for various document types. | 18,618 |
| | An integrated framework for document AI tasks using deep learning models. | 2,628 |
| | A Python library for creating charts with a consistent input data format and intuitive API. | 3,546 |
| | A JavaScript library for generating PDF documents with various features and functionalities. | 9,970 |
| | A tool to extract text from PDFs and add a searchable layer to them. | 279 |