MegaParse

Document parser

A tool for efficiently parsing various document types to prepare data for Large Language Models.

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

GitHub

5k stars
23 watching
228 forks
Language: Python
last commit: 19 days ago
docxllmparserpdfpowerpoint