wagtail_textract

Document Extractor

A Django package that adds text extraction to Wagtail's document search, using the Tesseract and Textract libraries.

Text extraction for Wagtail document search
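
The idea in the description can be sketched roughly as follows: pull plain text out of an uploaded file so it can be matched against a search query. This is a minimal illustration, not the package's actual code. The helper names `extract_text` and `matches` are hypothetical; the only real library call assumed is `textract.process(path)`, which returns the extracted text as bytes.

```python
from typing import Callable, Optional


def extract_text(path: str,
                 extractor: Optional[Callable[[str], bytes]] = None) -> str:
    """Return the text content of the file at `path`, or "" if extraction fails.

    `extractor` is a hypothetical hook for this sketch; by default it is
    textract.process, which takes a file path and returns bytes.
    """
    if extractor is None:
        # Imported lazily so the sketch can be exercised without textract installed.
        import textract
        extractor = textract.process
    try:
        raw = extractor(path)
    except Exception:
        # Unreadable or unsupported files simply contribute no searchable text.
        return ""
    return raw.decode("utf-8", errors="replace").strip()


def matches(query: str, text: str) -> bool:
    """Naive case-insensitive substring match, standing in for a real search backend."""
    return query.lower() in text.lower()
```

In a real Wagtail project the extracted text would presumably be stored on the document record and handed to the configured search backend rather than matched by substring; the stub above only shows the data flow.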

GitHub

33 stars
6 watching
13 forks
Language: Python
last commit: about 1 year ago
Linked from 1 awesome list

django, search, tesseract, text-extraction, textract, wagtail

Related projects:

Repository | Description | Stars
wagtail/cookiecutter-wagtail-package | A template for building custom Wagtail add-ons | 17
nigel2392/wagtail_editorjs | A Django app providing an integrated rich text editor with features like page/image chooser and document support | 9
wagtail/wagtailtrans | A Wagtail add-on that supports multilingual sites | 104
mattsegal/wagtail-clip | A Python package that enables natural language search over Wagtail images using the CLIP model | 12
nigel2392/wagtail_text_alignment | Enhances text alignment in Wagtail richtext editors with support for block entities | 4
springload/wagtaildraftail | Provides a rich text editor with Draft.js capabilities for Wagtail pages | 24
aleksi44/wagtailyoast | Integrates Wagtail and Yoast SEO for Django projects | 34
wagtail-nest/wagtail-accessibility | A tool for testing and improving accessibility in web content | 32
infoportugal/wagtail-modeltranslation | An app to add translation support to Wagtail CMS without modifying the original models | 151
filepreviews/wagtail-filepreviews | Extends Wagtail's Documents to include image previews and metadata from FilePreviews.io | 22
donhauser/wagtail-pdf | A tool for converting Wagtail pages and models to PDF documents using weasyprint | 25
aymericbeaumet/squeeze | A tool to extract relevant information from text | 17
wagtail/wagtail-localize | Enables translation of Wagtail content within its admin interface | 226
aeksco/aws-pdf-textract-pipeline | A data pipeline for extracting structured data from PDFs using AWS Textract and cloud-based services | 164
nigel2392/wagtail_word | A Wagtail module to display Word documents as pages in the admin interface | 0