wagtail_textract
Document Extractor
A Django package that enhances Wagtail's document search with text extraction capabilities using Tesseract and Textract libraries.
Text extraction for Wagtail document search
33 stars
6 watching
13 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list
djangosearchtesseracttext-extractiontextractwagtail
Related projects:
Repository | Description | Stars |
---|---|---|
| A template for building custom Wagtail add-ons | 17 |
| A Django app providing an integrated rich text editor with features like page/image chooser and document support. | 9 |
| A Wagtail add-on that supports multilingual sites | 104 |
| A Python package that enables natural language search over Wagtail images using the CLIP model. | 12 |
| Enhances text alignment in Wagtail richtext editors with support for block entities. | 4 |
| Provides a rich text editor with Draft.js capabilities for Wagtail pages | 24 |
| Integrates Wagtail and Yoast SEO for Django projects | 34 |
| A tool for testing and improving accessibility in web content | 32 |
| An app to add translation support to Wagtail CMS without modifying the original models | 151 |
| Extends Wagtail's Documents to include image previews and metadata from FilePreviews.io | 22 |
| A tool for converting Wagtail pages and models to PDF documents using weasyprint. | 26 |
| A tool to extract relevant information from text | 17 |
| Enables translation of Wagtail content within its admin interface | 228 |
| A data pipeline for extracting structured data from PDFs using AWS Textract and cloud-based services | 164 |
| A Wagtail module to display Word documents as pages in the admin interface. | 0 |