MathOCR
Document analyzer
A software project that enables the recognition and analysis of printed scientific documents, particularly focusing on mathematical expressions.
A scientific document recognition system
168 stars
11 watching
41 forks
Language: Java
last commit: over 2 years ago
Linked from 1 awesome list
latexoptical-character-recognitionscientific-document-recognition
Related projects:
Repository | Description | Stars |
---|---|---|
| A document analytics platform providing features for managing documents, extracting layout information and vector embeddings, annotating documents, and querying them using LlamaIndex. | 728 |
| Develops tools and algorithms for analyzing layout and structure of documents in PDF format | 591 |
| An application that helps investigate journalists analyze and search documents, using natural language processing and entity recognition techniques. | 601 |
| An Android-specific toolkit for analyzing and understanding APK files | 118 |
| An Elasticsearch analyzer plugin for analyzing Korean text using the Open-Korean Text module. | 127 |
| A multimodal LLM for understanding and generating charts in various formats. | 202 |
| A tool for analyzing and visualizing discrimination in machine learning models | 6 |
| A model-driven language text analysis system with a rule-based approach to extract information from large volumes of text | 57 |
| Analyzes web components and emits documentation in various formats | 509 |
| A C# library for extracting and analyzing text from PDF files | 1,794 |
| Analyzes software patches to identify vulnerabilities and weaknesses | 359 |
| A tool to analyze and improve the language of scientific papers before submission. | 98 |
| A tool to analyze and extract malicious content from office documents and executables | 126 |
| Analyzes D source code for syntax, style, and security issues | 242 |
| Evaluates PDF extraction tools' ability to extract meaningful text from scientific articles | 65 |