datashare
Document analyzer
An application that helps investigate journalists analyze and search documents, using natural language processing and entity recognition techniques.
A self-hosted search engine for documents.
601 stars
29 watching
54 forks
Language: Java
last commit: 2 months ago datasharedockerelasticsearchextractinvestigative-journalismnamed-entity-recognitiontext-extractionweb-gui
Related projects:
Repository | Description | Stars |
---|---|---|
| A document analytics platform providing features for managing documents, extracting layout information and vector embeddings, annotating documents, and querying them using LlamaIndex. | 728 |
| An Elasticsearch plugin that provides a user interface for analyzing text with the Analyzer. | 110 |
| A software project that enables the recognition and analysis of printed scientific documents, particularly focusing on mathematical expressions. | 168 |
| A tool to analyze and display website metadata for SEO purposes. | 51 |
| An interactive, command-line tool for diving into and analyzing text data. | 1,959 |
| Analyzes Java JAR files for malicious code and extracts human-readable class files | 2 |
| A tool for analyzing Single-cell RNA-Seq data to identify patterns and clusters in gene expression. | 27 |
| A tool to determine and extract metadata from various file formats | 23 |
| A small JavaScript database engine designed to support fast aggregation queries in web-based analytics and visualization applications. | 248 |
| Analyzes HTML files for SEO defects and provides customizable rules-based analysis | 79 |
| A tool to explore and analyze Firebase datastores | 211 |
| A forensic tool that extracts and analyzes interesting information from Firefox, Iceweasel, and Seamonkey browsers | 130 |
| An Elasticsearch plugin for analyzing and processing Persian text | 154 |
| A JavaScript library for easy data analysis and processing of tabular and geospatial data | 251 |
| A JavaScript query library with a JSON query language to filter objects based on conditions specified in the query. | 313 |