datashare
Document analyzer
An application that helps investigate journalists analyze and search documents, using natural language processing and entity recognition techniques.
A self-hosted search engine for documents.
597 stars
29 watching
53 forks
Language: Java
last commit: 10 days ago datasharedockerelasticsearchextractinvestigative-journalismnamed-entity-recognitiontext-extractionweb-gui
Related projects:
Repository | Description | Stars |
---|---|---|
jsv4/opencontracts | A document analytics platform providing features for managing documents, extracting layout information and vector embeddings, annotating documents, and querying them using LlamaIndex. | 717 |
johtani/analyze-api-ui-plugin | An Elasticsearch plugin that provides a user interface for analyzing text with the Analyzer. | 109 |
chungkwong/mathocr | A software project that enables the recognition and analysis of printed scientific documents, particularly focusing on mathematical expressions. | 167 |
carlsednaoui/seo-bookmarklet | A tool to analyze and display website metadata for SEO purposes. | 50 |
hazyresearch/deepdive | An interactive, command-line tool for diving into and analyzing text data. | 1,958 |
cybercentrecanada/assemblyline-service-espresso | Analyzes Java JAR files for malicious code and extracts human-readable class files | 2 |
naikai/sake | A tool for analyzing Single-cell RNA-Seq data to identify patterns and clusters in gene expression. | 27 |
dunyakirkali/format_parser.ex | A tool to determine and extract metadata from various file formats | 23 |
stanfordhci/datavore | A small JavaScript database engine designed to support fast aggregation queries in web-based analytics and visualization applications. | 248 |
maddevsio/seo-analyzer | Analyzes HTML files for SEO defects and provides customizable rules-based analysis | 78 |
iosiro/baserunner | A tool to explore and analyze Firebase datastores | 205 |
busindre/dumpzilla | A forensic tool that extracts and analyzes interesting information from Firefox, Iceweasel, and Seamonkey browsers | 130 |
narimann2/parsianalyzer | An Elasticsearch plugin for analyzing and processing Persian text | 152 |
nshiab/simple-data-analysis | A JavaScript library for easy data analysis and processing of tabular and geospatial data | 252 |
deitch/searchjs | A JavaScript query library with a JSON query language to filter objects based on conditions specified in the query. | 311 |