datashare

Document analyzer

An application that helps investigate journalists analyze and search documents, using natural language processing and entity recognition techniques.

A self-hosted search engine for documents.

GitHub

597 stars
29 watching
53 forks
Language: Java
last commit: 8 days ago
datasharedockerelasticsearchextractinvestigative-journalismnamed-entity-recognitiontext-extractionweb-gui

Related projects:

Repository Description Stars
jsv4/opencontracts A document analytics platform providing features for managing documents, extracting layout information and vector embeddings, annotating documents, and querying them using LlamaIndex. 717
johtani/analyze-api-ui-plugin An Elasticsearch plugin that provides a user interface for analyzing text with the Analyzer. 109
chungkwong/mathocr A software project that enables the recognition and analysis of printed scientific documents, particularly focusing on mathematical expressions. 167
carlsednaoui/seo-bookmarklet A tool to analyze and display website metadata for SEO purposes. 50
hazyresearch/deepdive An interactive, command-line tool for diving into and analyzing text data. 1,958
cybercentrecanada/assemblyline-service-espresso Analyzes Java JAR files for malicious code and extracts human-readable class files 2
naikai/sake A tool for analyzing Single-cell RNA-Seq data to identify patterns and clusters in gene expression. 27
dunyakirkali/format_parser.ex A tool to determine and extract metadata from various file formats 23
stanfordhci/datavore A small JavaScript database engine designed to support fast aggregation queries in web-based analytics and visualization applications. 248
maddevsio/seo-analyzer Analyzes HTML files for SEO defects and provides customizable rules-based analysis 78
iosiro/baserunner A tool to explore and analyze Firebase datastores 205
busindre/dumpzilla A forensic tool that extracts and analyzes interesting information from Firefox, Iceweasel, and Seamonkey browsers 130
narimann2/parsianalyzer An Elasticsearch plugin for analyzing and processing Persian text 152
nshiab/simple-data-analysis A JavaScript library for easy data analysis and processing of tabular and geospatial data 252
deitch/searchjs A JavaScript query library with a JSON query language to filter objects based on conditions specified in the query. 311