skizze

Data estimator

A service that stores probabilistic data structures for efficient estimation of metrics over large datasets

A probabilistic data structure service and storage


89 stars
8 watching
9 forks
Language: Go
Last commit: over 8 years ago
Linked from 2 awesome lists


Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| skizzehq/skizze | A service for efficient storage and analysis of large datasets using probabilistic data structures. | 771 |
| seiflotfy/pmc | A probabilistic algorithm for estimating multiplicity counts in large data streams without prior knowledge of maximum frequency | 49 |
| evgenyneu/sigmaswiftstatistics | Provides a set of statistical functions in Swift for calculations such as mean, median, standard deviation, and more. | 701 |
| statomics/zinbwavezinger | A software framework for integrating zingeR with ZINB-WaVE weights for RNA-seq data analysis | 23 |
| dataforgoodfr/quotaclimat | A tool to quantify media coverage of climate crises by collecting and analyzing radio and TV data from the Mediatree API. | 28 |
| ogfris/gostats | A collection of statistical measures functions for use in machine learning applications | 22 |
| seldonio/alibi-detect | A Python library for detecting outliers, adversarial examples, and data drift in various types of data | 2,247 |
| sosdave/enumeration-as-a-service | Analyzes DNS records to identify SaaS usage, providing a list of potential services and their associated IP addresses | 28 |
| seiflotfy/count-min-log | A Go implementation of a technique to approximate event counting with reduced error for low-frequency events in large-scale processing. | 66 |
| nferraz/st | A command-line tool for calculating simple statistics from datasets | 924 |
| sktime/pysf | A Python library for supervised forecasting of sequential data | 55 |
| pusewicz/descriptive_statistics | A collection of functions to calculate descriptive statistics from a list of numbers | 9 |
| skgrange/saqgetr | An R package to fetch and manipulate air quality monitoring data from remote servers. | 9 |
| qifeidkn/stagate | A software framework for learning spatial embeddings from high-dimensional transcriptomics data using an adaptive graph attention auto-encoder | 37 |
| alexbrillant/seq2seq-attention | An implementation of an attention mechanism using TensorFlow 2 to analyze time series data. | 7 |