HyperMinHash-java

Cardinality estimator

An algorithm for approximating set cardinalities and intersections with low memory usage

Union, intersection, and set cardinality in loglog space

GitHub

54 stars
124 watching
10 forks
Language: Java
last commit: over 1 year ago
Linked from 1 awesome list

cardinalitycardinality-estimationhyperlogloghyperloglog-sketchesjavaloglogloglog-betaminhash

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
axiomhq/hyperloglog An algorithm for approximating the number of distinct elements in data sets 946
aggregateknowledge/java-hll A Java library implementing a probabilistic set data structure for counting distinct values with tunable precision 313
datacanvasio/hypergbm Automated machine learning tool for tabular data pipelines 343
aviate-labs/hash.mo Provides a set of hash functions implemented using the Modelica programming language. 2
nixtla/mlforecast A framework to perform time series forecasting using machine learning models on large datasets. 924
sjsingh91/ib-cnn A library implementing a learning algorithm for improving classification accuracy with incremental updates and ensemble methods using neural networks 2
nathanmarz/cascalog A library for data processing and querying on large datasets without the need for Hadoop expertise 1,375
lancopku/iais This project proposes a novel method for calibrating attention distributions in multimodal models to improve contextualized representations of image-text pairs. 30
emilbayes/kmeans.js An implementation of the k-means clustering algorithm in JavaScript 45
tlk00/bitmagic A C++ library for compact data structures and algorithms optimized for memory efficiency and high performance 413
louismullie/graph-rank An implementation of PageRank and TextRank algorithms in Ruby. 75
jcollard/unm-hip A Haskell library for image processing and display 82
liyanghart/hyperparameter-optimization-of-machine-learning-algorithms Provides tools and techniques for tuning hyperparameters in machine learning models to improve performance. 1,283
mcleonard/sampyl An MCMC sampler library for Bayesian estimation using Python 331
kirthevasank/nasbot An implementation of Neural Architecture Search with Bayesian Optimization and Optimal Transport 133