adam

Genomics analyzer

A platform for parallelizing genomic data analysis on large clusters

ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

GitHub

1k stars
99 watching
309 forks
Language: Scala
last commit: about 1 month ago
Linked from 4 awesome lists

avrobig-databioinformaticsgenomicsjavaparquetpythonrscalaspark

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
dfajar2/bigscale An analytical framework for analyzing large-scale single-cell data by identifying coexpressed genes and detecting differentially expressed genes across multiple clusters. 1
hail-is/hail A Python-based tool for analyzing genomic data in the cloud with support for batch computing and distributed queries 984
jtaghiyar/kronos Tools and workflow management for large-scale genome data analysis 19
andersenlab/cegwas2-nf A software framework for genomic association studies using C. elegans data 8
umich-biostatistics/ibag An R application for conducting Bayesian analyses of genomic models using a user-friendly interface. 5
yannael/bigdataanalytics_infoh515 A collection of Jupyter notebooks teaching Big Data Analytics with Spark and machine learning concepts 59
nvidia-genomics-research/atacworks A toolkit for preprocessing and analyzing high-throughput DNA sequencing data from ATAC-seq experiments using deep learning techniques. 128
pachterlab/gget Enables efficient querying of genomic databases using a modular approach and multiple tools 946
nvidia-genomics-research/rapids-single-cell-examples A collection of example notebooks demonstrating GPU-accelerated single-cell genomic analysis using the RAPIDS libraries 324
jhu99/scbean Analyzes single-cell multi-omics data from various modalities like RNA-seq and ATAC-seq 16
arvados/arvados A platform for managing and analyzing large biomedical datasets through scalable workflows and reliable storage 400
bodenmillergroup/histocat An interactive analysis toolbox for multiplexed image cytometry data 1
h3abionet/h3agwas A comprehensive human genome-wide association study workflow for data quality control and basic association testing. 106
nsalomonis/altanalyze An automated cross-platform workflow for RNA-Seq gene and splicing analysis 99
xtra-computing/thundergbm Accelerates machine learning algorithms on GPUs to improve performance and efficiency 693