hail

Genomic data analysis platform

A Python-based tool for analyzing genomic data in the cloud with support for batch computing and distributed queries

Cloud-native genomic dataframes and batch computing

GitHub

984 stars
55 watching
246 forks
Language: Python
Last commit: 6 days ago
Linked from 1 awesome list

bioinformatics, genetics, genomics, gwas, hail, python, software, vcf

Related projects:

Repository | Description | Stars
bigdatagenomics/adam | A platform for parallelizing genomic data analysis on large clusters | 1,003
brwnj/smoove-nf | An automation workflow for detecting structural variations in genomic data using the smoove toolset | 12
pachterlab/gget | Enables efficient querying of genomic databases using a modular approach and multiple tools | 946
umich-biostatistics/ibag | An R application for conducting Bayesian analyses of genomic models using a user-friendly interface | 5
andersenlab/cegwas2-nf | A software framework for genomic association studies using C. elegans data | 8
nationalgenomicsinfrastructure/icing | An approach to analyzing Oxford Nanopore reads for HLA typing using Python | 13
h3abionet/h3agwas | A comprehensive human genome-wide association study workflow for data quality control and basic association testing | 106
h3abionet/chipimputation | An imputation workflow tool for genomics data analysis | 20
hexgnu/wine_clustering | An application of machine learning to cluster similar data points from various sources | 0
jtaghiyar/kronos | Tools and workflow management for large-scale genome data analysis | 19
nvidia-genomics-research/rapids-single-cell-examples | A collection of example notebooks demonstrating GPU-accelerated single-cell genomic analysis using the RAPIDS libraries | 324
emissions-api/emissions-api | Provides access to satellite-based emission data | 75
singularity-energy/open-grid-emissions | Provides tools and data for high-quality hourly grid emissions analysis | 75
sigven/pcgr | Software tool for annotating and interpreting cancer genome data to support precision oncology | 254
glacials/splits-io | A speedrunning data store and analysis engine that enables runners to improve through data analysis | 133