tape

Protein benchmarks

Provides pre-trained protein embeddings and benchmarking tools for semi-supervised learning tasks in protein biology

Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology.

GitHub

671 stars

22 watching

130 forks

Language: Python

last commit: about 3 years ago

Linked from 1 awesome list

benchmarkdatasetdeep-learninglanguage-modelingprotein-sequencesprotein-structurepytorchsemi-supervised-learning

www.biorxiv.org/content/10.1101/676825v1

Backlinks from these awesome lists:

xnuohz/awesome-drug-discovery

Related projects:

Repository	Description	Stars
songlab-cal/tape-neurips2019	A software framework for evaluating protein embeddings and benchmarking semi-supervised learning tasks in protein biology	118
tbepler/protein-sequence-embedding-iclr2019	Developing models to learn and represent protein sequences based on their structure	259
hicai-zju/promptprotein	An implementation of a protein language model that uses prompts to learn from multi-level structural information in proteins.	32
cbcrg/benchfam	Generates a benchmark dataset for evaluating protein alignment programs	3
pku-yuangroup/video-bench	Evaluates and benchmarks large language models' video understanding capabilities	121
prosodylab/prosodylab.alignertools	A package of scripts to prepare data for alignment in speech processing	12
kudkudak/word-embeddings-benchmarks	Provides methods for evaluating word embeddings on various benchmarks	437
antoine77340/howto100m	Provides code and tools for learning joint text-video embeddings using the HowTo100M dataset	254
ys-zong/vl-icl	A benchmarking suite for multimodal in-context learning models	31
ailab-cvc/seed-bench	A benchmark for evaluating large language models' ability to process multimodal input	322
automl/hpobench	A collection of benchmark problems for hyperparameter optimization	140
talwalkarlab/leaf	A benchmarking framework for federated machine learning tasks across various domains and datasets	856
yandex/rep	A toolset for building and running reproducible machine learning experiments in Python	689
jordipons/eusipco2017	Research code for music auto-tagging using deep learning and feature extraction	23
ncbi-nlp/biosentvec	Pre-trained word and sentence embeddings for biomedical text analysis	578