tape
Protein benchmarks
Provides pre-trained protein embeddings and benchmarking tools for semi-supervised learning tasks in protein biology
Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology.
671 stars
22 watching
130 forks
Language: Python
last commit: about 2 years ago
Linked from 1 awesome list
benchmarkdatasetdeep-learninglanguage-modelingprotein-sequencesprotein-structurepytorchsemi-supervised-learning
Related projects:
Repository | Description | Stars |
---|---|---|
| A software framework for evaluating protein embeddings and benchmarking semi-supervised learning tasks in protein biology | 118 |
| Developing models to learn and represent protein sequences based on their structure | 259 |
| An implementation of a protein language model that uses prompts to learn from multi-level structural information in proteins. | 32 |
| Generates a benchmark dataset for evaluating protein alignment programs | 3 |
| Evaluates and benchmarks large language models' video understanding capabilities | 121 |
| A package of scripts to prepare data for alignment in speech processing | 12 |
| Provides methods for evaluating word embeddings on various benchmarks | 437 |
| Provides code and tools for learning joint text-video embeddings using the HowTo100M dataset | 254 |
| A benchmarking suite for multimodal in-context learning models | 31 |
| A benchmark for evaluating large language models' ability to process multimodal input | 322 |
| A collection of benchmark problems for hyperparameter optimization | 140 |
| A benchmarking framework for federated machine learning tasks across various domains and datasets | 856 |
| A toolset for building and running reproducible machine learning experiments in Python | 689 |
| Research code for music auto-tagging using deep learning and feature extraction | 23 |
| Pre-trained word and sentence embeddings for biomedical text analysis | 578 |