PromptProtein

Protein Model

An implementation of a protein language model that uses prompts to learn from multi-level structural information in proteins.

Code and data for the paper "Multi-level Protein Structure Pre-training with Prompt Learning" (ICLR 2023).

GitHub

32 stars
2 watching
9 forks
Language: Python
last commit: over 1 year ago

Related projects:

Repository | Description | Stars
muhaochen/seq_ppi | A deep learning framework for predicting protein-protein interactions based on sequence data | 89
mbzuai-nlp/bactrian-x | A collection of multilingual language models trained on a dataset of instructions and responses in various languages | 94
songlab-cal/tape | Provides pre-trained protein embeddings and benchmarking tools for semi-supervised learning tasks in protein biology | 671
tbepler/protein-sequence-embedding-iclr2019 | Developing models to learn and represent protein sequences based on their structure | 259
zjulearning/graph_level_drug_discovery | A Python project that uses machine learning to improve the representation of molecules in drug discovery | 60
01-ai/yi | A series of large language models trained from scratch to excel in multiple NLP tasks | 7,743
zhuiyitechnology/gau-alpha | An implementation of a transformer-based NLP model utilizing gated attention units | 98
lifanchen-simm/transformercpi | Develops a deep learning model to predict compound-protein interactions by leveraging sequence-based learning and self-attention mechanisms | 134
zcli-charlie/batgpt | A large language model designed to support long context conversations with improved efficiency and effectiveness | 38
tiger-ai-lab/uniir | Trains and evaluates a universal multimodal retrieval model to perform various information retrieval tasks | 114
zhuiyitechnology/pretrained-models | A collection of pre-trained language models for natural language processing tasks | 989
michael-wzhu/promptcblue | A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain | 328
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230
zhuiyitechnology/roformer-sim | An upgraded version of SimBERT with integrated retrieval and generation capabilities | 441
ieit-yuan/yuan2.0-m32 | A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation | 182