PromptProtein

Protein Model

An implementation of a protein language model that uses prompts to learn from multi-level structural information in proteins.

Code and data for the paper "Multi-level Protein Structure Pre-training with Prompt Learning" (ICLR 2023).

GitHub

32 stars
2 watching
8 forks
Language: Python
Last commit: over 1 year ago

Related projects:

Repository | Description | Stars
muhaochen/seq_ppi | A deep learning framework for predicting protein-protein interactions based on sequence data | 89
mbzuai-nlp/bactrian-x | A collection of multilingual language models trained on a dataset of instructions and responses in various languages | 94
songlab-cal/tape | Provides pre-trained protein embeddings and benchmarking tools for semi-supervised learning tasks in protein biology | 662
tbepler/protein-sequence-embedding-iclr2019 | A framework for learning protein sequence and structure embeddings using deep learning models | 258
zjulearning/graph_level_drug_discovery | A Python project that uses machine learning to improve the representation of molecules in drug discovery | 60
01-ai/yi | A series of large language models trained from scratch to excel in multiple NLP tasks | 7,699
zhuiyitechnology/gau-alpha | An implementation of a Gated Attention Unit-based Transformer model for natural language processing tasks | 96
lifanchen-simm/transformercpi | Develops a deep learning model to predict compound-protein interactions by leveraging sequence-based learning and self-attention mechanisms | 134
zcli-charlie/batgpt | A large language model designed to support long context conversations with improved efficiency and effectiveness | 38
tiger-ai-lab/uniir | Trains and evaluates a universal multimodal retrieval model to perform various information retrieval tasks | 110
zhuiyitechnology/pretrained-models | A collection of pre-trained language models for natural language processing tasks | 987
michael-wzhu/promptcblue | A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain | 323
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230
zhuiyitechnology/roformer-sim | An upgraded version of the SimBERT model with integrated retrieval and generation capabilities | 438
ieit-yuan/yuan2.0-m32 | A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation | 180