PromptProtein

Protein Model

An implementation of a protein language model that uses prompts to learn from multi-level structural information in proteins.

Code and Data for the paper: Multi-level Protein Structure Pre-training with Prompt Learning [ICLR 2023]

GitHub

32 stars

2 watching

9 forks

Language: Python

last commit: almost 3 years ago

Related projects:

Repository	Description	Stars
muhaochen/seq_ppi	A deep learning framework for predicting protein-protein interactions based on sequence data	89
mbzuai-nlp/bactrian-x	A collection of multilingual language models trained on a dataset of instructions and responses in various languages.	94
songlab-cal/tape	Provides pre-trained protein embeddings and benchmarking tools for semi-supervised learning tasks in protein biology	671
tbepler/protein-sequence-embedding-iclr2019	Developing models to learn and represent protein sequences based on their structure	259
zjulearning/graph_level_drug_discovery	A Python project that uses machine learning to improve the representation of molecules in drug discovery	60
01-ai/yi	A series of large language models trained from scratch to excel in multiple NLP tasks	7,743
zhuiyitechnology/gau-alpha	An implementation of a transformer-based NLP model utilizing gated attention units	98
lifanchen-simm/transformercpi	Develops a deep learning model to predict compound-protein interactions by leveraging sequence-based learning and self-attention mechanisms	134
zcli-charlie/batgpt	A large language model designed to support long context conversations with improved efficiency and effectiveness	38
tiger-ai-lab/uniir	Trains and evaluates a universal multimodal retrieval model to perform various information retrieval tasks.	114
zhuiyitechnology/pretrained-models	A collection of pre-trained language models for natural language processing tasks	989
michael-wzhu/promptcblue	A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain	328
brightmart/xlnet_zh	Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks	230
zhuiyitechnology/roformer-sim	An upgraded version of SimBERT with integrated retrieval and generation capabilities	441
ieit-yuan/yuan2.0-m32	A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation	182