GenePT

Gene embedding model

A foundation model for single-cell biology tasks by leveraging large language model embeddings of gene descriptions.

GitHub

176 stars
7 watching
27 forks
Language: Jupyter Notebook
last commit: 8 months ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
zhangxiann/skip-gram A Python implementation of a neural network model for learning word embeddings from text data 6
mnqu/pte An implementation of the Predictive Text Embedding model for learning word representations from large-scale heterogeneous text networks. 96
fanglanting/skip-gram-pytorch A PyTorch implementation of the skip-gram model for learning word embeddings. 188
alexandres/lexvec An implementation of a word embedding model that uses character n-grams and achieves state-of-the-art results in multiple NLP tasks 803
wangyuxinwhy/uniem Develops unified sentence embedding models for NLP tasks 833
xiaoqijiao/coling2018 Provides training and testing code for a CNN-based sentence embedding model 2
jwieting/acl2017 A codebase for training and using models of sentence embeddings. 33
malllabiisc/wordgcn A deep learning model that generates word embeddings by predicting words based on their dependency context 290
jianhuupenn/spagcn An algorithmic framework to integrate gene expression data with spatial location and histological information to identify distinct regions in tissue samples. 198
rguthrie3/morphologicalpriorsforwordembeddings A project implementing a method to incorporate morphological information into word embeddings using a neural network model 52
ieit-yuan/yuan2.0-m32 A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation 180
ymcui/chinese-xlnet Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture 1,653
bowang-lab/scgpt A Jupyter Notebook-based framework for training and applying generative AI models to single-cell multi-omics data 1,039
jiangtong-li/subword-elmo This is a repository for a subword ELMo model pre-trained on a large corpus of text. 12
01-ai/yi A series of large language models trained from scratch to excel in multiple NLP tasks 7,699