BioSentVec

Bio Embeddings

Pre-trained word and sentence embeddings for biomedical text analysis

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences

GitHub

578 stars
17 watching
99 forks
Language: Jupyter Notebook
last commit: over 1 year ago
bionlpfasttextmimic-iiinatural-language-processingpubmedsent2vecsentence-embeddingssentence-similarityword-embeddings

Related projects:

Repository Description Stars
ncbi-nlp/bluebert Pre-trained language models for biomedical natural language processing tasks 558
dmis-lab/biobert Provides pre-trained language representation models for biomedical text mining tasks 1,954
naver/biobert-pretrained Provides pre-trained weights for a biomedical language representation model 667
ncbi/genegpt An LLM that leverages NCBI Web APIs to answer biomedical information questions with improved accuracy and reliability 379
botcenter/spanishwordembeddings This project generates Spanish word embeddings using fastText on large corpora. 9
botcenter/spanish-sent2vec This project trains a machine learning model to generate sentence embeddings from Spanish text data using the sent2vec algorithm. 4
ebi-webcomponents/nightingale A collection of reusable visualisation components for life sciences data 124
ncbi-hackathons/spew Automates the packaging and distribution of bioinformatics pipelines for seamless deployment on various workstations. 26
materialsintelligence/mat2vec Unsupervised word embeddings capture latent knowledge from materials science literature 619
embeddings-benchmark/mteb A benchmarking suite for evaluating text embedding models across various NLP tasks and datasets. 1,952
satwikkottur/visualword2vec Learning word embeddings from abstract images to improve language understanding 19
vefstathiou/so_word2vec This is a word embedding model trained on Stack Overflow posts for use in natural language processing tasks. 40
tbepler/protein-sequence-embedding-iclr2019 A framework for learning protein sequence and structure embeddings using deep learning models. 258
bionode/biohacker A workshop project providing examples and usage guidance for using bionode in bioinformatics pipelines 5
emilyalsentzer/clinicalbert Provides pre-trained embeddings for clinical text data 674