BioSentVec

Bio Embeddings

Pre-trained word and sentence embeddings for biomedical text analysis

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences

GitHub

578 stars
17 watching
100 forks
Language: Jupyter Notebook
Last commit: over 1 year ago
Topics: bionlp, fasttext, mimic-iii, natural-language-processing, pubmed, sent2vec, sentence-embeddings, sentence-similarity, word-embeddings

Related projects:

Repository | Description | Stars
ncbi-nlp/bluebert | Pre-trained language models for biomedical natural language processing tasks | 560
dmis-lab/biobert | Provides pre-trained language representation models for biomedical text mining tasks | 1,970
naver/biobert-pretrained | Provides pre-trained weights for a biomedical language representation model | 672
ncbi/genegpt | An LLM that leverages NCBI Web APIs to answer biomedical information questions with improved accuracy and reliability | 384
botcenter/spanishwordembeddings | This project generates Spanish word embeddings using fastText on large corpora | 9
botcenter/spanish-sent2vec | This project trains a machine learning model to generate sentence embeddings from Spanish text data using the sent2vec algorithm | 4
ebi-webcomponents/nightingale | A collection of reusable visualisation components for life sciences data | 124
ncbi-hackathons/spew | Automates the packaging and distribution of bioinformatics pipelines for seamless deployment on various workstations | 26
materialsintelligence/mat2vec | Unsupervised word embeddings capture latent knowledge from materials science literature | 624
embeddings-benchmark/mteb | Provides tools and benchmarks for evaluating text embedding models | 2,021
satwikkottur/visualword2vec | Learning word embeddings from abstract images to improve language understanding | 19
vefstathiou/so_word2vec | This is a word embedding model trained on Stack Overflow posts for use in natural language processing tasks | 40
tbepler/protein-sequence-embedding-iclr2019 | Developing models to learn and represent protein sequences based on their structure | 259
bionode/biohacker | A workshop project providing examples and usage guidance for using bionode in bioinformatics pipelines | 5
emilyalsentzer/clinicalbert | Provides clinical BERT embeddings for natural language processing tasks in healthcare | 680