BioSentVec
Bio Embeddings
Pre-trained word and sentence embeddings for biomedical text analysis
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
578 stars
17 watching
100 forks
Language: Jupyter Notebook
last commit: over 1 year ago
Topics: bionlp, fasttext, mimic-iii, natural-language-processing, pubmed, sent2vec, sentence-embeddings, sentence-similarity, word-embeddings
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Pre-trained language models for biomedical natural language processing tasks | 560 |
| | Provides pre-trained language representation models for biomedical text mining tasks | 1,970 |
| | Provides pre-trained weights for a biomedical language representation model | 672 |
| | An LLM that leverages NCBI Web APIs to answer biomedical information questions with improved accuracy and reliability | 384 |
| | This project generates Spanish word embeddings using fastText on large corpora. | 9 |
| | This project trains a machine learning model to generate sentence embeddings from Spanish text data using the sent2vec algorithm. | 4 |
| | A collection of reusable visualisation components for life sciences data | 124 |
| | Automates the packaging and distribution of bioinformatics pipelines for seamless deployment on various workstations. | 26 |
| | Unsupervised word embeddings capture latent knowledge from materials science literature | 624 |
| | Provides tools and benchmarks for evaluating text embedding models | 2,021 |
| | Learning word embeddings from abstract images to improve language understanding | 19 |
| | This is a word embedding model trained on Stack Overflow posts for use in natural language processing tasks. | 40 |
| | Developing models to learn and represent protein sequences based on their structure | 259 |
| | A workshop project providing examples and usage guidance for using bionode in bioinformatics pipelines | 5 |
| | Provides clinical BERT embeddings for natural language processing tasks in healthcare | 680 |