spanish-sent2vec

Spanish embedder

This project trains a machine learning model to generate sentence embeddings from Spanish text data using the sent2vec algorithm.

Spanish Sentence Embeddings computed from large corpora using sent2vec.

GitHub

4 stars
4 watching
0 forks
last commit: over 5 years ago
Linked from 1 awesome list

embeddingsnatural-language-processingnlpsent2vecsentence-embeddingsspanishspanish-language

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
botcenter/spanishwordembeddings This project generates Spanish word embeddings using fastText on large corpora. 9
lajanugen/s2v An implementation of a neural network model for learning efficient sentence representations from text data. 205
vefstathiou/so_word2vec This is a word embedding model trained on Stack Overflow posts for use in natural language processing tasks. 40
epfml/sent2vec An unsupervised technique to generate numerical representations of sentences and words for use in machine learning tasks 1,193
tca19/dict2vec A framework to learn word embeddings using lexical dictionaries 115
dccuchile/spanish-word-embeddings A collection of precomputed word embeddings for the Spanish language, derived from different corpora and computational methods. 356
kostyaev/sentence2vec This is a tool for creating deep sentence embeddings using Sequence-to-Sequence learning. 22
wikipedia2vec/wikipedia2vec A tool for learning vector representations of words and entities from Wikipedia text data. 940
satwikkottur/visualword2vec Learning word embeddings from abstract images to improve language understanding 19
alexandres/lexvec An implementation of a word embedding model that uses character n-grams and achieves state-of-the-art results in multiple NLP tasks 803
ncbi-nlp/biosentvec Pre-trained word and sentence embeddings for biomedical text analysis 578
dalinvip/cw2vec A software framework for learning Chinese word embeddings with stroke n-gram information 274
materialsintelligence/mat2vec Unsupervised word embeddings capture latent knowledge from materials science literature 619
cod3licious/conec A library for training and evaluating a type of word embedding model that extends the original Word2Vec algorithm 20
bakrianoo/aravec Provides pre-trained word embedding models for Arabic text analysis 394