aravec

Arabic embeddings

Provides pre-trained word embedding models for Arabic text analysis

AraVec is an open-source project that provides pre-trained distributed word representations (word embeddings). Its aim is to give the Arabic NLP research community free-to-use, powerful word embedding models.
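As a rough illustration of what a word embedding model is used for, the sketch below looks up the nearest neighbour of a word by cosine similarity. The vectors are invented toy values, not taken from AraVec, and the helper names `cosine_similarity` and `most_similar` are hypothetical. In practice the pre-trained models would presumably be loaded with a library such as gensim, which the repository is tagged with.

```python
import math

# Toy 3-dimensional vectors, invented for illustration only.
# Real pre-trained models have many more dimensions and a large vocabulary.
TOY_VECTORS = {
    "ملك": [0.9, 0.1, 0.0],    # "king"
    "ملكة": [0.8, 0.3, 0.0],   # "queen"
    "تفاحة": [0.0, 0.2, 0.9],  # "apple"
}


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def most_similar(word, vectors):
    """Return the other vocabulary word closest to `word` by cosine similarity."""
    query = vectors[word]
    others = [w for w in vectors if w != word]
    return max(others, key=lambda w: cosine_similarity(query, vectors[w]))


if __name__ == "__main__":
    print(most_similar("ملك", TOY_VECTORS))
```

With these toy values, the word for "king" sits closest to the word for "queen", which is the kind of semantic neighbourhood that embedding models are meant to capture.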

GitHub

395 stars

32 watching

79 forks

Language: Jupyter Notebook

Last commit: over 5 years ago

arabic, embedded-models, gensim, nlp, text-mining, word2vec

Related projects:

Repository	Description	Stars
dfki-interactive-machine-learning/arasif	Provides sentence embeddings for Arabic using pre-trained word embeddings and the Smooth Inverse Frequency algorithm	5
alexandres/lexvec	An implementation of a word embedding model that uses character n-grams and achieves state-of-the-art results in multiple NLP tasks	803
wikipedia2vec/wikipedia2vec	A tool for learning vector representations of words and entities from Wikipedia text data.	946
galuhsahid/indonesian-word-embedding	Demonstrates word embedding in Indonesian language using pre-trained Word2vec models	20
tca19/dict2vec	A framework to learn word embeddings using lexical dictionaries	115
satwikkottur/visualword2vec	Learning word embeddings from abstract images to improve language understanding	19
vefstathiou/so_word2vec	This is a word embedding model trained on Stack Overflow posts for use in natural language processing tasks.	40
botcenter/spanish-sent2vec	This project trains a machine learning model to generate sentence embeddings from Spanish text data using the sent2vec algorithm.	4
auspicious3000/contentvec	An implementation of a self-supervised speech representation model using PyTorch and disentangled speaker embeddings	471
alexrutherford/arabic_nlp	Tools for normalizing and deriving sentiment from Arabic text	26
hassygo/charngram2vec	A repository providing a re-implementation of character n-gram embeddings for pre-training in natural language processing tasks	23
hit-scir/elmoformanylangs	Provides pre-trained ELMo representations for multiple languages to improve NLP tasks.	1,462
cod3licious/conec	A library for training and evaluating a type of word embedding model that extends the original Word2Vec algorithm	20
artetxem/vecmap	An implementation of cross-lingual word embedding mappings using unsupervised learning methods	648
botcenter/spanishwordembeddings	This project generates Spanish word embeddings using fastText on large corpora.	9