lda2vec

Topic model

A framework for creating interpretable natural language models by combining word embeddings and topic modeling.

3k stars

119 watching

627 forks

Language: Python

last commit: over 3 years ago

Related projects:

Repository	Description	Stars
ddangelov/top2vec	A Python library that provides a deep learning-based approach to topic modeling and semantic search by jointly embedding topics, documents, and words.	2,957
pair-code/lit	An interactive tool for analyzing and understanding machine learning models	3,500
vefstathiou/so_word2vec	This is a word embedding model trained on Stack Overflow posts for use in natural language processing tasks.	40
dselivanov/text2vec	An R package providing efficient tools for text analysis and natural language processing tasks.	852
wikipedia2vec/wikipedia2vec	A tool for learning vector representations of words and entities from Wikipedia text data.	946
tca19/dict2vec	A framework to learn word embeddings using lexical dictionaries	115
danieldk/go2vec	A package for reading and analyzing word embeddings from the word2vec format in Go.	56
alexandres/lexvec	An implementation of a word embedding model that uses character n-grams and achieves state-of-the-art results in multiple NLP tasks	803
ealdent/lda-ruby	A Ruby wrapper around an existing C implementation of Latent Dirichlet Allocation (LDA) for topic modeling in natural language processing.	133
dalinvip/cw2vec	A software framework for learning Chinese word embeddings with stroke n-gram information	274
ncbi-nlp/biosentvec	Pre-trained word and sentence embeddings for biomedical text analysis	578
wse-research/loris-llm-generated-representations-of-sparql-queries	Generates natural language representations of SPARQL queries for knowledge graphs	3
jalammar/ecco	An interactive visualization library for exploring and understanding transformer-based language models	1,986
primaryobjects/lda	A JavaScript library that uses Latent Dirichlet allocation to model topics in text data	292
mmihaltz/word2vec-googlenews-vectors	A repository hosting pre-trained word vector model (3 million 300-dimension English word vectors) from the Google News corpus.	519