 gensim
 gensim 
 Topic modeling library
 A Python library for topic modeling and document analysis with large corpora, providing efficient algorithms and easy integration.
Topic Modelling for Humans
16k stars
 430 watching
 4k forks
 
Language: Python 
last commit: about 1 year ago 
Linked from   5 awesome lists  
  data-miningdata-sciencedocument-similarityfasttextgensiminformation-retrievalmachine-learningnatural-language-processingneural-networknlppythontopic-modelingword-embeddingsword-similarityword2vec 
 Related projects:
| Repository | Description | Stars | 
|---|---|---|
|  | A repository of pre-trained NLP models and corpora for text processing. | 990 | 
|  | Trains a large-scale PyTorch language model on the 1-Billion Word dataset | 123 | 
|  | A large-scale language model for scientific domain training on redpajama arXiv split | 125 | 
|  | An implementation of an unsupervised topic modeling algorithm that leverages domain knowledge to generate informative topics from sparse count data. | 627 | 
|  | A lightweight Python package for reading and manipulating genomics data in the SAM/BAM format. | 790 | 
|  | An implementation of various topic modeling algorithms in Python | 369 | 
|  | Generates natural language representations of SPARQL queries for knowledge graphs | 3 | 
|  | A Python package implementing an interpretable machine learning model for text classification with visualization tools | 336 | 
|  | This project provides a set of tools and techniques to design and improve diffusion-based generative models. | 1,447 | 
|  | Provides an implementation of generalized additive models in Python for building flexible semi-parametric models | 876 | 
|  | Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing | 591 | 
|  | An unsupervised learning and generative models library for Python, focusing on probabilistic models and efficient computation. | 119 | 
|  | A versatile Python State Machine library for building flexible and scalable state-based systems | 73 | 
|  | Provides a Python interface to download and analyze GPM data from NASA's Precipitation Processing System | 60 | 
|  | A collection of algorithm implementations in Python | 195,521 |