gensim
Topic modeling library
A Python library for topic modeling and document analysis with large corpora, providing efficient algorithms and easy integration.
Topic Modelling for Humans
16k stars
430 watching
4k forks
Language: Python
last commit: 6 months ago
Linked from 5 awesome lists
data-miningdata-sciencedocument-similarityfasttextgensiminformation-retrievalmachine-learningnatural-language-processingneural-networknlppythontopic-modelingword-embeddingsword-similarityword2vec
Related projects:
Repository | Description | Stars |
---|---|---|
| A repository of pre-trained NLP models and corpora for text processing. | 990 |
| Trains a large-scale PyTorch language model on the 1-Billion Word dataset | 123 |
| A large-scale language model for scientific domain training on redpajama arXiv split | 125 |
| An implementation of an unsupervised topic modeling algorithm that leverages domain knowledge to generate informative topics from sparse count data. | 627 |
| A lightweight Python package for reading and manipulating genomics data in the SAM/BAM format. | 790 |
| An implementation of various topic modeling algorithms in Python | 369 |
| Generates natural language representations of SPARQL queries for knowledge graphs | 3 |
| A Python package implementing an interpretable machine learning model for text classification with visualization tools | 336 |
| This project provides a set of tools and techniques to design and improve diffusion-based generative models. | 1,447 |
| Provides an implementation of generalized additive models in Python for building flexible semi-parametric models | 876 |
| Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing | 591 |
| An unsupervised learning and generative models library for Python, focusing on probabilistic models and efficient computation. | 119 |
| A versatile Python State Machine library for building flexible and scalable state-based systems | 73 |
| Provides a Python interface to download and analyze GPM data from NASA's Precipitation Processing System | 60 |
| A collection of algorithm implementations in Python | 195,521 |