textcat
Text classifier
A tool for categorizing text based on its language or content
A Go package for n-gram based text categorization, with support for utf-8 and raw text
73 stars
7 watching
11 forks
Language: Go
last commit: about 1 month ago
Linked from 3 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
goodsign/libtextcat | Provides a Go interface to classify text into languages | 13 |
eaigner/shield | A flexible Bayesian text classifier with backend storage support | 158 |
xiayandi/pytorch_text_classification | An implementation of convolutional neural networks for text classification using PyTorch | 66 |
sergioburdisso/pyss3 | A Python package implementing an interpretable machine learning model for text classification with visualization tools | 336 |
cardmagic/classifier | A module for text classification using Bayesian and Latent Semantic Indexing algorithms | 661 |
2shou/textgrocery | A text classification tool based on LibLinear with support for Chinese tokenize using jieba. | 677 |
ebanalyse/senda | A tool for fine-tuning transformer models for text classification tasks | 19 |
apcode/tensorflow_fasttext | An open-source implementation of a text classification system using word embeddings and TensorFlow. | 302 |
pebbe/go-proj-4 | Provides a limited interface to the Cartographic Projections Library PROJ.4 | 44 |
quarticcat/detypify | An image classification tool designed to recognize typographical symbols | 148 |
mustafaturan/omnicat | A framework providing a generalized strategy holder for text classification | 11 |
mustafaturan/omnicat-bayes | An implementation of the Naive Bayes algorithm as a text classification strategy in Ruby. | 32 |
tpolecat/atto | A compact, incremental text parsing library for Scala that enables efficient and functional processing of structured data | 359 |
tensorflow/text | Preprocessing and processing tools for text data in machine learning models | 1,239 |
djcp/linnaeus | A Redis-backed system for classifying documents into categories based on their content. | 38 |