lemmatizer
Text reducer
A Ruby library that provides a lemmatizer for text in English.
Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy
108 stars
8 watching
15 forks
Language: Ruby
last commit: about 3 years ago
Linked from 1 awesome list
lemmatizernlprubyrubynlpwordnet
Related projects:
Repository | Description | Stars |
---|---|---|
adbar/simplemma | Lemmatization tool for natural language processing | 145 |
yohasebe/ruby-spacy | A Ruby module providing access to spaCy's natural language processing capabilities | 63 |
yohasebe/rsyntaxtree | Generates graphical syntax trees for linguistic research using Ruby | 100 |
patterns-ai-core/langchainrb | A Ruby library providing an interface to Large Language Model (LLM) providers for text generation and embedding | 1,438 |
ankane/fasttext-ruby | Efficient text classification and representation learning library for Ruby | 203 |
ankane/torchtext-ruby | A Ruby library providing data loaders and abstractions for text and NLP tasks | 34 |
tkellen/ruby-ngram | Breaks text into contiguous sequences of words or phrases | 12 |
yohasebe/engtagger | A Ruby library for assigning part-of-speech tags to English text based on a lookup dictionary and probability values. | 262 |
mbj/unparser | Converts Ruby Abstract Syntax Tree (AST) to semantically equivalent source code | 309 |
kei500/liblinear-ruby | Provides an interface to train and predict with machine learning models using LIBLINEAR | 83 |
monkeylearn/monkeylearn-ruby | Provides an official Ruby client for the MonkeyLearn API to build and consume machine learning models for language processing from Ruby apps. | 80 |
kuhumcst/cstlemma | A lemmatiser tool for multiple languages using affix rules and supervised learning from full-form dictionaries. | 35 |
burdettelamar/markdown_helper | A Ruby gem for pre-processing markdown files with file inclusion and formatting options. | 40 |
sorenlind/lemmy | Lemmatizer for Danish and Swedish languages | 75 |
flavorjones/loofah | A Ruby library that provides tools for transforming and sanitizing HTML documents and fragments | 935 |