lemmatizer

Text reducer

A Ruby library that provides a lemmatizer for English text, inspired by Python's nltk.corpus.reader.wordnet.morphy.
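To illustrate what a morphy-style lemmatizer does, here is a minimal, self-contained sketch in Ruby. It is not this gem's code or API: the class name `ToyLemmatizer`, its `lemma` method, and the tiny word and exception tables are hypothetical stand-ins for the WordNet index and exception files. The suffix rules follow the general detachment scheme used by WordNet's morphy (check irregular forms first, then strip or replace suffixes and keep the first candidate found in the dictionary).

```ruby
# A toy, morphy-style lemmatizer. The word list and exception table are
# tiny hypothetical stand-ins for WordNet's index and exception files.
class ToyLemmatizer
  # Suffix detachment rules per part of speech: [suffix, replacement].
  RULES = {
    noun: [["ses", "s"], ["xes", "x"], ["zes", "z"], ["ches", "ch"],
           ["shes", "sh"], ["men", "man"], ["ies", "y"], ["s", ""]],
    verb: [["ies", "y"], ["es", "e"], ["es", ""], ["ed", "e"], ["ed", ""],
           ["ing", "e"], ["ing", ""], ["s", ""]],
    adj:  [["er", ""], ["est", ""], ["er", "e"], ["est", "e"]]
  }.freeze

  # Irregular forms that no suffix rule can recover (illustrative only).
  EXCEPTIONS = {
    noun: { "mice" => "mouse", "children" => "child" },
    verb: { "went" => "go", "was" => "be" },
    adj:  { "better" => "good" }
  }.freeze

  # Known base forms (illustrative only).
  WORDS = {
    noun: %w[dog box city woman mouse child],
    verb: %w[hire walk go be run],
    adj:  %w[hot large good]
  }.freeze

  # Returns the base form of +form+ for the given part of speech
  # (:noun, :verb, or :adj), or the lowercased input if nothing matches.
  def lemma(form, pos)
    word = form.downcase

    # 1. Irregular forms take priority.
    irregular = EXCEPTIONS.fetch(pos)[word]
    return irregular if irregular

    # 2. The word may already be a base form.
    known = WORDS.fetch(pos)
    return word if known.include?(word)

    # 3. Try each suffix rule and accept the first candidate in the word list.
    RULES.fetch(pos).each do |suffix, replacement|
      next unless word.end_with?(suffix)

      candidate = word[0...(word.length - suffix.length)] + replacement
      return candidate if known.include?(candidate)
    end

    word
  end
end

lem = ToyLemmatizer.new
puts lem.lemma("dogs", :noun)
puts lem.lemma("hired", :verb)
puts lem.lemma("larger", :adj)
```

The dictionary check in step 3 is what distinguishes lemmatization from plain stemming: a candidate is accepted only if it is a known word, so "hired" resolves to "hire" rather than "hir".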

GitHub

108 stars
8 watching
15 forks
Language: Ruby
last commit: about 3 years ago
Linked from 1 awesome list

lemmatizer, nlp, ruby, rubynlp, wordnet

Related projects:

| Repository | Description | Stars |
|---|---|---|
| adbar/simplemma | Lemmatization tool for natural language processing | 145 |
| yohasebe/ruby-spacy | A Ruby module providing access to spaCy's natural language processing capabilities | 63 |
| yohasebe/rsyntaxtree | Generates graphical syntax trees for linguistic research using Ruby | 100 |
| patterns-ai-core/langchainrb | A Ruby library providing an interface to Large Language Model (LLM) providers for text generation and embedding | 1,438 |
| ankane/fasttext-ruby | Efficient text classification and representation learning library for Ruby | 203 |
| ankane/torchtext-ruby | A Ruby library providing data loaders and abstractions for text and NLP tasks | 34 |
| tkellen/ruby-ngram | Breaks text into contiguous sequences of words or phrases | 12 |
| yohasebe/engtagger | A Ruby library for assigning part-of-speech tags to English text based on a lookup dictionary and probability values | 262 |
| mbj/unparser | Converts Ruby Abstract Syntax Tree (AST) to semantically equivalent source code | 309 |
| kei500/liblinear-ruby | Provides an interface to train and predict with machine learning models using LIBLINEAR | 83 |
| monkeylearn/monkeylearn-ruby | Provides an official Ruby client for the MonkeyLearn API to build and consume machine learning models for language processing from Ruby apps | 80 |
| kuhumcst/cstlemma | A lemmatiser tool for multiple languages using affix rules and supervised learning from full-form dictionaries | 35 |
| burdettelamar/markdown_helper | A Ruby gem for pre-processing markdown files with file inclusion and formatting options | 40 |
| sorenlind/lemmy | Lemmatizer for Danish and Swedish languages | 75 |
| flavorjones/loofah | A Ruby library that provides tools for transforming and sanitizing HTML documents and fragments | 935 |