nlp-with-ruby
NLP toolkit
A curated collection of resources and libraries for natural language processing tasks in Ruby
Curated List: Practical Natural Language Processing done in Ruby
1k stars
58 watching
70 forks
Language: Ruby
last commit: over 2 years ago
Linked from 8 awesome lists
awesomeawesome-listcomputational-linguisticslistmachine-learningnatural-language-processingnlppos-tagrubyrubymlrubynlpsentiment-analysis
Awesome NLP with Ruby / NLP Pipeline Subtasks / Pipeline Generation | |||
| composable_operations | 47 | about 9 years ago | - Definition framework for operation pipelines |
| ruby-spark | 227 | about 8 years ago | - Spark bindings with an easy to understand DSL |
| phobos | 220 | about 2 years ago | - Simplified Ruby Client for |
| parallel | 4,173 | about 1 year ago | - Supervisor for parallel execution on multiple CPUs or in many threads |
| pwrake | 57 | almost 6 years ago | - Rake extensions to run local and remote tasks in parallel |
Awesome NLP with Ruby / NLP Pipeline Subtasks / Multipurpose Engines | |||
| open-nlp | 91 | about 11 years ago | - Ruby Bindings for the Toolkit |
| stanford-core-nlp | 433 | over 5 years ago | - Ruby Bindings for the Stanford tools |
| treat | 1,369 | over 8 years ago | - Natural Language Processing framework for Ruby (like for Python) |
| nlp_toolz | 2 | over 7 years ago | - Wrapper over some classes and the original |
| open_nlp | 11 | almost 7 years ago | - JRuby Bindings for the Toolkit |
| ruby-spacy | 63 | about 1 year ago | — Wrapper module for spaCy NLP library via |
| alchemyapi_ruby | 36 | about 9 years ago | - Legacy Ruby SDK for AlchemyAPI/Bluemix |
| wit-ruby | 280 | over 3 years ago | - Ruby client library for the Language Understanding Platform |
| wlapi | 19 | over 2 years ago | Ruby client library for web services |
| monkeylearn-ruby | 80 | over 4 years ago | Sentiment Analysis, Topic Modelling, Language Detection, Named Entity Recognition via a Ruby based Web API client |
| google-cloud-language | 1,357 | 11 months ago | - Google's Natural Language service API for Ruby |
Awesome NLP with Ruby / NLP Pipeline Subtasks / Language Identification | |||
| scylla | 36 | over 2 years ago | - Language Categorization and Identification |
Awesome NLP with Ruby / NLP Pipeline Subtasks / Segmentation | |||
| tokenizer | 46 | over 8 years ago | - Simple multilingual tokenizer |
| pragmatic_tokenizer | 90 | about 1 year ago | - Multilingual tokenizer to split a string into tokens |
| nlp-pure | 20 | over 8 years ago | - Natural language processing algorithms implemented in pure Ruby with minimal dependencies |
| textoken | 31 | about 4 years ago | - Simple and customizable text tokenization library |
| pragmatic_segmenter | 559 | about 1 year ago | - Word Boundary Disambiguation with many cookies |
| punkt-segmenter | 92 | over 7 years ago | - Pure Ruby implementation of the Punkt Segmenter |
| tactful_tokenizer | 80 | over 11 years ago | - RegExp based tokenizer for different languages |
| scapel | 51 | almost 10 years ago | - Sentence Boundary Disambiguation tool |
Awesome NLP with Ruby / NLP Pipeline Subtasks / Lexical Processing | |||
| ruby-stemmer | 251 | over 3 years ago | - Ruby-Stemmer exposes the SnowBall API to Ruby |
| uea-stemmer | 53 | almost 3 years ago | - Conservative stemmer for search and indexing |
| lemmatizer | 108 | about 4 years ago | - WordNet based Lemmatizer for English texts |
| wc | 6 | almost 14 years ago | - Facilities to count word occurrences in a text |
| word_count | 5 | over 6 years ago | - Word counter for and objects |
| words_counted | 159 | about 4 years ago | - Pure Ruby library counting word statistics with different custom options |
| stopwords-filter | 77 | almost 2 years ago | Filter and Stop Word Lexicon based on the SnowBall lemmatizer |
Awesome NLP with Ruby / NLP Pipeline Subtasks / Phrasal Level Processing | |||
| n_gram | 37 | about 4 years ago | - N-Gram generator |
| ruby-ngram | 12 | almost 12 years ago | - Break words and phrases into ngrams |
| raingrams | 69 | over 4 years ago | - Flexible and general-purpose ngrams library written in pure Ruby |
Awesome NLP with Ruby / NLP Pipeline Subtasks / Syntactic Processing | |||
| stanfordparser | - Ruby based wrapper for the Stanford Parser | ||
| rley | 37 | over 3 years ago | - Pure Ruby implementation of the Parsing Algorithm for Context-Free Constituency Grammars |
| rsyntaxtree | 102 | about 1 year ago | - Visualization for syntactic trees in Ruby based on |
Awesome NLP with Ruby / NLP Pipeline Subtasks / Semantic Analysis | |||
| amatch | 378 | about 1 year ago | - Set of five distance types between strings (including Levenshtein, Sellers, Jaro-Winkler, 'pair distance') |
| damerau-levenshtein | 145 | about 1 year ago | - Calculates edit distance using the Damerau-Levenshtein algorithm |
| hotwater | 80 | over 12 years ago | - Fast Ruby FFI string edit distance algorithms |
| levenshtein-ffi | 150 | over 1 year ago | - Fast string edit distance computation, using the Damerau-Levenshtein algorithm |
| tf_idf | 35 | over 13 years ago | - Term Frequency / Inverse Document Frequency in pure Ruby |
| tf-idf-similarity | 749 | over 1 year ago | - Calculate the similarity between texts using TF/IDF |
Awesome NLP with Ruby / NLP Pipeline Subtasks / Pragmatical Analysis | |||
| SentimentLib | 14 | almost 13 years ago | - Simple extensible sentiment analysis gem |
Awesome NLP with Ruby / High Level Tasks / Spelling and Error Correction | |||
| gingerice | 480 | about 6 years ago | - Spelling and Grammar corrections via the API |
| hunspell-i18n | 4 | about 13 years ago | - Ruby bindings to the standard Spell Checker |
| ffi-hunspell | 48 | almost 2 years ago | - FFI based Ruby bindings for |
| hunspell | 35 | 12 months ago | - Ruby bindings to via Ruby C API |
Awesome NLP with Ruby / High Level Tasks / Text Alignment | |||
| alignment | 1 | over 11 years ago | - Alignment routines for bilingual texts (Gale-Church implementation) |
Awesome NLP with Ruby / High Level Tasks / Machine Translation | |||
| google-api-client | 2,812 | 11 months ago | - Google API Ruby Client |
| microsoft_translator | 21 | over 8 years ago | - Ruby client for the microsoft translator API |
| termit | 508 | over 8 years ago | - Google Translate with speech synthesis in your terminal |
| zipf | 3 | over 9 years ago | - implementation of BLEU and other base algorithms |
Awesome NLP with Ruby / High Level Tasks / Sentiment Analysis | |||
| stimmung | 20 | over 9 years ago | - Semantic Polarity based on the lexicon |
Awesome NLP with Ruby / High Level Tasks / Numbers, Dates, and Time Parsing | |||
| chronic | 3,243 | about 2 years ago | - Pure Ruby natural language date parser |
| chronic_between | 28 | over 4 years ago | - Simple Ruby natural language parser for date and time ranges |
| chronic_duration | 352 | 11 months ago | - Pure Ruby parser for elapsed time |
| kronic | 150 | about 11 years ago | - Methods for parsing and formatting human readable dates |
| nickel | 112 | about 8 years ago | - Extracts date, time, and message information from naturally worded text |
| tickle | 82 | about 5 years ago | - Parser for recurring and repeating events |
| numerizer | 38 | almost 3 years ago | - Ruby parser for English number expressions |
Awesome NLP with Ruby / High Level Tasks / Named Entity Recognition | |||
| ruby-ner | 19 | almost 3 years ago | - Named Entity Recognition with Stanford NER and Ruby |
| ruby-nlp | 92 | over 11 years ago | - Ruby Binding for Stanford Pos-Tagger and Name Entity Recognizer |
Awesome NLP with Ruby / High Level Tasks / Text-to-Speech-to-Text | |||
| espeak-ruby | 194 | about 1 year ago | - Small Ruby API for utilizing 'espeak' and 'lame' to create text-to-speech mp3 files |
| tts | 93 | about 3 years ago | - Text-to-Speech conversion using the Google translate service |
| att_speech | 20 | over 11 years ago | - Ruby wrapper over the AT&T Speech API for speech to text |
| pocketsphinx-ruby | 258 | over 8 years ago | - Pocketsphinx bindings |
Awesome NLP with Ruby / Dialog Agents, Assistants, and Chatbots | |||
| chatterbot | 491 | over 2 years ago | - Straightforward ruby-based Twitter Bot Framework, using OAuth to authenticate |
| lita | 1,678 | over 3 years ago | - Highly extensible chat operation bot framework written with persistent storage on |
Awesome NLP with Ruby / Linguistic Resources | |||
| rwordnet | 89 | about 6 years ago | - Pure Ruby self contained API library for the |
| wordnet | 138 | over 2 years ago | - Performance tuned bindings for the |
Awesome NLP with Ruby / Machine Learning Libraries | |||
| rb-libsvm | 278 | almost 2 years ago | - Support Vector Machines with Ruby |
| weka | 65 | almost 2 years ago | - JRuby bindings for Weka, different ML algorithms implemented through Weka |
| decisiontree | 1,438 | about 7 years ago | - Decision Tree ID3 Algorithm in pure Ruby |
| rtimbl | 5 | about 16 years ago | - Memory based learners from the Timbl framework |
| classifier-reborn | 554 | over 1 year ago | - General classifier module to allow Bayesian and other types of classifications |
| lda-ruby | 133 | about 5 years ago | - Ruby implementation of the (Latent Dirichlet Allocation) for automatic Topic Modelling and Document Clustering |
| liblinear-ruby-swig | 83 | over 2 years ago | - Ruby interface to LIBLINEAR (much more efficient than LIBSVM for text classification) |
| linnaeus | 38 | almost 10 years ago | - Redis-backed Bayesian classifier |
| maxent_string_classifier | 9 | over 16 years ago | - JRuby maximum entropy classifier for string data, based on the OpenNLP Maxent framework |
| naive_bayes | 49 | almost 14 years ago | - Simple Naive Bayes classifier |
| nbayes | 153 | over 1 year ago | - Full-featured, Ruby implementation of Naive Bayes |
| omnicat | 11 | almost 5 years ago | - Generalized rack framework for text classifications |
| omnicat-bayes | 32 | almost 5 years ago | - Naive Bayes text classification implementation as an OmniCat classifier strategy |
| ruby-fann | 498 | over 1 year ago | - Ruby bindings to the |
| rblearn | 2 | over 9 years ago | Feature Extraction and Crossvalidation library |
Awesome NLP with Ruby / Optical Character Recognition | |||
| tesseract-ocr | 629 | over 8 years ago | - FFI based wrapper over the |
Awesome NLP with Ruby / Text Extraction | |||
| yomu | 498 | over 2 years ago | - library for extracting text and metadata from files and documents using the content analysis toolkit |
Awesome NLP with Ruby / Full Text Search, Information Retrieval, Indexing | |||
| rsolr | 421 | over 1 year ago | - Ruby and Rails client library for |
| sunspot | 2,980 | over 1 year ago | - Rails centric client for |
| thinking-sphinx | 1,627 | over 1 year ago | - plugin for using in (not only) Rails based projects |
| elasticsearch | 1,971 | 11 months ago | - Ruby client and API for |
| elasticsearch-rails | 3,078 | about 1 year ago | - Ruby and Rails integrations for |
| google-api-client | 2,812 | 11 months ago | - Ruby API library for services |
Awesome NLP with Ruby / Language Aware String Manipulation | |||
| fuzzy_match | 676 | over 4 years ago | - Fuzzy string comparison with Distance measures and Regular Expression |
| fuzzy-string-match | 285 | over 5 years ago | - Fuzzy string matching library for Ruby |
| active_support | 56,200 | 11 months ago | - RoR gem has various string extensions that can handle case |
| fuzzy_tools | 23 | about 8 years ago | - Toolset for fuzzy searches in Ruby tuned for accuracy |
| u | - U extends Ruby’s Unicode support | ||
| unicode | 80 | over 1 year ago | - Unicode normalization library |
| CommonRegexRuby | 79 | almost 4 years ago | - Find a lot of kinds of common information in a string |
| regexp-examples | 520 | over 1 year ago | - Generate strings that match a given regular expression |
| verbal_expressions | 571 | almost 3 years ago | - Make difficult regular expressions easy |
| translit_kit | 7 | almost 3 years ago | - Transliterate Hebrew & Yiddish text into Latin characters |
| re2 | 130 | 11 months ago | - hight-speed Regular Expression library for Text Mining and Text Extraction |
| regex_sample | 1 | over 8 years ago | - sample string generation from a given Regular Expression |
| iuliia | 10 | almost 4 years ago | — transliteration Cyrillic to Latin in many possible ways (defined by the ) |
Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2019 | |||
| aonemd | by | ||
Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2018 | |||
| Cassandra Corrales | by | ||
Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2017 | |||
| Aja Hammerly | by | ||
| Aja Hammerly | by | ||
| Prasun Anand | by | ||
| Starr Horne | by | ||
Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2016 | |||
| Ardian Haxha | by | ||
| Geoffrey Litt | by | ||
| Kei Sawada | by | ||
| Eric Weinstein | by | ||
Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2015 | |||
| Jesus Castello | by | ||
| Lorenzo Masini | by | ||
| Rick Carlino | by | ||
| Kevin Dias | by | ||
| Lorenzo Masini | by | ||
Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2014 | |||
| Glauco Custódio | by | ||
| Todd Schneider | by | ||
| Konstantin Tennhard | by | ||
Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2013 | |||
| Tom Cartwright | by | ||
| Brandon Black | by | ||
| Nathan Kleyn | by | ||
| Robert Qualls | by | ||
Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2012 | |||
| Vasily Vasinov | by | ||
Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2011 | |||
| Benoit Hamelin | by | ||
| Colin Drake | by | ||
Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2010 | |||
| Mike Perham | by | ||
Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2009 | |||
| Jason Adams | by | ||
| Jason Adams | by | ||
Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2008 | |||
| Ilya Grigorik | by | ||
| Gleicon Moraes | by | ||
Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2007 | |||
| Ilya Grigorik | by | ||
Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2006 | |||
| Michael Granger | by | ||
Awesome NLP with Ruby / Projects and Code Examples | |||
| Going the Distance | 61 | about 9 years ago | - Implementations of various distance algorithms with example calculations |
| Named entity recognition with Stanford NER and Ruby | 19 | almost 3 years ago | - NER Examples in Ruby and Java with some |
| Words Counted | - examples of customizable word statistics powered by | ||
| RSyntaxTree | - Web based demonstration of the syntactic tree visualization | ||
Awesome NLP with Ruby / Books | |||
| Miller, Rob | . Pragmatic Programmers, 2015 | ||
| Watson, Mark | . APRESS, 2010 | ||
| Watson, Mark | . Lulu, 2010 | ||
Awesome NLP with Ruby / Community | |||
| Stack Overflow | |||
Awesome NLP with Ruby / Needs your Help! | |||
| ferret | 279 | almost 3 years ago | - Information Retrieval in C and Ruby |
| summarize | 204 | over 13 years ago | - Ruby native wrapper for |
Awesome NLP with Ruby / Related Resources | |||
| Neural Machine Translation Implementations | 359 | over 3 years ago | |
| Awesome Ruby | 13,629 | 11 months ago | - Among other awesome items a short list of NLP related projects |
| Ruby NLP | 1,272 | over 2 years ago | - State-of-Art collection of Ruby libraries for NLP |
| Speech and Natural Language Processing | 2,206 | over 6 years ago | - General List of NLP related resources (mostly not for Ruby programmers) |
| Scientific Ruby | - Linear Algebra, Visualization and Scientific Computing for Ruby | ||
| iRuby | 902 | 11 months ago | IRuby kernel for Jupyter (formelly IPython) |
| Awesome OCR | 2,843 | over 1 year ago | - Multitude of OCR (Optical Character Recognition) resources |
| Awesome TensorFlow | 17,213 | about 1 year ago | - Machine Learning with TensorFlow libraries |
More related projects:
-
cmusphinx/pocketsphinx-ruby
-
web64/norwegian-nlp-resources
-
influitive/apartment
-
mattjohnsonpint/timezoneconverter
-
rails-api/rails-api
-
giuse/dne
-
ffi/ffi
-
explosion/spacy
-
janko/sequel-activerecord_connection
-
bensheldon/good_job
-
richpeck/exception_handler
-
comfy/comfy-bootstrap-form
-
coderwall/coderwall-legacy
-
redditsota/state-of-the-art-result-for-machine-learning-problems