 nlp-with-ruby
 NLP toolkit
 A curated collection of resources and libraries for natural language processing tasks in Ruby
Curated List: Practical Natural Language Processing done in Ruby
1k stars
 58 watching
 70 forks
 
Language: Ruby 
last commit: over 2 years ago 
Linked from   8 awesome lists  
awesome, awesome-list, computational-linguistics, list, machine-learning, natural-language-processing, nlp, pos-tag, ruby, rubyml, rubynlp, sentiment-analysis
 | Awesome NLP with Ruby / NLP Pipeline Subtasks / Pipeline Generation | |||
| composable_operations | 47 | about 9 years ago | - Definition framework for operation pipelines | 
| ruby-spark | 227 | about 8 years ago | - Spark bindings with an easy to understand DSL | 
| phobos | 220 | about 2 years ago | - Simplified Ruby Client for | 
| parallel | 4,173 | about 1 year ago | - Supervisor for parallel execution on multiple CPUs or in many threads | 
| pwrake | 57 | almost 6 years ago | - Rake extensions to run local and remote tasks in parallel | 
| Awesome NLP with Ruby / NLP Pipeline Subtasks / Multipurpose Engines | |||
| open-nlp | 91 | about 11 years ago | - Ruby Bindings for the Toolkit | 
| stanford-core-nlp | 433 | over 5 years ago | - Ruby Bindings for the Stanford tools | 
| treat | 1,369 | over 8 years ago | - Natural Language Processing framework for Ruby (like for Python) | 
| nlp_toolz | 2 | over 7 years ago | - Wrapper over some classes and the original | 
| open_nlp | 11 | almost 7 years ago | - JRuby Bindings for the Toolkit | 
| ruby-spacy | 63 | about 1 year ago | - Wrapper module for spaCy NLP library via | 
| alchemyapi_ruby | 36 | about 9 years ago | - Legacy Ruby SDK for AlchemyAPI/Bluemix | 
| wit-ruby | 280 | over 3 years ago | - Ruby client library for the Language Understanding Platform | 
| wlapi | 19 | over 2 years ago | - Ruby client library for web services | 
| monkeylearn-ruby | 80 | over 4 years ago | - Sentiment Analysis, Topic Modelling, Language Detection, Named Entity Recognition via a Ruby based Web API client | 
| google-cloud-language | 1,357 | 11 months ago | - Google's Natural Language service API for Ruby | 
| Awesome NLP with Ruby / NLP Pipeline Subtasks / Language Identification | |||
| scylla | 36 | over 2 years ago | - Language Categorization and Identification | 
| Awesome NLP with Ruby / NLP Pipeline Subtasks / Segmentation | |||
| tokenizer | 46 | over 8 years ago | - Simple multilingual tokenizer | 
| pragmatic_tokenizer | 90 | about 1 year ago | - Multilingual tokenizer to split a string into tokens | 
| nlp-pure | 20 | over 8 years ago | - Natural language processing algorithms implemented in pure Ruby with minimal dependencies | 
| textoken | 31 | about 4 years ago | - Simple and customizable text tokenization library | 
| pragmatic_segmenter | 559 | about 1 year ago | - Word Boundary Disambiguation with many cookies | 
| punkt-segmenter | 92 | over 7 years ago | - Pure Ruby implementation of the Punkt Segmenter | 
| tactful_tokenizer | 80 | over 11 years ago | - RegExp based tokenizer for different languages | 
| scalpel | 51 | almost 10 years ago | - Sentence Boundary Disambiguation tool | 
| Awesome NLP with Ruby / NLP Pipeline Subtasks / Lexical Processing | |||
| ruby-stemmer | 251 | over 3 years ago | - Ruby-Stemmer exposes the SnowBall API to Ruby | 
| uea-stemmer | 53 | almost 3 years ago | - Conservative stemmer for search and indexing | 
| lemmatizer | 108 | about 4 years ago | - WordNet based Lemmatizer for English texts | 
| wc | 6 | almost 14 years ago | - Facilities to count word occurrences in a text | 
| word_count | 5 | over 6 years ago | - Word counter for and objects | 
| words_counted | 159 | about 4 years ago | - Pure Ruby library counting word statistics with different custom options | 
| stopwords-filter | 77 | almost 2 years ago | - Filter and Stop Word Lexicon based on the SnowBall lemmatizer | 
| Awesome NLP with Ruby / NLP Pipeline Subtasks / Phrasal Level Processing | |||
| n_gram | 37 | about 4 years ago | - N-Gram generator | 
| ruby-ngram | 12 | almost 12 years ago | - Break words and phrases into ngrams | 
| raingrams | 69 | over 4 years ago | - Flexible and general-purpose ngrams library written in pure Ruby | 
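
The n-gram libraries above all build on the same idea: slide a window of size n over a sequence of tokens. As a rough illustration only, here is a minimal sketch in plain Ruby. The `ngrams` helper is hypothetical and not the API of any gem listed here; it relies only on `Enumerable#each_cons` from the standard library.

```ruby
# Hypothetical helper (not taken from any gem in this list): returns the
# contiguous n-grams of an array of tokens.
def ngrams(tokens, n)
  # each_cons raises for n < 1, so guard against invalid sizes up front.
  return [] if n < 1 || tokens.length < n

  tokens.each_cons(n).to_a
end

# Word bigrams
words = "the quick brown fox".split
word_bigrams = ngrams(words, 2)
# => [["the", "quick"], ["quick", "brown"], ["brown", "fox"]]

# Character trigrams of a single word
char_trigrams = ngrams("ruby".chars, 3).map(&:join)
# => ["rub", "uby"]
```

The listed gems may add padding, frequency counting, or probability models on top of this; check each project's documentation for its actual interface.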
| Awesome NLP with Ruby / NLP Pipeline Subtasks / Syntactic Processing | |||
| stanfordparser | | | - Ruby based wrapper for the Stanford Parser | 
| rley | 37 | over 3 years ago | - Pure Ruby implementation of the Parsing Algorithm for Context-Free Constituency Grammars | 
| rsyntaxtree | 102 | about 1 year ago | - Visualization for syntactic trees in Ruby based on | 
| Awesome NLP with Ruby / NLP Pipeline Subtasks / Semantic Analysis | |||
| amatch | 378 | about 1 year ago | - Set of five distance types between strings (including Levenshtein, Sellers, Jaro-Winkler, 'pair distance') | 
| damerau-levenshtein | 145 | about 1 year ago | - Calculates edit distance using the Damerau-Levenshtein algorithm | 
| hotwater | 80 | over 12 years ago | - Fast Ruby FFI string edit distance algorithms | 
| levenshtein-ffi | 150 | over 1 year ago | - Fast string edit distance computation, using the Damerau-Levenshtein algorithm | 
| tf_idf | 35 | over 13 years ago | - Term Frequency / Inverse Document Frequency in pure Ruby | 
| tf-idf-similarity | 749 | over 1 year ago | - Calculate the similarity between texts using TF/IDF | 
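
Several of the string-distance gems above compute some variant of edit distance. To make the idea concrete, here is a minimal sketch of the plain Levenshtein distance (insertions, deletions, and substitutions only; it does not handle the transpositions that Damerau-Levenshtein adds). The `levenshtein` method is a hypothetical name for illustration and is not the API of any gem listed here.

```ruby
# Hypothetical pure-Ruby Levenshtein distance, keeping only two rows of the
# dynamic-programming table in memory.
def levenshtein(a, b)
  return b.length if a.empty?
  return a.length if b.empty?

  # prev[j] holds the distance between the first i-1 chars of a and the
  # first j chars of b.
  prev = (0..b.length).to_a

  a.each_char.with_index(1) do |char_a, i|
    curr = [i] # distance from the first i chars of a to the empty string
    b.each_char.with_index(1) do |char_b, j|
      cost = char_a == char_b ? 0 : 1
      curr << [
        prev[j] + 1,       # deletion
        curr[j - 1] + 1,   # insertion
        prev[j - 1] + cost # substitution (or match)
      ].min
    end
    prev = curr
  end

  prev.last
end

levenshtein("kitten", "sitting") # => 3
levenshtein("flaw", "lawn")      # => 2
```

The FFI and C-backed gems in the list exist largely because this quadratic loop is slow in pure Ruby on long strings.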
| Awesome NLP with Ruby / NLP Pipeline Subtasks / Pragmatical Analysis | |||
| SentimentLib | 14 | almost 13 years ago | - Simple extensible sentiment analysis gem | 
| Awesome NLP with Ruby / High Level Tasks / Spelling and Error Correction | |||
| gingerice | 480 | about 6 years ago | - Spelling and Grammar corrections via the API | 
| hunspell-i18n | 4 | about 13 years ago | - Ruby bindings to the standard Spell Checker | 
| ffi-hunspell | 48 | almost 2 years ago | - FFI based Ruby bindings for | 
| hunspell | 35 | 11 months ago | - Ruby bindings to via Ruby C API | 
| Awesome NLP with Ruby / High Level Tasks / Text Alignment | |||
| alignment | 1 | over 11 years ago | - Alignment routines for bilingual texts (Gale-Church implementation) | 
| Awesome NLP with Ruby / High Level Tasks / Machine Translation | |||
| google-api-client | 2,812 | 11 months ago | - Google API Ruby Client | 
| microsoft_translator | 21 | over 8 years ago | - Ruby client for the Microsoft Translator API | 
| termit | 508 | over 8 years ago | - Google Translate with speech synthesis in your terminal | 
| zipf | 3 | over 9 years ago | - implementation of BLEU and other base algorithms | 
| Awesome NLP with Ruby / High Level Tasks / Sentiment Analysis | |||
| stimmung | 20 | over 9 years ago | - Semantic Polarity based on the lexicon | 
| Awesome NLP with Ruby / High Level Tasks / Numbers, Dates, and Time Parsing | |||
| chronic | 3,243 | about 2 years ago | - Pure Ruby natural language date parser | 
| chronic_between | 28 | over 4 years ago | - Simple Ruby natural language parser for date and time ranges | 
| chronic_duration | 352 | 11 months ago | - Pure Ruby parser for elapsed time | 
| kronic | 150 | about 11 years ago | - Methods for parsing and formatting human readable dates | 
| nickel | 112 | almost 8 years ago | - Extracts date, time, and message information from naturally worded text | 
| tickle | 82 | about 5 years ago | - Parser for recurring and repeating events | 
| numerizer | 38 | over 2 years ago | - Ruby parser for English number expressions | 
| Awesome NLP with Ruby / High Level Tasks / Named Entity Recognition | |||
| ruby-ner | 19 | almost 3 years ago | - Named Entity Recognition with Stanford NER and Ruby | 
| ruby-nlp | 92 | about 11 years ago | - Ruby Binding for Stanford Pos-Tagger and Named Entity Recognizer | 
| Awesome NLP with Ruby / High Level Tasks / Text-to-Speech-to-Text | |||
| espeak-ruby | 194 | about 1 year ago | - Small Ruby API for utilizing 'espeak' and 'lame' to create text-to-speech mp3 files | 
| tts | 93 | about 3 years ago | - Text-to-Speech conversion using the Google translate service | 
| att_speech | 20 | over 11 years ago | - Ruby wrapper over the AT&T Speech API for speech to text | 
| pocketsphinx-ruby | 258 | over 8 years ago | - Pocketsphinx bindings | 
| Awesome NLP with Ruby / Dialog Agents, Assistants, and Chatbots | |||
| chatterbot | 491 | over 2 years ago | - Straightforward Ruby-based Twitter Bot Framework, using OAuth to authenticate | 
| lita | 1,678 | over 3 years ago | - Highly extensible chat operation bot framework written with persistent storage on | 
| Awesome NLP with Ruby / Linguistic Resources | |||
| rwordnet | 89 | about 6 years ago | - Pure Ruby self contained API library for the | 
| wordnet | 138 | over 2 years ago | - Performance tuned bindings for the | 
| Awesome NLP with Ruby / Machine Learning Libraries | |||
| rb-libsvm | 278 | almost 2 years ago | - Support Vector Machines with Ruby | 
| weka | 65 | almost 2 years ago | - JRuby bindings for Weka, different ML algorithms implemented through Weka | 
| decisiontree | 1,438 | almost 7 years ago | - Decision Tree ID3 Algorithm in pure Ruby | 
| rtimbl | 5 | about 16 years ago | - Memory based learners from the Timbl framework | 
| classifier-reborn | 554 | over 1 year ago | - General classifier module to allow Bayesian and other types of classifications | 
| lda-ruby | 133 | about 5 years ago | - Ruby implementation of LDA (Latent Dirichlet Allocation) for automatic Topic Modelling and Document Clustering | 
| liblinear-ruby-swig | 83 | over 2 years ago | - Ruby interface to LIBLINEAR (much more efficient than LIBSVM for text classification) | 
| linnaeus | 38 | almost 10 years ago | - Redis-backed Bayesian classifier | 
| maxent_string_classifier | 9 | over 16 years ago | - JRuby maximum entropy classifier for string data, based on the OpenNLP Maxent framework | 
| naive_bayes | 49 | almost 14 years ago | - Simple Naive Bayes classifier | 
| nbayes | 153 | over 1 year ago | - Full-featured, Ruby implementation of Naive Bayes | 
| omnicat | 11 | almost 5 years ago | - Generalized rack framework for text classifications | 
| omnicat-bayes | 32 | almost 5 years ago | - Naive Bayes text classification implementation as an OmniCat classifier strategy | 
| ruby-fann | 498 | over 1 year ago | - Ruby bindings to the | 
| rblearn | 2 | about 9 years ago | - Feature Extraction and Crossvalidation library | 
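
Many of the classifiers above are Naive Bayes implementations. For readers new to the technique, here is a minimal sketch of a multinomial Naive Bayes text classifier with add-one (Laplace) smoothing. The `TinyNaiveBayes` class, its method names, and the regex tokenizer are all assumptions made for illustration; none of them reflects the API of any gem listed here.

```ruby
# Hypothetical toy classifier: multinomial Naive Bayes with add-one smoothing.
class TinyNaiveBayes
  def initialize
    @word_counts = Hash.new { |h, k| h[k] = Hash.new(0) } # label => word => count
    @doc_counts  = Hash.new(0)                            # label => number of documents
    @vocabulary  = {}                                     # set of all words seen
  end

  def train(label, text)
    @doc_counts[label] += 1
    tokenize(text).each do |word|
      @word_counts[label][word] += 1
      @vocabulary[word] = true
    end
  end

  # Returns the label with the highest log-probability, or nil if untrained.
  def classify(text)
    total_docs = @doc_counts.values.sum.to_f
    tokens = tokenize(text)

    scores = @doc_counts.keys.map do |label|
      total_words = @word_counts[label].values.sum
      log_prob = Math.log(@doc_counts[label] / total_docs) # class prior
      tokens.each do |word|
        count = @word_counts[label][word]
        # Add-one smoothing keeps unseen words from zeroing the probability.
        log_prob += Math.log((count + 1.0) / (total_words + @vocabulary.size))
      end
      [label, log_prob]
    end

    scores.max_by { |_label, score| score }&.first
  end

  private

  # Naive tokenizer (an assumption): lowercase ASCII words only.
  def tokenize(text)
    text.downcase.scan(/[a-z']+/)
  end
end

classifier = TinyNaiveBayes.new
classifier.train(:positive, "great fantastic love")
classifier.train(:negative, "terrible awful hate")
classifier.classify("I love this great movie") # => :positive
```

The gems in this section typically add persistence, better tokenization, and stop-word handling, so prefer them over a sketch like this for real work.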
| Awesome NLP with Ruby / Optical Character Recognition | |||
| tesseract-ocr | 629 | over 8 years ago | - FFI based wrapper over the | 
| Awesome NLP with Ruby / Text Extraction | |||
| yomu | 498 | over 2 years ago | - library for extracting text and metadata from files and documents using the content analysis toolkit | 
| Awesome NLP with Ruby / Full Text Search, Information Retrieval, Indexing | |||
| rsolr | 421 | over 1 year ago | - Ruby and Rails client library for | 
| sunspot | 2,980 | over 1 year ago | - Rails centric client for | 
| thinking-sphinx | 1,627 | over 1 year ago | - plugin for using in (not only) Rails based projects | 
| elasticsearch | 1,971 | 11 months ago | - Ruby client and API for | 
| elasticsearch-rails | 3,078 | about 1 year ago | - Ruby and Rails integrations for | 
| google-api-client | 2,812 | 11 months ago | - Ruby API library for services | 
| Awesome NLP with Ruby / Language Aware String Manipulation | |||
| fuzzy_match | 676 | over 4 years ago | - Fuzzy string comparison with Distance measures and Regular Expression | 
| fuzzy-string-match | 285 | over 5 years ago | - Fuzzy string matching library for Ruby | 
| active_support | 56,200 | 11 months ago | - RoR gem has various string extensions that can handle case | 
| fuzzy_tools | 23 | about 8 years ago | - Toolset for fuzzy searches in Ruby tuned for accuracy | 
| u | | | - U extends Ruby’s Unicode support | 
| unicode | 80 | over 1 year ago | - Unicode normalization library | 
| CommonRegexRuby | 79 | almost 4 years ago | - Find many kinds of common information in a string | 
| regexp-examples | 520 | over 1 year ago | - Generate strings that match a given regular expression | 
| verbal_expressions | 571 | almost 3 years ago | - Make difficult regular expressions easy | 
| translit_kit | 7 | almost 3 years ago | - Transliterate Hebrew & Yiddish text into Latin characters | 
| re2 | 130 | 11 months ago | - High-speed Regular Expression library for Text Mining and Text Extraction | 
| regex_sample | 1 | about 8 years ago | - Sample string generation from a given Regular Expression | 
| iuliia | 10 | almost 4 years ago | - Transliteration of Cyrillic to Latin in many possible ways (defined by the ) | 
| Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2019 | |||
| aonemd | by | ||
| Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2018 | |||
| Cassandra Corrales | by | ||
| Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2017 | |||
| Aja Hammerly | by | ||
| Aja Hammerly | by | ||
| Prasun Anand | by | ||
| Starr Horne | by | ||
| Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2016 | |||
| Ardian Haxha | by | ||
| Geoffrey Litt | by | ||
| Kei Sawada | by | ||
| Eric Weinstein | by | ||
| Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2015 | |||
| Jesus Castello | by | ||
| Lorenzo Masini | by | ||
| Rick Carlino | by | ||
| Kevin Dias | by | ||
| Lorenzo Masini | by | ||
| Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2014 | |||
| Glauco Custódio | by | ||
| Todd Schneider | by | ||
| Konstantin Tennhard | by | ||
| Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2013 | |||
| Tom Cartwright | by | ||
| Brandon Black | by | ||
| Nathan Kleyn | by | ||
| Robert Qualls | by | ||
| Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2012 | |||
| Vasily Vasinov | by | ||
| Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2011 | |||
| Benoit Hamelin | by | ||
| Colin Drake | by | ||
| Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2010 | |||
| Mike Perham | by | ||
| Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2009 | |||
| Jason Adams | by | ||
| Jason Adams | by | ||
| Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2008 | |||
| Ilya Grigorik | by | ||
| Gleicon Moraes | by | ||
| Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2007 | |||
| Ilya Grigorik | by | ||
| Awesome NLP with Ruby / Articles, Posts, Talks, and Presentations / 2006 | |||
| Michael Granger | by | ||
| Awesome NLP with Ruby / Projects and Code Examples | |||
| Going the Distance | 61 | about 9 years ago | - Implementations of various distance algorithms with example calculations | 
| Named entity recognition with Stanford NER and Ruby | 19 | almost 3 years ago | - NER Examples in Ruby and Java with some | 
| Words Counted | - examples of customizable word statistics powered by | ||
| RSyntaxTree | - Web based demonstration of the syntactic tree visualization | ||
| Awesome NLP with Ruby / Books | |||
| Miller, Rob | . Pragmatic Programmers, 2015 | ||
| Watson, Mark | . APRESS, 2010 | ||
| Watson, Mark | . Lulu, 2010 | ||
| Awesome NLP with Ruby / Community | |||
| Stack Overflow | |||
| Awesome NLP with Ruby / Needs your Help! | |||
| ferret | 279 | over 2 years ago | - Information Retrieval in C and Ruby | 
| summarize | 204 | over 13 years ago | - Ruby native wrapper for | 
| Awesome NLP with Ruby / Related Resources | |||
| Neural Machine Translation Implementations | 359 | over 3 years ago | |
| Awesome Ruby | 13,629 | 11 months ago | - Among other awesome items a short list of NLP related projects | 
| Ruby NLP | 1,272 | over 2 years ago | - State-of-the-art collection of Ruby libraries for NLP | 
| Speech and Natural Language Processing | 2,206 | over 6 years ago | - General List of NLP related resources (mostly not for Ruby programmers) | 
| Scientific Ruby | - Linear Algebra, Visualization and Scientific Computing for Ruby | ||
| iRuby | 902 | 11 months ago | - IRuby kernel for Jupyter (formerly IPython) | 
| Awesome OCR | 2,843 | over 1 year ago | - Multitude of OCR (Optical Character Recognition) resources | 
| Awesome TensorFlow | 17,213 | about 1 year ago | - Machine Learning with TensorFlow libraries | 