stopwords-filter

Text cleaner

A Ruby library that removes common words (stopwords) from text before further processing.

Project for filtering stopwords
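
To illustrate what stopword filtering means, here is a minimal sketch in plain Ruby. It does not use this library's API: the `remove_stopwords` helper, the tiny `STOPWORDS` list, and the tokenization rule are all assumptions made for illustration only.

```ruby
require "set"

# Hypothetical, deliberately tiny stopword list; real lists are much longer.
STOPWORDS = Set.new(%w[a an and by in of the to]).freeze

# Returns the tokens of `text` that are not in `stopwords`.
# Tokenization is an assumed, simple pass: lowercase the text, then keep
# runs of ASCII letters and apostrophes.
def remove_stopwords(text, stopwords = STOPWORDS)
  text.downcase.scan(/[a-z']+/).reject { |token| stopwords.include?(token) }
end

puts remove_stopwords("The Hitchhiker's Guide to the Galaxy by Douglas Adams").join(" ")
# prints "hitchhiker's guide galaxy douglas adams"
```

A `Set` is used so that each membership check is a hash lookup rather than a scan over the word list, which matters once the list grows to the hundreds of entries that real stopword lists usually contain.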

GitHub

77 stars
4 watching
51 forks
Language: Ruby
Last commit: 12 months ago
Linked from 1 awesome list


Related projects:

| Repository | Description | Stars |
|---|---|---|
| brez/stopwords | A collection of pre-defined words commonly ignored in text processing | 12 |
| kevinjalbert/pronto-dirty_words | A tool that flags inappropriate content in text using pre-defined lists of dirty words | 3 |
| tkellen/ruby-ngram | Breaks text into contiguous sequences of words or phrases | 12 |
| juliasilge/tidytext | Provides tools and data to convert text into tidy data formats for natural language processing tasks | 1,180 |
| syntax-tree/unist-util-filter | A utility to remove unwanted nodes from a tree data structure | 18 |
| yohasebe/lemmatizer | A Ruby library that provides a lemmatizer for English text | 108 |
| janlelis/productive-sublime-snippets-ruby | A collection of reusable code snippets for Ruby developers in Sublime Text | 107 |
| ankane/torchtext-ruby | A Ruby library providing data loaders and abstractions for text and NLP tasks | 34 |
| nelstrom/vim-textobj-rubyblock | A Vim plugin for selecting Ruby blocks | 331 |
| 6/stopwords-json | A collection of stopword lists in JSON format for various languages | 424 |
| databasecleaner/database_cleaner-redis | A tool for cleaning up data in Redis databases | 4 |
| ruby/did_you_mean | A gem that suggests corrections for typos and errors in Ruby code | 1,872 |
| postmodern/wordlist.rb | A Ruby library and CLI for managing wordlists by reading, manipulating, and combining text data | 46 |
| ankane/fasttext-ruby | An efficient text classification and representation learning library for Ruby | 203 |
| henry-sarabia/blank | A package that removes whitespace from strings and detects blank strings | 12 |