stopwords-filter
Text cleaner
A Ruby library that removes common words from text before processing
Project for filtering stopwords
77 stars
4 watching
51 forks
Language: Ruby
last commit: 12 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
brez/stopwords | A collection of pre-defined words commonly ignored in text processing | 12 |
kevinjalbert/pronto-dirty_words | A tool to flag out inappropriate content from text using pre-defined lists of dirty words. | 3 |
tkellen/ruby-ngram | Breaks text into contiguous sequences of words or phrases | 12 |
juliasilge/tidytext | Provides tools and data to convert text into tidy data formats for natural language processing tasks | 1,180 |
syntax-tree/unist-util-filter | Utility to remove unwanted nodes from a tree data structure | 18 |
yohasebe/lemmatizer | A Ruby library that provides a lemmatizer for text in English. | 108 |
janlelis/productive-sublime-snippets-ruby | A collection of reusable code snippets for Ruby developers in Sublime Text | 107 |
ankane/torchtext-ruby | A Ruby library providing data loaders and abstractions for text and NLP tasks | 34 |
nelstrom/vim-textobj-rubyblock | A Vim plugin for selecting Ruby blocks | 331 |
6/stopwords-json | A collection of stopword lists in JSON format for various languages. | 424 |
databasecleaner/database_cleaner-redis | A tool for cleaning up data in Redis databases | 4 |
ruby/did_you_mean | A gem that suggests corrections for typos and errors in Ruby code | 1,872 |
postmodern/wordlist.rb | A Ruby library and CLI for managing wordlists by reading, manipulating, and combining text data | 46 |
ankane/fasttext-ruby | Efficient text classification and representation learning library for Ruby | 203 |
henry-sarabia/blank | A package that removes whitespace from strings and detects blank strings. | 12 |