scalpel
Sentence splitter
A Ruby library that uses a simple rule-based approach to segment sentences into individual words or phrases.
A fast and accurate rule-based sentence segmentation tool for Ruby.
51 stars
8 watching
5 forks
Language: Ruby
last commit: almost 9 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
tkellen/ruby-ngram | Breaks text into contiguous sequences of words or phrases | 12 |
louismullie/open-nlp | Ruby bindings to a Java machine learning toolkit for natural language processing | 91 |
louismullie/treat | A Ruby-based toolkit for natural language processing and computational linguistics | 1,366 |
apohllo/srx-english | A Ruby library containing English sentence and word segmentation rules based on the SRX standard. | 18 |
lfcipriani/punkt-segmenter | An implementation of a sentence boundary detection algorithm in Ruby. | 92 |
louismullie/stanford-core-nlp | Provides Ruby bindings to Stanford Core NLP tools for natural language processing tasks | 432 |
louismullie/graph-rank | An implementation of PageRank and TextRank algorithms in Ruby. | 75 |
orling/grapheme-splitter | A library that breaks strings into their individual user-perceived characters | 928 |
diasks2/pragmatic_segmenter | A rule-based sentence boundary detection gem that works across many languages | 551 |
yohasebe/lemmatizer | A Ruby library that provides a lemmatizer for text in English. | 108 |
ruricolist/cl-shlex | A lexical analyzer for shell-like syntaxes with support for quoting and commenting | 8 |
wansmer/treesj | Plugin for splitting and joining blocks of code in text editors | 1,046 |
tomz/liblinear-ruby-swig | A Ruby interface to a high-performance machine learning library for large-scale text classification and other linear classifications | 83 |
bennypowers/splitjoin.nvim | A tool for manipulating list-like syntax constructs in text editors | 76 |
fnl/segtok | Provides tools for splitting text into sentences and words | 170 |