scalpel

Sentence splitter

A Ruby library that uses a simple rule-based approach to segment sentences into individual words or phrases.

A fast and accurate rule-based sentence segmentation tool for Ruby.

GitHub

51 stars
8 watching
5 forks
Language: Ruby
last commit: almost 9 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
tkellen/ruby-ngram Breaks text into contiguous sequences of words or phrases 12
louismullie/open-nlp Ruby bindings to a Java machine learning toolkit for natural language processing 91
louismullie/treat A Ruby-based toolkit for natural language processing and computational linguistics 1,366
apohllo/srx-english A Ruby library containing English sentence and word segmentation rules based on the SRX standard. 18
lfcipriani/punkt-segmenter An implementation of a sentence boundary detection algorithm in Ruby. 92
louismullie/stanford-core-nlp Provides Ruby bindings to Stanford Core NLP tools for natural language processing tasks 432
louismullie/graph-rank An implementation of PageRank and TextRank algorithms in Ruby. 75
orling/grapheme-splitter A library that breaks strings into their individual user-perceived characters 928
diasks2/pragmatic_segmenter A rule-based sentence boundary detection gem that works across many languages 551
yohasebe/lemmatizer A Ruby library that provides a lemmatizer for text in English. 108
ruricolist/cl-shlex A lexical analyzer for shell-like syntaxes with support for quoting and commenting 8
wansmer/treesj Plugin for splitting and joining blocks of code in text editors 1,046
tomz/liblinear-ruby-swig A Ruby interface to a high-performance machine learning library for large-scale text classification and other linear classifications 83
bennypowers/splitjoin.nvim A tool for manipulating list-like syntax constructs in text editors 76
fnl/segtok Provides tools for splitting text into sentences and words 170