srx-english

Sentence Segmentation Library

A Ruby library containing English sentence and word segmentation rules based on the SRX standard.

English sentence segmentation rules based on SRX standard.

GitHub

18 stars
3 watching
1 forks
Language: Ruby
last commit: about 12 years ago

Related projects:

Repository Description Stars
diasks2/pragmatic_segmenter A rule-based sentence boundary detection gem that works across many languages 551
louismullie/scalpel A Ruby library that uses a simple rule-based approach to segment sentences into individual words or phrases. 51
apohllo/poliqarpr A Ruby client for accessing and querying a large text corpus server 6
apohllo/string_pl Improves case conversion and comparison in Ruby 1.9 for non-English languages 10
tkellen/ruby-ngram Breaks text into contiguous sequences of words or phrases 12
lfcipriani/punkt-segmenter An implementation of a sentence boundary detection algorithm in Ruby. 92
ruricolist/cl-shlex A lexical analyzer for shell-like syntaxes with support for quoting and commenting 8
jackiezhangdx/instancesegmentationlist Compiles and organizes state-of-the-art instance segmentation papers and resources 88
talyssonoc/commonregexruby Extracts common information from text strings in various formats 79
roddyyaga/ppx_rapper A syntax extension for writing SQL queries in OCaml with type inference and syntax checking. 137
saaadhu/ramble A Ruby library for generating text based on a given grammar 4
henchc/syllabipy Provides algorithms for breaking down words into their constituent syllables. 44
machinalis/yalign Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation 127
prql/tree-sitter-prql A grammar definition for a programming language syntax 7
rizo/ppx_monad Provides a syntax extension for monadic computations in OCaml. 7