srx-english
Sentence Segmentation Library
A Ruby library containing English sentence and word segmentation rules based on the SRX standard.
English sentence segmentation rules based on the SRX standard.
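
To illustrate the general idea behind SRX-style segmentation, here is a minimal Ruby sketch. It is not the srx-english API and does not use the library's actual rule set: `Rule`, `RULES`, and `segment` are hypothetical names invented for this example. It assumes the usual SRX model, in which each rule pairs a "before break" pattern with an "after break" pattern, and the first rule matching at a position decides whether a break happens there.

```ruby
# Hypothetical SRX-style rule: a pattern for the text before a candidate
# boundary, a pattern for the text after it, and a flag saying whether
# the rule produces a break (true) or suppresses one (false).
Rule = Struct.new(:before, :after, :break_here)

# Illustrative rules only; the real srx-english rule set is far larger.
RULES = [
  # Exception rule: no break after a few common abbreviations.
  Rule.new(/\b(?:Mr|Mrs|Dr|Prof|e\.g|i\.e)\.\z/, /\A\s/, false),
  # Break rule: sentence-final punctuation, whitespace, then an uppercase letter.
  Rule.new(/[.?!]\z/, /\A\s+[A-Z]/, true)
].freeze

# Split text into sentences. At every position the first matching rule wins,
# so exception rules must be listed before the break rules they override.
def segment(text, rules = RULES)
  sentences = []
  start = 0
  (1...text.length).each do |pos|
    before = text[0...pos]
    after = text[pos..-1]
    rule = rules.find { |r| before =~ r.before && after =~ r.after }
    next unless rule && rule.break_here

    sentences << text[start...pos].strip
    start = pos
  end
  sentences << text[start..-1].strip
  sentences.reject(&:empty?)
end
```

Calling `segment("Dr. Smith arrived. He sat down.")` under these rules keeps "Dr." attached to its sentence, because the exception rule is consulted before the break rule. The sketch rescans the text at every position, which is fine for illustration but too slow for long inputs.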
18 stars
3 watching
1 fork
Language: Ruby
Last commit: about 12 years ago
Related projects:
Repository | Description | Stars |
---|---|---|
diasks2/pragmatic_segmenter | A rule-based sentence boundary detection gem that works across many languages | 551 |
louismullie/scalpel | A Ruby library that uses a simple rule-based approach to segment text into sentences. | 51 |
apohllo/poliqarpr | A Ruby client for accessing and querying a large text corpus server | 6 |
apohllo/string_pl | Improves case conversion and comparison in Ruby 1.9 for non-English languages | 10 |
tkellen/ruby-ngram | Breaks text into contiguous sequences of words or phrases | 12 |
lfcipriani/punkt-segmenter | An implementation of a sentence boundary detection algorithm in Ruby. | 92 |
ruricolist/cl-shlex | A lexical analyzer for shell-like syntaxes with support for quoting and commenting | 8 |
jackiezhangdx/instancesegmentationlist | Compiles and organizes state-of-the-art instance segmentation papers and resources | 88 |
talyssonoc/commonregexruby | Extracts common information from text strings in various formats | 79 |
roddyyaga/ppx_rapper | A syntax extension for writing SQL queries in OCaml with type inference and syntax checking. | 137 |
saaadhu/ramble | A Ruby library for generating text based on a given grammar | 4 |
henchc/syllabipy | Provides algorithms for breaking down words into their constituent syllables. | 44 |
machinalis/yalign | Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation | 127 |
prql/tree-sitter-prql | A grammar definition for a programming language syntax | 7 |
rizo/ppx_monad | Provides a syntax extension for monadic computations in OCaml. | 7 |