 srx-english
 srx-english 
 sentence segmenter
 A Ruby library providing sentence segmentation rules based on the SRX standard for English language text processing.
English sentence segmentation rules based on SRX standard.
18 stars
 3 watching
 1 forks
 
Language: Ruby 
last commit: almost 13 years ago  Related projects:
| Repository | Description | Stars | 
|---|---|---|
|  | A rule-based sentence boundary detection gem that works across many languages | 559 | 
|  | A Ruby library that uses a simple rule-based approach to segment sentences into individual words or phrases. | 51 | 
|  | A Ruby client for accessing and querying a large text corpus server | 6 | 
|  | Provides custom case handling and comparison functionality for Polish strings in Ruby 1.9 | 10 | 
|  | Breaks text into contiguous sequences of words or phrases | 12 | 
|  | A Ruby port of the NLTK algorithm to detect sentence boundaries in unstructured text | 92 | 
|  | A lexical analyzer for shell-like syntaxes with support for quoting and commenting | 8 | 
|  | Compiles and organizes state-of-the-art instance segmentation papers and resources | 88 | 
|  | Extracts common information from text strings in various formats | 79 | 
|  | A syntax extension for writing SQL queries in OCaml with type inference and syntax checking. | 138 | 
|  | A Ruby library for generating text based on a given grammar | 4 | 
|  | Provides algorithms for breaking down words into their constituent syllables. | 44 | 
|  | Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation | 127 | 
|  | A grammar definition for a programming language syntax | 8 | 
|  | Provides a syntax extension for monadic computations in OCaml. | 7 |