srx-english
sentence segmenter
A Ruby library providing sentence segmentation rules based on the SRX standard for English language text processing.
English sentence segmentation rules based on SRX standard.
18 stars
3 watching
1 forks
Language: Ruby
last commit: over 12 years ago Related projects:
Repository | Description | Stars |
---|---|---|
| A rule-based sentence boundary detection gem that works across many languages | 559 |
| A Ruby library that uses a simple rule-based approach to segment sentences into individual words or phrases. | 51 |
| A Ruby client for accessing and querying a large text corpus server | 6 |
| Provides custom case handling and comparison functionality for Polish strings in Ruby 1.9 | 10 |
| Breaks text into contiguous sequences of words or phrases | 12 |
| A Ruby port of the NLTK algorithm to detect sentence boundaries in unstructured text | 92 |
| A lexical analyzer for shell-like syntaxes with support for quoting and commenting | 8 |
| Compiles and organizes state-of-the-art instance segmentation papers and resources | 88 |
| Extracts common information from text strings in various formats | 79 |
| A syntax extension for writing SQL queries in OCaml with type inference and syntax checking. | 138 |
| A Ruby library for generating text based on a given grammar | 4 |
| Provides algorithms for breaking down words into their constituent syllables. | 44 |
| Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation | 127 |
| A grammar definition for a programming language syntax | 8 |
| Provides a syntax extension for monadic computations in OCaml. | 7 |