pragmatic_segmenter
Sentence segmenter
A rule-based sentence boundary detection gem that works across many languages
Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.
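To illustrate what "rule-based" sentence boundary detection means, here is a minimal toy sketch in plain Ruby. It is not the gem's implementation or API: the `naive_segment` method and the `ABBREVIATIONS` list are hypothetical names invented for this example, and the single rule shown (do not split after a known abbreviation) is far simpler than what a real multilingual segmenter needs.

```ruby
# Toy illustration only: a hypothetical helper, not part of pragmatic_segmenter.
# Assumed rule: a token ending in ., ! or ? closes a sentence, unless the token
# is a known abbreviation followed by a period.
ABBREVIATIONS = %w[mr mrs ms dr prof st vs etc].freeze

def naive_segment(text)
  sentences = []
  current = +""                      # unfrozen buffer for the sentence in progress

  text.split.each do |token|         # split on whitespace, ignoring leading/trailing
    current << " " unless current.empty?
    current << token

    # Only tokens ending in terminal punctuation (optionally followed by a
    # closing quote or bracket) can end a sentence.
    next unless token.match?(/[.!?]["')\]]*\z/)

    # Strip the terminal punctuation and check the abbreviation list.
    word = token.sub(/[.!?]["')\]]*\z/, "").downcase
    next if token.end_with?(".") && ABBREVIATIONS.include?(word)

    sentences << current
    current = +""
  end

  sentences << current unless current.empty?   # keep any trailing fragment
  sentences
end

puts naive_segment("Hello world. My name is Mr. Smith. I live in New York.")
```

A hand-written rule like this needs no training data, which is the main appeal of the rule-based approach; the trade-off is that every edge case (initials, decimals, ellipses, language-specific punctuation) needs its own rule.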
559 stars
16 watching
54 forks
Language: Ruby
Last commit: 6 months ago
Linked from 2 awesome lists
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A Ruby port of the NLTK algorithm to detect sentence boundaries in unstructured text | 92 |
| | A C# implementation of sentence boundary detection using a rule-based approach | 33 |
| | A multilingual tokenizer to split strings into tokens, handling various language and formatting nuances | 90 |
| | A Python package for out-of-the-box sentence boundary detection using rule-based algorithms | 821 |
| | A Ruby library providing sentence segmentation rules based on the SRX standard for English language text processing | 18 |
| | Breaks text into contiguous sequences of words or phrases | 12 |
| | A tool that highlights errors in user input to help improve English language skills | 43 |
| | A Ruby library that uses a simple rule-based approach to segment sentences into individual words or phrases | 51 |
| | A Ruby port of a Japanese text tokenization algorithm | 21 |
| | A library providing an implementation of various metrics for object segmentation and saliency detection in computer vision | 150 |
| | An analyzer tool to account for variations in word count calculations | 20 |
| | A tool for automatically detecting sentence boundaries in natural language text using machine learning and handcrafted features | 90 |
| | An open-source software package for probabilistic cell segmentation in spatial transcriptomics | 46 |
| | An implementation of a lightweight semantic segmentation model with real-time performance capabilities | 252 |
| | A tool for unsupervised and semi-supervised morphological segmentation in text data | 186 |