morfessor
segmenter
A tool for unsupervised and semi-supervised morphological segmentation in text data
Morfessor is a tool for unsupervised and semi-supervised morphological segmentation
186 stars
23 watching
29 forks
Language: Python
last commit: over 4 years ago
Linked from 1 awesome list
pythonsegmentationsubword-segmentationsubword-units
Related projects:
Repository | Description | Stars |
---|---|---|
| An NLTK-based parser that provides morphological annotation for languages using KR-style annotations. | 4 |
| A rule-based sentence boundary detection gem that works across many languages | 559 |
| Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation | 127 |
| Provides tools for splitting text into sentences and words | 171 |
| Provides PyTorch implementations of various models and pipelines for semantic segmentation in deep learning. | 1,729 |
| A PyTorch implementation of semantic segmentation models with support for multiprocessing training and various backbones. | 1,347 |
| Monorepo implementing PyTorch-based neural network architecture for image segmentation | 1,787 |
| A Python wrapper around a Java library for segmenting Thai text into individual words | 3 |
| A tokenizer for segmenting words into morphological components | 27 |
| Lemmatization tool for natural language processing | 146 |
| A Ruby library providing sentence segmentation rules based on the SRX standard for English language text processing. | 18 |
| A Ruby port of the NLTK algorithm to detect sentence boundaries in unstructured text | 92 |
| A Japanese morphological analyzer that splits words into grammatical components and segments phrases for efficient text processing | 833 |
| A Ruby library that uses a simple rule-based approach to segment sentences into individual words or phrases. | 51 |
| A tool for automatically detecting sentence boundaries in natural language text using machine learning and handcrafted features. | 90 |