PythonLexTo

Thai word segmenter

A Python wrapper around a Java library for segmenting Thai text into individual words
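
To make "segmenting Thai text into individual words" concrete, here is a minimal sketch of dictionary-based greedy longest matching, one common approach for scripts written without spaces between words. It is an illustration only, not taken from LexTo or from this wrapper: the `segment` function and the tiny word set are hypothetical, and a real segmenter relies on a large dictionary and handles ambiguity and unknown words far more carefully.

```python
def segment(text, dictionary):
    """Greedy longest-match segmentation against a small word set.

    Hypothetical helper for illustration; not part of PythonLexTo.
    """
    max_len = max((len(w) for w in dictionary), default=1)
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking until a dictionary hit.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if candidate in dictionary:
                break
        else:
            # No dictionary entry starts here: emit a single character.
            candidate = text[i]
        words.append(candidate)
        i += len(candidate)
    return words


# Toy dictionary (an assumption for this example): "I", "eat", "rice".
toy_dictionary = {"ฉัน", "กิน", "ข้าว"}
result = segment("ฉันกินข้าว", toy_dictionary)
```

With this toy dictionary, `result` holds the three words of the sentence in order. Greedy matching is simple but can pick the wrong split when a longer dictionary word overlaps the intended boundary, which is one reason the neural tools listed under related projects exist.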

LexTo (Thai word segmenter) with a Python wrapper

GitHub

3 stars
3 watching
7 forks
Language: Java
Last commit: about 8 years ago

Related projects:

Repository | Description | Stars
c4n/pythonlexto | A Python wrapper around the Thai word segmenter LexTo, allowing developers to easily integrate it into their applications. | 1
pucktada/cutkum | A tool for segmenting Thai text into words using Recurrent Neural Networks in TensorFlow. | 154
pureexe/cutthai | A tool for Thai word segmentation using a combination of data structures and algorithms. | 5
tchayintr/best2010_cooker | Extracts segmented words from the Thai BEST2010 corpus. | 2
pythainlp/lexicon-thai | A Thai language corpus and lexicon repository for natural language processing. | 141
krakenai/synthai | A deep learning-based project for segmenting Thai text into words and annotating parts of speech with high accuracy. | 41
pythainlp/pythainlp | A Python package for text processing and linguistic analysis focused on the Thai language. | 987
lantip/latintojavanese | A Python script and Flask application for converting Latin characters to the Javanese alphabet. | 31
jagerv3/sentiment_analysis_thai | Analyzes sentiment in Thai text using machine learning algorithms and natural language processing techniques. | 12
rkcosmos/deepcut | A Thai word tokenization library using a deep neural network. | 420
lextal/pspnet-pytorch | A PyTorch implementation of a segmentation network architecture. | 585
reokatoa/koreksyon | Tools for developing and implementing spell-checking and grammar-checking capabilities in low-resource languages. | 3
kobkrit/tf-nlp-thai-word-embedding | An implementation of a word embedding technique using TensorFlow for Thai language processing. | 11
har07/pysastrawi | A Python port of an Indonesian stemmer library, reducing inflected words to their base form. | 336
fnl/segtok | Provides tools for splitting text into sentences and words. | 170