jtcc

TCC tool

A Java library to tokenize Thai text into groups of characters

Java library to tokenize Thai text into a list of TCCs

GitHub

18 stars
3 watching
8 forks
Language: Java
last commit: over 7 years ago
Linked from 2 awesome lists

javanatural-language-processingthai-nlp

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
uakihir0/jtw A Java library providing a simple API to interact with the Twitter v2 API 7
rkcosmos/deepcut A Thai word tokenization library using Deep Neural Network 420
c4n/pythonlexto A Python wrapper around the Thai word segmentator LexTo, allowing developers to easily integrate it into their applications. 1
skynav/ttt A collection of tools and conversion utilities for the W3C Timed Text Markup Language (TTML) 74
tyczj/tweedle An Android library providing access to the Twitter v2 API via Kotlin Coroutines 39
pythainlp/lexicon-thai A Thai language corpus and lexicon repository for natural language processing 141
tchayintr/thbert A pre-trained BERT model designed to facilitate NLP research and development with limited Thai language resources 6
mgilangjanuar/katanyagomoku A Java-based desktop game implementation of Tic Tac Toe with features such as online play and cloud storage, allowing users to customize the game experience. 1
sinemetu1/twitc A C library providing an interface to interact with the Twitter OAuth API 24
redouane59/twittered A Java library for interacting with the Twitter API 238
ahgamut/tcl A powerful platform for creating integration applications and GUIs. 8
languagemachines/ucto A tokeniser for natural language text that separates words from punctuation and supports basic preprocessing steps such as case changing 65
ta4j/ta4j A Java library for technical analysis of financial markets 2,076
pytorch/tnt A collection of tools and utilities for building and training neural networks with PyTorch. 1,666
tcatm/bitcoin-js-remote A JavaScript client-side interface to interact with a remote Bitcoin wallet 57