jieba-jruby

Chinese tokenizer library

Provides a Ruby port of the popular Chinese language processing library Jieba

jieba-analysis(结巴分词) for jRuby

8 stars

3 watching

1 forks

Language: Ruby

last commit: almost 12 years ago

Related projects:

Repository	Description	Stars
6/tiny_segmenter	A Ruby port of a Japanese text tokenization algorithm	21
paulgoetze/weka-jruby	Provides a JRuby wrapper around the Weka Java library for machine learning and data mining in Ruby.	65
jruby/jruby-debug	Provides a JRuby-based backend for Ruby Debugger	15
452896915/jieba-android	An Android implementation of the Chinese word segmentation algorithm jieba, optimized for fast initialization and tokenization	153
jedld/brains-jruby	An implementation of a feedforward neural network toolkit for JRuby	60
sciruby/iruby	A Ruby-based kernel for interactive computing environments like Jupyter Notebooks	902
xujiajun/gotokenizer	A tokenizer based on dictionary and Bigram language models for text segmentation in Chinese	21
arbox/tokenizer	A Ruby-based library for splitting written text into tokens for natural language processing tasks.	46
rinmyo/ruby-typ	A Typst programming language implementation of Ruby.	20
citahub/cita-sdk-ruby	A Ruby library providing a standardized interface to interact with the CITA network.	3
monkeylearn/monkeylearn-ruby	Provides an official Ruby client for the MonkeyLearn API to build and consume machine learning models for language processing from Ruby apps.	80
abitdodgy/words_counted	A Ruby library that tokenizes input and provides various statistical measures about the tokens	159
mizor/machine-learning-ruby	A Ruby implementation of common machine learning algorithms and techniques	13
gengo/gengo-ruby	A Ruby library to interact with the Gengo API for translation and management tasks	21
ruby-amqp/march_hare	A JRuby client for RabbitMQ messaging system	96