jieba-jruby

Chinese tokenizer library

Provides a Ruby port of the popular Chinese language processing library Jieba

jieba-analysis(结巴分词) for jRuby

GitHub

8 stars
3 watching
1 forks
Language: Ruby
last commit: about 10 years ago

Related projects:

Repository Description Stars
6/tiny_segmenter A Ruby port of a Japanese text tokenization algorithm 21
paulgoetze/weka-jruby Provides a JRuby wrapper around the Weka Java library for machine learning and data mining in Ruby. 65
jruby/jruby-debug Provides a JRuby-based backend for Ruby Debugger 15
452896915/jieba-android An Android implementation of the Chinese word segmentation algorithm jieba, optimized for fast initialization and tokenization 152
jedld/brains-jruby An implementation of a feedforward neural network toolkit for JRuby 60
sciruby/iruby A Ruby-based kernel for interactive computing environments like Jupyter Notebooks 901
xujiajun/gotokenizer A tokenizer based on dictionary and Bigram language models for text segmentation in Chinese 21
arbox/tokenizer A Ruby-based library for splitting written text into tokens for natural language processing tasks. 46
rinmyo/ruby-typ A Typst programming language implementation of Ruby. 20
citahub/cita-sdk-ruby A Ruby library providing a standardized interface to interact with the CITA network. 3
monkeylearn/monkeylearn-ruby Provides an official Ruby client for the MonkeyLearn API to build and consume machine learning models for language processing from Ruby apps. 80
abitdodgy/words_counted A Ruby library that tokenizes input and provides various statistical measures about the tokens 159
mizor/machine-learning-ruby A Ruby implementation of common machine learning algorithms and techniques 13
gengo/gengo-ruby A Ruby library to interact with the Gengo API for translation and management tasks 21
ruby-amqp/march_hare A JRuby client for RabbitMQ messaging system 97