jieba-jruby
Chinese tokenizer library
Provides a Ruby port of the popular Chinese language processing library Jieba
jieba-analysis(结巴分词) for jRuby
8 stars
3 watching
1 forks
Language: Ruby
last commit: about 10 years ago Related projects:
Repository | Description | Stars |
---|---|---|
6/tiny_segmenter | A Ruby port of a Japanese text tokenization algorithm | 21 |
paulgoetze/weka-jruby | Provides a JRuby wrapper around the Weka Java library for machine learning and data mining in Ruby. | 65 |
jruby/jruby-debug | Provides a JRuby-based backend for Ruby Debugger | 15 |
452896915/jieba-android | An Android implementation of the Chinese word segmentation algorithm jieba, optimized for fast initialization and tokenization | 152 |
jedld/brains-jruby | An implementation of a feedforward neural network toolkit for JRuby | 60 |
sciruby/iruby | A Ruby-based kernel for interactive computing environments like Jupyter Notebooks | 901 |
xujiajun/gotokenizer | A tokenizer based on dictionary and Bigram language models for text segmentation in Chinese | 21 |
arbox/tokenizer | A Ruby-based library for splitting written text into tokens for natural language processing tasks. | 46 |
rinmyo/ruby-typ | A Typst programming language implementation of Ruby. | 20 |
citahub/cita-sdk-ruby | A Ruby library providing a standardized interface to interact with the CITA network. | 3 |
monkeylearn/monkeylearn-ruby | Provides an official Ruby client for the MonkeyLearn API to build and consume machine learning models for language processing from Ruby apps. | 80 |
abitdodgy/words_counted | A Ruby library that tokenizes input and provides various statistical measures about the tokens | 159 |
mizor/machine-learning-ruby | A Ruby implementation of common machine learning algorithms and techniques | 13 |
gengo/gengo-ruby | A Ruby library to interact with the Gengo API for translation and management tasks | 21 |
ruby-amqp/march_hare | A JRuby client for RabbitMQ messaging system | 97 |