gojieba
Chinese Word Segmentation
Provides a Go implementation of Chinese word segmentation algorithms
"结巴"中文分词的Golang版本
2k stars
67 watching
303 forks
Language: Go
last commit: about 1 month ago
Linked from 2 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
fudannlp/fnlp | A toolkit for Chinese natural language processing tasks | 2,648 |
ibm/max-chinese-phonetic-similarity-estimator | Estimates phonetic similarity between Chinese words and suggests similar-sounding candidates | 35 |
cluebenchmark/cluecorpus2020 | A large-scale Chinese corpus for pre-training language models. | 927 |
burgyn/mmlib.tostring | Automatically generates a ToString method for classes with a custom attribute | 18 |
cluebenchmark/cluepretrainedmodels | Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. | 806 |
soloice/chinese-character-recognition | This project demonstrates how to build and train a convolutional neural network (CNN) to recognize Chinese characters. | 200 |
wenyan-lang/wenyan | A programming language designed to resemble ancient Chinese grammar and syntax, compiling to JavaScript or other languages. | 19,790 |
hongshenghu/membership-inference-machine-learning-literature | A curated collection of papers on membership inference attacks and defenses in machine learning models. | 296 |
huangzworks/real-world-haskell-cn | Translation of an influential Haskell book into Chinese. | 1,562 |
fangyidong/json-simple | A simple toolkit for encoding and decoding JSON text in Java | 748 |
android-cn/android-jobs | A comprehensive list of Android job openings in China | 2,200 |
hkust-knowcomp/jwe | This is a software project that trains and evaluates word embeddings for Chinese words, characters, and fine-grained subcharacter components. | 99 |
clue-ai/chatyuan | Large language model for dialogue support in multiple languages | 1,903 |
embedding/chinese-word-vectors | Provides pre-trained vectors with various properties for downstream tasks in natural language processing | 11,874 |
isnowfy/snownlp | A Python library for processing and analyzing Chinese text | 6,454 |