snownlp
Chinese Text Processor
A Python library for processing and analyzing Chinese text
Python library for processing Chinese text
6k stars
350 watching
1k forks
Language: Python
last commit: almost 5 years ago
Linked from 6 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
fudannlp/fnlp | A toolkit for Chinese natural language processing tasks | 2,647 |
ymcui/chinese-bert-wwm | Develops and publishes pre-trained Chinese language models using Whole Word Masking technology. | 9,687 |
wenyan-lang/wenyan | A programming language designed to resemble ancient Chinese grammar and syntax, compiling to JavaScript or other languages. | 19,719 |
skyworkaigc/skytext-chinese-gpt3 | An AI-powered text generation model trained on Chinese data to perform various tasks such as conversation, translation, and content creation. | 419 |
lc1332/luotuo-chinese-llm | A large language model based on the Chinese LLaMA architecture, designed to support complex conversations and applications. | 3,637 |
esbatmop/mnbvc | Collects and provides access to a vast corpus of Chinese text data from various sources | 3,520 |
exacity/deeplearningbook-chinese | Translation of a popular deep learning book into Chinese, aiming to improve accuracy and accessibility. | 35,804 |
datawhalechina/llm-cookbook | A comprehensive guide to building applications with Large Language Models (LLMs) for developers | 11,929 |
thudm/glm | A general-purpose language model pre-trained with an autoregressive blank-filling objective and designed for various natural language understanding and generation tasks. | 3,199 |
shannonai/chinesebert | A deep learning model that incorporates visual and phonetic features of Chinese characters to improve its ability to understand Chinese language nuances | 542 |
cluebenchmark/cluepretrainedmodels | Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. | 804 |
bbfamily/abu | A lightweight, extensible, and embeddable implementation of the Lua Virtual Machine (LVM) with support for bytecode manipulation and execution. | 12,119 |
openai/gpt-2 | A repository providing code and models for research into language modeling and multitask learning | 22,559 |
clue-ai/promptclue | A pre-trained language model for multiple natural language processing tasks with support for few-shot learning and transfer learning. | 654 |
morizeyao/gpt2-chinese | Training code for Chinese versions of the GPT2 language model using BERT tokenizer or BPE model. | 7,467 |