snownlp

Chinese Text Processor

A Python library for processing and analyzing Chinese text

GitHub

6k stars
350 watching
1k forks
Language: Python
last commit: almost 5 years ago
Linked from 6 awesome lists


Related projects:

Repository | Description | Stars
fudannlp/fnlp | A toolkit for Chinese natural language processing tasks | 2,647
ymcui/chinese-bert-wwm | Develops and publishes pre-trained Chinese language models using Whole Word Masking | 9,687
wenyan-lang/wenyan | A programming language designed to resemble ancient Chinese grammar and syntax, compiling to JavaScript or other languages | 19,719
skyworkaigc/skytext-chinese-gpt3 | A text generation model trained on Chinese data for tasks such as conversation, translation, and content creation | 419
lc1332/luotuo-chinese-llm | A large language model based on the Chinese LLaMA architecture, designed to support complex conversations and applications | 3,637
esbatmop/mnbvc | Collects and provides access to a vast corpus of Chinese text data from various sources | 3,520
exacity/deeplearningbook-chinese | Translation of a popular deep learning book into Chinese, aiming to improve accuracy and accessibility | 35,804
datawhalechina/llm-cookbook | A comprehensive guide for developers to building applications with Large Language Models (LLMs) | 11,929
thudm/glm | A general-purpose language model pre-trained with an autoregressive blank-filling objective, designed for natural language understanding and generation tasks | 3,199
shannonai/chinesebert | A deep learning model that incorporates visual and phonetic features of Chinese characters to better capture nuances of the language | 542
cluebenchmark/cluepretrainedmodels | Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models | 804
bbfamily/abu | A Python-based quantitative trading framework covering stocks, futures, options, and Bitcoin, with machine learning support | 12,119
openai/gpt-2 | Code and models for research into language modeling and multitask learning | 22,559
clue-ai/promptclue | A pre-trained language model for multiple natural language processing tasks with support for few-shot learning and transfer learning | 654
morizeyao/gpt2-chinese | Training code for Chinese versions of the GPT2 language model using a BERT tokenizer or BPE model | 7,467