snownlp

Chinese Text Processor

A Python library for processing and analyzing Chinese text

Python library for processing Chinese text

GitHub

6k stars
350 watching
1k forks
Language: Python
last commit: almost 5 years ago
Linked from 6 awesome lists


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
fudannlp/fnlp A toolkit for Chinese natural language processing tasks 2,648
ymcui/chinese-bert-wwm Develops and publishes pre-trained Chinese language models using Whole Word Masking technology. 9,746
wenyan-lang/wenyan A programming language designed to resemble ancient Chinese grammar and syntax, compiling to JavaScript or other languages. 19,790
skyworkaigc/skytext-chinese-gpt3 An AI-powered text generation model trained on Chinese data to perform various tasks such as conversation, translation, and content creation. 418
lc1332/luotuo-chinese-llm A large language model based on the Chinese LLaMA architecture, designed to support complex conversations and applications. 3,641
esbatmop/mnbvc A massive corpus of Chinese text data covering various forms and styles 3,581
exacity/deeplearningbook-chinese Translation of a popular deep learning book into Chinese, aiming to improve accuracy and accessibility. 35,890
datawhalechina/llm-cookbook A comprehensive guide to building applications with Large Language Models (LLMs) for developers 12,377
thudm/glm A general-purpose language model pre-trained with an autoregressive blank-filling objective and designed for various natural language understanding and generation tasks. 3,207
shannonai/chinesebert A deep learning model that incorporates visual and phonetic features of Chinese characters to improve its ability to understand Chinese language nuances 545
cluebenchmark/cluepretrainedmodels Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. 806
bbfamily/abu A modular C++ framework for building and optimizing family relationships models with complex interactions between individuals. 12,349
openai/gpt-2 A repository providing code and models for research into language modeling and multitask learning 22,644
clue-ai/promptclue A pre-trained language model for multiple natural language processing tasks with support for few-shot learning and transfer learning. 656
morizeyao/gpt2-chinese A repository providing a Chinese version of the GPT2 training code, utilizing BERT tokenizer. 7,488