snownlp
Chinese Text Processor
A Python library for processing and analyzing Chinese text
Python library for processing Chinese text
6k stars
350 watching
1k forks
Language: Python
last commit: about 5 years ago
Linked from 6 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
| A toolkit for Chinese natural language processing tasks | 2,648 |
| Develops and publishes pre-trained Chinese language models using Whole Word Masking technology. | 9,746 |
| A programming language designed to resemble ancient Chinese grammar and syntax, compiling to JavaScript or other languages. | 19,790 |
| An AI-powered text generation model trained on Chinese data to perform various tasks such as conversation, translation, and content creation. | 418 |
| A large language model based on the Chinese LLaMA architecture, designed to support complex conversations and applications. | 3,641 |
| A massive corpus of Chinese text data covering various forms and styles | 3,581 |
| Translation of a popular deep learning book into Chinese, aiming to improve accuracy and accessibility. | 35,890 |
| A comprehensive guide to building applications with Large Language Models (LLMs) for developers | 12,377 |
| A general-purpose language model pre-trained with an autoregressive blank-filling objective and designed for various natural language understanding and generation tasks. | 3,207 |
| A deep learning model that incorporates visual and phonetic features of Chinese characters to improve its ability to understand Chinese language nuances | 545 |
| Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. | 806 |
| A modular C++ framework for building and optimizing family relationships models with complex interactions between individuals. | 12,349 |
| A repository providing code and models for research into language modeling and multitask learning | 22,644 |
| A pre-trained language model for multiple natural language processing tasks with support for few-shot learning and transfer learning. | 656 |
| A repository providing a Chinese version of the GPT2 training code, utilizing BERT tokenizer. | 7,488 |