ChineseNlpCorpus
NLP dataset repository
A comprehensive collection of Chinese natural language processing datasets and models for various applications such as sentiment analysis, entity recognition, recommendation systems, and question answering.
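Collecting, organizing, and publishing Chinese natural language processing corpora and datasets, and working with like-minded people to advance Chinese natural language processing.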
6k stars
116 watching
1k forks
Language: Jupyter Notebook
Last commit: about 6 years ago

Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A collection of The Economist's English publications with PDF and audio formats. | 3,613 |
| | Automated image recognition system using convolutional neural networks to recognize character-based CAPTCHAs. | 2,791 |
| | An online web page builder and editor that allows users to create custom pages with various components, scripts, and styling options. | 4,451 |
| | A blog documenting the author's growth and learning journey in software development, covering various topics such as web performance, Vue.js, Node, and more. | 4,161 |
| | A massive corpus of Chinese text data covering various forms and styles | 3,581 |
| | An adapter layer for running web frontend code in WeChat Mini Programs | 4,811 |
| | A comprehensive resource guide for front-end developers covering various programming languages, frameworks, and protocols. | 2,925 |
| | A 30-day project to develop a simple operating system from scratch using C and assembly languages | 6,004 |
| | A comprehensive computer vision framework providing pre-built components and tools for image processing tasks | 2,604 |
| | An open-source framework for developing local knowledge-based chatbots with support for various models and deployment frameworks | 32,496 |
| | A collection of data structure implementations and algorithms from a C programming textbook | 3,647 |
| | Automates literature review and paper analysis tasks using ChatGPT | 18,577 |
| | A Python library providing a multi-domain Chinese word segmentation toolkit with high accuracy and flexibility. | 6,564 |
| | Automated helper tool for China Telecom users to earn rewards and redeem points for phone credits | 1,321 |
| | A modular C++ framework for building and optimizing family relationships models with complex interactions between individuals. | 12,349 |