ChineseNlpCorpus

NLP dataset repository

A comprehensive collection of Chinese natural language processing datasets and models for various applications such as sentiment analysis, entity recognition, recommendation systems, and question answering.

搜集、整理、发布 中文 自然语言处理 语料/数据集,与 有志之士 共同 促进 中文 自然语言处理 的 发展。

GitHub

6k stars
117 watching
1k forks
Language: Jupyter Notebook
last commit: almost 6 years ago

Related projects:

Repository Description Stars
nailperry-zd/the-economist A collection of The Economist's English publications with PDF and audio formats. 3,605
nickliqian/cnn_captcha Automated image recognition system using convolutional neural networks to recognize character-based CAPTCHAs. 2,781
ymm-tech/gods-pen An online web page builder and editor that allows users to create custom pages with various components, scripts, and styling options. 4,444
berwin/blog A blog documenting the author's growth and learning journey in software development, covering various topics such as web performance, Vue.js, Node, and more. 4,160
esbatmop/mnbvc Collects and provides access to a vast corpus of Chinese text data from various sources 3,520
tencent/kbone An adapter layer for running web frontend code in WeChat Mini Programs 4,802
icepy/front-end-develop-guide A comprehensive resource guide for front-end developers covering various programming languages, frameworks, and protocols. 2,927
yourtion/30daymakeos A 30-day project to develop a simple operating system from scratch using C and assembly languages 5,964
charmve/computer-vision-in-action A comprehensive computer vision framework providing pre-built components and tools for image processing tasks 2,577
chatchat-space/langchain-chatchat An open-source framework for developing local knowledge-based chatbots with support for various models and deployment frameworks 32,060
kangjianwei/data-structure A collection of data structure implementations and algorithms from a C programming textbook 3,629
kaixindelele/chatpaper Automates literature review and paper analysis tasks using ChatGPT 18,491
lancopku/pkuseg-python A Python library providing a multi-domain Chinese word segmentation toolkit with high accuracy and flexibility. 6,541
insoxin/china-telecom-helper Automated helper tool for China Telecom users to earn rewards and redeem points for phone credits 1,319
bbfamily/abu A lightweight, extensible, and embeddable implementation of the Lua Virtual Machine (LVM) with support for bytecode manipulation and execution. 12,119