THULAC-Python

Chinese lexer

An efficient Chinese lexical analyzer with morphological analysis capabilities

An Efficient Lexical Analyzer for Chinese

GitHub

2k stars
80 watching
336 forks
Language: Python
last commit: almost 3 years ago
Linked from 1 awesome list

chinese-nlp

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
thunlp/openclap A repository of pre-trained language models for natural language processing tasks in Chinese 979
lingpy/lingpy A Python library for performing quantitative tasks in historical linguistics 126
pythainlp/lexicon-thai A Thai language corpus and lexicon repository for natural language processing 142
c4n/pythonlexto A Python wrapper around the Thai word segmentator LexTo, allowing developers to easily integrate it into their applications. 1
johannesbuchner/languagecheck A tool to analyze and improve the language of scientific papers before submission. 98
proycon/python-frog A Python binding to a C++ NLP tool for Dutch language processing tasks 47
pymorphy2/pymorphy2 A morphological analyzer and generator for Russian and Ukrainian languages 1,125
ymcui/chinese-xlnet Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture 1,652
microsoft/unicoder This repository provides pre-trained models and code for understanding and generation tasks in multiple languages. 88
synyi/poplar A web-based annotation tool for natural language processing (NLP) 520
sixty-north/python-transducers A Python package providing a transducer framework for composing simple reducers into more complex data processing functions. 55
alexrutherford/arabic_nlp Tools for normalizing and deriving sentiment from Arabic text 26
taosir/cnn_handwritten_chinese_recognition A Python-based web application that recognizes handwritten Chinese characters using a Convolutional Neural Network (CNN), allowing users to input text via an online writing board and receive recognition results. 510
pythainlp/pythainlp A Python package for text processing and linguistic analysis focused on the Thai language. 989
lxneng/xpinyin A Python library for translating Chinese characters to pinyin 825