charguana

Character encoding library

A Python library that provides character sets and Unicode support for various languages and scripts, including CJK, Romaji, Japanese, Korean, and Chinese.
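To illustrate what generating the characters of a script can look like, here is a minimal sketch that uses only the Python standard library. It does not use charguana's own API: the helper `chars_in_range` and the constants `HIRAGANA` and `HANGUL_SYLLABLES` are hypothetical names introduced for this example, and the block boundaries are the ones given in the Unicode Standard.

```python
import unicodedata

# Hypothetical helper (not part of charguana): yield every assigned
# character in an inclusive range of Unicode code points.
def chars_in_range(start, end):
    for codepoint in range(start, end + 1):
        char = chr(codepoint)
        # unicodedata.name() returns the given default ("") when the
        # code point has no name, which filters out unassigned slots.
        if unicodedata.name(char, ""):
            yield char

# Block boundaries as defined in the Unicode Standard.
HIRAGANA = (0x3040, 0x309F)
HANGUL_SYLLABLES = (0xAC00, 0xD7A3)

hiragana = list(chars_in_range(*HIRAGANA))
hangul = list(chars_in_range(*HANGUL_SYLLABLES))

print(hiragana[:5])
print(len(hangul))
```

Filtering on `unicodedata.name` keeps the sketch independent of any hard-coded character table, at the cost of depending on the Unicode version bundled with the Python interpreter.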

Character Vomiting

GitHub

10 stars
4 watching
3 forks
Language: Python
last commit: over 6 years ago
Linked from 1 awesome list

cjk, python3, unicode

Backlinks from these awesome lists:

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| uni-algo/uni-algo | A C/C++ library that provides secure and efficient Unicode algorithms for text processing. | 280 |
| darienhuss/shotgunyara | Tools and utilities for generating encoded versions of input data. | 9 |
| cl-babel/babel | A Common Lisp library for efficient charset encoding and decoding. | 92 |
| tahonermann/text_view | A C++ library providing iterator and range-based interfaces for encoding and decoding strings in various character encodings. | 122 |
| dimakura/ka.js | A library providing Georgian language support and utilities for converting between character sets and formatting numbers. | 5 |
| alvations/seedling | A corpus and API for human language data. | 11 |
| cidles/pyannotation | A Python library to access and manipulate linguistically annotated corpus files in various formats. | 16 |
| jeremyevans/ape_tag_libs | A set of libraries to read and write APEv2 tags in various programming languages. | 12 |
| mpdavis/python-jose | A Python implementation of the JOSE technologies for encrypting and/or signing content. | 1,548 |
| sandeep42/anuvada | An open-source PyTorch library providing tools and models to explain the predictions of deep neural networks for natural language processing tasks. | 19 |
| pyos/dg | A programming language compiler for CPython bytecode. | 576 |
| ynakajima/ucd | Functions to access and manipulate Unicode character data in JavaScript. | 9 |
| jakezhaojb/arae | An implementation of Adversarially Regularized Autoencoders for language generation and discrete structure modeling. | 400 |
| gugarosa/nalp | A Python library for natural language processing with adversarial learning capabilities. | 23 |
| jecolon/ziglyph | A library providing tools and utilities for processing Unicode text in the Zig programming language. | 207 |