unicode

Unicode normalizer

A Unicode normalization library for Ruby, providing functions to compare, decompose, compose and normalize strings according to the Unicode standard.

Unicode normalization library. (Mirror of Yoshida-san's code base to maintain the RubyGem.)

GitHub

80 stars
5 watching
13 forks
Language: C
last commit: 5 months ago
Linked from 1 awesome list

rubyrubynlpunicode

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
dbuenzli/uunf A library for normalizing Unicode text to a standardized format 21
sshaw/normalize_country A Ruby library that standardizes country names and codes into multiple formats. 68
null8626/decancer A library that removes common unicode confusables/homoglyphs from strings 102
inputsh/cyrillizer A Ruby library for converting characters between Latin and Cyrillic scripts. 0
spellbook-technology/transmutation A gem that provides a simple way to serialize Ruby objects into JSON with an opinionated approach to serializer lookup 9
fazibear/colorize A Ruby library for colorizing text using ANSI escape sequences 1,248
sindresorhus/normalize-url A utility to standardize and normalize URLs by resolving relative references and stripping unnecessary parts. 840
twitter/twitter-cldr-rb A Ruby library providing Unicode-based formatting and localization tools for dates, numbers, currencies, and more. 672
craigwilliams/beautifyruby A Sublime Text plugin to format and beautify Ruby code. 258
r-lyeh-archived/unify A C++11 function to normalize resource identifiers into a standardized format 22
comphist/norma A tool for normalizing spelling in non-standard language data by combining multiple techniques with training data and a target dictionary. 20
floere/phony A Ruby gem that normalizes and formats international phone numbers 1,014
syntax-tree/nlcst-normalize A utility to normalize words for comparison by standardizing punctuation and case. 7
lang/unicode_utils Utilities for working with Unicode strings in Ruby 113
talyssonoc/commonregexruby Extracts common information from text strings in various formats 79