uuseg

Unicode segmentation library

An OCaml library for segmenting Unicode text into grapheme clusters, words, and sentences.

Unicode text segmentation for OCaml

GitHub

23 stars
6 watching
6 forks
Language: OCaml
last commit: 11 days ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
dbuenzli/uunf A library for normalizing Unicode text to a standardized format 21
dbuenzli/uucp Provides access to Unicode character properties in an OCaml library 21
dbuenzli/uutf A non-blocking streaming codec for Unicode encoding schemes 32
dbuenzli/uucd Decodes data from Unicode character database XML representation 17
dbuenzli/vg A declarative 2D vector graphics library written in OCaml 91
blevesearch/segment A Go library for performing Unicode Text Segmentation 89
yoriyuki/camomile A Unicode library for OCaml providing character type, string encodings, collation and locale-sensitive case mappings. 124
ocaml-community/sedlex A tool for generating Unicode-friendly lexers in OCaml 240
whitequark/ocaml-m17n A package that enables multilingualization of OCaml source code by supporting Unicode identifiers 50
uni-algo/uni-algo A C/C++ library that provides secure and efficient Unicode algorithms for text processing 280
dbuenzli/topkg A tool that provides an API and command line interface to manage OCaml software packages 68
brendanlong/ocaml-ooxml A collection of OCaml libraries for parsing and working with Microsoft Office document formats 33
ocaml-community/biniou An efficient binary data format for serializing and deserializing structured data 42
dannywillems/ocaml-cordova-plugin-barcode-scanner A binding that enables barcode scanning functionality in an OCaml application using the cordova-plugin-barcodescanner plugin 0
tsubame-sp/ocaml_at_p A debugging tool for OCaml programming 11