sqlite3-unicodesn

FTS tokenizer

An extension that adds full-text search capabilities to SQLite with Snowball stemming.

SQLite unicode full-text-search tokenizer with Snowball stemming

GitHub

34 stars
5 watching
13 forks
Language: C
last commit: over 5 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
abiliojr/fts5-snowball A Snowball stemmer tokenizer extension for FTS5 in SQLite 47
haifengkao/sqlitesubstringsearch A tokenizer that supports fast substring search with FTS (full text search) capabilities 83
wangwang4git/sqlite3-icu A C-based implementation of a Chinese tokenizer for SQLite3 using ICU's Analysis feature. 6
c4n/pythonlexto A Python wrapper around the Thai word segmentator LexTo, allowing developers to easily integrate it into their applications. 1
elau1004/tfr4sqlite A module to read and parse structured text files into a virtual table in SQLite 19
jasonshin/sqlx-ts Tool for compile-time checked queries with type generation 185
zurawiki/tiktoken-rs Provides a Rust library for tokenizing text with OpenAI models using tiktoken. 256
mathewsanders/mustard A Swift library for tokenizing strings with customizable matching behavior 689
frost/isn Provides PostgreSQL type definitions and Ecto extensions for international standards in data storage 10
flinedev/bartycrouch Automates string updates and translations in Xcode projects 1,365
florents-tselai/litejq A jq extension for SQLite that simplifies working with JSON data in SQL queries 92
zseder/huntoken A tool for tokenizing raw text into words and sentences in multiple languages. 3
hotpotatoc/snowflake Generates unique identifiers in the form of timestamps and sequence numbers with optional machine and process IDs 4
hissssst/pathex A library for fast and functional access to nested data structures in Elixir 339
teo-tsirpanis/farkle A parser library for F# and C# 93