sqlite3-unicodesn
FTS tokenizer
A SQLite extension that provides a Unicode-aware full-text search (FTS) tokenizer with Snowball stemming.
SQLite unicode full-text-search tokenizer with Snowball stemming
34 stars
5 watching
13 forks
Language: C
Last commit: over 5 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
abiliojr/fts5-snowball | A Snowball stemmer tokenizer extension for FTS5 in SQLite | 47 |
haifengkao/sqlitesubstringsearch | A tokenizer that supports fast substring search with FTS (full text search) capabilities | 83 |
wangwang4git/sqlite3-icu | A C-based implementation of a Chinese tokenizer for SQLite3 using ICU's Analysis feature. | 6 |
c4n/pythonlexto | A Python wrapper around the Thai word segmenter LexTo, allowing developers to easily integrate it into their applications. | 1 |
elau1004/tfr4sqlite | A module to read and parse structured text files into a virtual table in SQLite | 19 |
jasonshin/sqlx-ts | Tool for compile-time checked queries with type generation | 185 |
zurawiki/tiktoken-rs | Provides a Rust library for tokenizing text with OpenAI models using tiktoken. | 256 |
mathewsanders/mustard | A Swift library for tokenizing strings with customizable matching behavior | 689 |
frost/isn | Provides PostgreSQL type definitions and Ecto extensions for international standards in data storage | 10 |
flinedev/bartycrouch | Automates string updates and translations in Xcode projects | 1,365 |
florents-tselai/litejq | A jq extension for SQLite that simplifies working with JSON data in SQL queries | 92 |
zseder/huntoken | A tool for tokenizing raw text into words and sentences in multiple languages. | 3 |
hotpotatoc/snowflake | Generates unique identifiers in the form of timestamps and sequence numbers with optional machine and process IDs | 4 |
hissssst/pathex | A library for fast and functional access to nested data structures in Elixir | 339 |
teo-tsirpanis/farkle | A parser library for F# and C# | 93 |