wikipron
pronunciation scraper
A tool for extracting and processing multilingual pronunciation data from Wiktionary.
Massively multilingual pronunciation mining
323 stars
18 watching
71 forks
Language: Python
last commit: 3 months ago
Linked from 1 awesome list
computational-linguisticsg2planguagelinguisticsnlpphoneticsphonologypronunciationpython-apiscraped-dataspeech
Related projects:
Repository | Description | Stars |
---|---|---|
| Tool for automating pronunciation lexicon creation for low-resource languages using speech recognition and machine learning algorithms. | 1 |
| Software tool to generate pronunciation lexicons for low-resource languages using speech recognition and machine learning algorithms. | 21 |
| A collection of tools and libraries for analyzing and processing phonological data in various languages | 115 |
| A metadata agent that scrapes audiobook metadata from Audible.com and integrates it with Plex media servers. | 607 |
| A tool for phonetic transcription of languages with close-to-phonetic writing systems | 10 |
| A Ruby framework for converting Hebrew text to English using phoneme maps | 7 |
| A framework to interpret multilingual NLP models and understand their word representations. | 24 |
| Estimates phonetic similarity between Chinese words and suggests similar-sounding candidates | 35 |
| An industrial-strength natural language processing library for Hungarian language text analysis | 158 |
| Automates text extraction and alignment from Global Voices articles to create parallel corpora for low-resource languages. | 9 |
| Provides phonetic pattern matching functionality in Common Lisp to aid with natural language processing and text analysis. | 24 |
| Spelling correction system for the Ukrainian language using noisy channel model | 3 |
| A package of scripts to prepare data for alignment in speech processing | 12 |
| An open-source wrapper around LLMs to extract structured data from text | 1,638 |
| A web-based annotation tool for natural language processing (NLP) | 520 |