jingjuSingingPhraseMatching
Audio-score matcher
This repository provides a software framework to match singing audio with corresponding music scores based on phonetic and duration information.
Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information
27 stars
2 watching
6 forks
Language: Python
last commit: over 7 years ago acoustic-modelcnn-modelhsmmphonemescoresinging-phrase
Related projects:
Repository | Description | Stars |
---|---|---|
ronggong/jingjusyllabicsegmentaion | An implementation of a score-informed method for segmenting jingju a cappella singing voice into syllables using convolutional neural networks and Viterbi algorithm | 7 |
sergree/matchering | An audio matching and mastering tool that uses machine learning to adapt the sound of one track to match another | 1,782 |
ronggong/eusipco2017 | A software project that enables phoneme classification in music audio signals using convolutional neural networks and other machine learning techniques. | 19 |
ibm/max-chinese-phonetic-similarity-estimator | Estimates phonetic similarity between Chinese words and suggests similar-sounding candidates | 35 |
system-t/dimsim | A phonetic similarity algorithm for indexing Chinese characters by sound | 120 |
bgutter/cl-phonetic | Provides phonetic pattern matching functionality in Common Lisp to aid with natural language processing and text analysis. | 24 |
yuangongnd/ltu | An audio and speech large language model implementation with pre-trained models, datasets, and inference options | 385 |
jordipons/music-audio-tagging-at-scale-models | Research on end-to-end learning for music audio tagging using large datasets and different front-end paradigms. | 148 |
jamesturk/jellyfish | A Python library providing algorithms and encoding schemes for approximate string matching. | 2,066 |
cpjku/madmom | A Python audio signal processing library used in music information retrieval tasks. | 1,347 |
igglybuff/mreg | An application that generates a string expression for filtering movie releases. | 15 |
xidongwu/d-auprc | Provides an implementation of a specific algorithm used in audio signal processing | 0 |
iver56/audiomentations | Library for audio data augmentation used in machine learning | 1,873 |
yuangongnd/whisper-at | An audio processing model that adds audio event tagging capabilities to an existing speech recognition system with minimal additional computational cost. | 321 |
r3gm/sonitranslate | Software that allows video translation with synchronized audio, utilizing speech-to-text and text-to-speech technologies. | 869 |