jingjuSyllabicSegmentaion
Syllable segmentation tool
An implementation of a score-informed method for segmenting jingju a cappella singing voice into syllables using convolutional neural networks and the Viterbi algorithm
Code for the paper: Score-informed Syllable Segmentation for A Cappella Singing Voice with Convolutional Neural Networks
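The general idea of combining a per-frame onset detection function with score durations can be sketched as follows. This is a minimal illustration, not the repository's implementation: the function name `segment_syllables`, the Gaussian duration prior, and the `sigma` parameter are all assumptions made for the example, and the onset detection function is assumed to be the per-frame output of some onset model such as a CNN.

```python
import numpy as np


def segment_syllables(odf, score_durations, sigma=0.3):
    """Pick syllable boundaries with Viterbi-style dynamic programming.

    odf: per-frame onset probabilities (assumed to come from an onset model).
    score_durations: relative syllable durations taken from the score.
    sigma: hypothetical width of the duration prior, relative to the
        expected syllable length.
    Returns the frame indices of the len(score_durations) - 1 internal
    boundaries.
    """
    odf = np.asarray(odf, dtype=float)
    durs = np.asarray(score_durations, dtype=float)
    n_frames = len(odf)
    n_bound = len(durs) - 1
    if n_bound < 1:
        return []

    # Expected syllable lengths in frames, scaled to the audio length.
    expected = durs / durs.sum() * n_frames
    log_odf = np.log(np.clip(odf, 1e-10, None))

    def dur_logp(length, k):
        # Assumed Gaussian log-prior on the length of syllable k.
        return -0.5 * ((length - expected[k]) / (sigma * expected[k])) ** 2

    # delta[k, t]: best log-score with boundary k placed at frame t.
    delta = np.full((n_bound, n_frames), -np.inf)
    psi = np.zeros((n_bound, n_frames), dtype=int)

    for t in range(1, n_frames):
        delta[0, t] = log_odf[t] + dur_logp(t, 0)

    for k in range(1, n_bound):
        for t in range(k + 1, n_frames):
            prev = np.arange(k, t)  # candidate frames for boundary k - 1
            cand = delta[k - 1, prev] + dur_logp(t - prev, k)
            best = int(np.argmax(cand))
            delta[k, t] = cand[best] + log_odf[t]
            psi[k, t] = prev[best]

    # Account for the length of the last syllable, then backtrack.
    frames = np.arange(n_frames)
    final = delta[n_bound - 1] + dur_logp(n_frames - frames, n_bound)
    path = [int(np.argmax(final))]
    for k in range(n_bound - 1, 0, -1):
        path.append(int(psi[k, path[-1]]))
    return path[::-1]


# Toy input: 100 frames, three equally long syllables in the score,
# and two clear onset peaks in the detection function.
toy_odf = np.full(100, 0.01)
toy_odf[[30, 70]] = 0.9
print(segment_syllables(toy_odf, [1, 1, 1]))
```

The score enters only through the duration prior, which steers the boundaries toward the expected syllable lengths when the onset evidence is weak or ambiguous.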
7 stars
3 watching
3 forks
Language: Python
last commit: over 6 years ago
Topics: cnn-model, onset-detection, singing, syllable, viterbi-algorithm
Related projects:
Repository | Description | Stars |
---|---|---|
ronggong/jingjusingingphrasematching | This repository provides a software framework to match singing audio with corresponding music scores based on phonetic and duration information. | 27 |
ronggong/eusipco2017 | A software project that enables phoneme classification in music audio signals using convolutional neural networks and other machine learning techniques. | 19 |
henchc/syllabipy | Provides algorithms for breaking down words into their constituent syllables. | 44 |
jingchuncheng/seg-with-spn | This project provides code and pre-trained models for object segmentation in videos using a spatial propagation network. | 144 |
js-mim/mss_pytorch | This project provides a PyTorch implementation of a singing voice separation algorithm using recurrent inference and skip-filtering connections. | 171 |
google/uis-rnn | This library provides an implementation of an algorithm for segmenting and clustering sequential data, learning from examples. | 1,560 |
yuangongnd/whisper-at | An audio processing model that adds audio event tagging capabilities to an existing speech recognition system with minimal additional computational cost. | 321 |
yunlongdong/fcn-pytorch | A PyTorch implementation of FCN for semantic segmentation with an easy-to-use interface and pre-trained models. | 160 |
zijundeng/pytorch-semantic-segmentation | Provides PyTorch implementations of various models and pipelines for semantic segmentation in deep learning. | 1,724 |
zhenghuatan/rvad | An unsupervised method for detecting speech activity in noisy audio signals. | 128 |
f0k/ismir2015 | An experimental framework for improving singing voice detection with data augmentation and neural networks. | 35 |
hyeonwoonoh/deconvnet | Deconvolution network architecture for semantic segmentation. | 325 |
yxlllc/ddsp-svc | Real-time end-to-end singing voice conversion system using DDSP models and various machine learning techniques. | 1,881 |
xiao-ming/unet-vocalseparation-chainer | An implementation of the U-Net singing voice separation model using the Chainer framework. | 90 |
diasks2/pragmatic_segmenter | A rule-based sentence boundary detection gem that works across many languages. | 551 |