vak
Acoustic analysis toolkit
A Python framework for training and applying neural networks to acoustic communication research
A neural network framework for researchers studying acoustic communication
78 stars
4 watching
16 forks
Language: Python
last commit: 4 months ago
Linked from 1 awesome list
animal-communicationanimal-vocalizationsbioacoustic-analysisbioacousticsbirdsongpythonpython3pytorchspectrogramsspeech-processingtorchtorchvisionvocalizations
Related projects:
Repository | Description | Stars |
---|---|---|
| A toolbox for analyzing and understanding environmental audio recordings | 109 |
| A collection of tools and libraries for analyzing and processing phonological data in various languages | 115 |
| An implementation of a neural network-based vocoder using parallel-wavenet architecture and autoregressive flow | 290 |
| An experimental framework for improving singing voice detection with data augmentation and neural networks | 36 |
| A comprehensive Python library for feature extraction, classification, segmentation, and applications of audio data. | 5,918 |
| A PyTorch implementation of end-to-end speech recognition models. | 756 |
| A Python library for handling audio datasets, providing tools for accessing, manipulating, and preparing data for machine learning tasks. | 133 |
| A toolbox to help understand neural networks' predictions by providing different analysis methods and a common interface. | 1,271 |
| A wrapper package providing an R interface to the BirdNET Python package for automated bird sound ID analysis | 17 |
| A Python library for identifying bird species by their sounds using deep learning and acoustic monitoring. | 12 |
| An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data | 1,036 |
| A collection of Matlab scripts implementing various audio analysis algorithms and features. | 86 |
| Library for audio data augmentation used in machine learning | 1,903 |
| Reconstructs audio features learned by convolutional neural networks into audible sounds | 42 |
| A collection of pre-trained audio and speech models for various applications | 183 |