speechpy
Speech feature extractor
Provides tools and libraries for extracting speech features from audio data.
SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
880 stars
40 watching
104 forks
Language: Python
last commit: about 3 years ago
Linked from 1 awesome list
feature-extractionpythonspeech-recognitionspeechpy
Related projects:
Repository | Description | Stars |
---|---|---|
superkogito/spafe | A Python library for extracting audio features from mono audio files using various filter banks and spectrogram algorithms. | 458 |
jameslyons/matlab_speech_features | A set of speech feature extraction functions for various machine learning applications. | 43 |
vocalpy/vak | A Python framework for training and applying neural networks to acoustic communication research | 78 |
tyiannak/pyaudioanalysis | A comprehensive Python library for feature extraction, classification, segmentation, and applications of audio data. | 5,885 |
jamiebullock/libxtract | A portable, lightweight library providing a set of audio feature extraction functions that can be cascaded to create complex extraction hierarchies. | 227 |
iver56/audiomentations | Library for audio data augmentation used in machine learning | 1,873 |
aixander/realtime_pyaudio_fft | An audio analysis tool that extracts and visualizes features from live audio streams using FFTs. | 969 |
ynop/audiomate | A Python library for handling audio datasets, providing tools for accessing, manipulating, and preparing data for machine learning tasks. | 131 |
yaafe/yaafe | A tool for extracting audio features from digital audio files | 244 |
marl/pysox | A Python wrapper around an audio signal processing library. | 519 |
ibm/max-audio-classifier | Identifies sounds in short audio clips using machine learning and PCA transformation | 153 |
astorfi/3d-convolutional-speaker-recognition | Develops deep learning models using 3D convolutional neural networks for speaker verification tasks | 782 |
spatialaudio/python-sounddevice | Tools and bindings for playing and recording audio with Python | 1,052 |
keunwoochoi/auralisation | Reconstructs audio features learned by convolutional neural networks into audible sounds | 42 |
picovoice/rhino | A deep learning-based speech-to-intent engine for on-device voice interaction | 629 |