SpecVQGAN
Visual sound generator
A project to train an audio generation model that uses visual cues to produce high-quality sounds from a reduced dataset of representative vectors.
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
353 stars
8 watching
39 forks
Language: Jupyter Notebook
last commit: 7 months ago audioaudio-generationbmvcevaluation-metricsganmelganmulti-modalpytorchtransformervasvggsoundvideovideo-featuresvideo-understandingvqvae
Related projects:
Repository | Description | Stars |
---|---|---|
| A spectral modeling synthesizer for generating sound. | 32 |
| An implementation of a neural network for generating sound from video sequences | 52 |
| A Java-based Android app for visualizing audio recordings | 550 |
| Generates polyphonic music sequences using deep learning models and adversarial training | 28 |
| Generates polyphonic music from scratch or based on user input using a neural network architecture | 1,863 |
| An open-source machine learning algorithm for generating raw audio waveforms from raw data | 1,334 |
| An open-source plugin for generating vibrato effects in audio signals | 77 |
| An implementation of WaveNet for generating audio using PyTorch | 979 |
| A JavaScript synthesizer library used to generate sound effects and music for video games and small applications. | 236 |
| An audio visualizer extension that displays spectrograms of YouTube videos and microphone input on web pages. | 175 |
| An audio visualizer software that generates real-time visuals based on incoming audio streams | 24 |
| A software tool that generates synthetic engine sounds with customizable parameters and real-time preview capabilities. | 313 |
| An audio visualization plugin for the VueJS framework using HTML5 Web Audio API. | 726 |
| A music visualizer that uses audio spectrum analysis to create real-time visualizations of an audio stream. | 10 |
| A program for visualizing audio signals in real-time. | 19 |