SpecVQGAN
Visual sound generator
A project to train an audio generation model that uses visual cues to produce high-quality sounds from a reduced dataset of representative vectors.
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
347 stars
8 watching
40 forks
Language: Jupyter Notebook
last commit: 4 months ago audioaudio-generationbmvcevaluation-metricsganmelganmulti-modalpytorchtransformervasvggsoundvideovideo-featuresvideo-understandingvqvae
Related projects:
Repository | Description | Stars |
---|---|---|
virtualanalogy/paraphrasis | A spectral modeling synthesizer for generating sound. | 31 |
peihaochen/regnet | An implementation of a neural network for generating sound from video sequences | 52 |
tyorikan/voice-recording-visualizer | A Java-based Android app for visualizing audio recordings | 548 |
l0sg/seqgan-music | Generates polyphonic music sequences using deep learning models and adversarial training | 28 |
salu133445/musegan | Generates polyphonic music from scratch or based on user input using a neural network architecture | 1,846 |
chrisdonahue/wavegan | An open-source machine learning algorithm for generating raw audio waveforms from raw data | 1,330 |
mrugalla/nel-19 | An open-source plugin for generating vibrato effects in audio signals | 75 |
vincentherrmann/pytorch-wavenet | An implementation of WaveNet for generating audio using PyTorch | 975 |
nicolas-van/sonant-x | A JavaScript synthesizer library used to generate sound effects and music for video games and small applications. | 235 |
mfcc64/youtube-musical-spectrum | An audio visualizer extension that displays spectrograms of YouTube videos and microphone input on web pages. | 175 |
ghostnan/revidia-audio-visualizer | An audio visualizer software that generates real-time visuals based on incoming audio streams | 23 |
dasetwas/enginesound | A software tool that generates synthetic engine sounds with customizable parameters and real-time preview capabilities. | 312 |
staskobzar/vue-audio-visual | An audio visualization plugin for the VueJS framework using HTML5 Web Audio API. | 722 |
viktornova/nausea | A music visualizer that uses audio spectrum analysis to create real-time visualizations of an audio stream. | 10 |
jonathanzwhite/audio-visualizer | A program for visualizing audio signals in real-time. | 19 |