SpecVQGAN

Visual sound generator

A project to train an audio generation model that uses visual cues to produce high-quality sounds from a reduced dataset of representative vectors.

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

GitHub

347 stars
8 watching
40 forks
Language: Jupyter Notebook
last commit: 4 months ago
audioaudio-generationbmvcevaluation-metricsganmelganmulti-modalpytorchtransformervasvggsoundvideovideo-featuresvideo-understandingvqvae

Related projects:

Repository Description Stars
virtualanalogy/paraphrasis A spectral modeling synthesizer for generating sound. 31
peihaochen/regnet An implementation of a neural network for generating sound from video sequences 52
tyorikan/voice-recording-visualizer A Java-based Android app for visualizing audio recordings 548
l0sg/seqgan-music Generates polyphonic music sequences using deep learning models and adversarial training 28
salu133445/musegan Generates polyphonic music from scratch or based on user input using a neural network architecture 1,846
chrisdonahue/wavegan An open-source machine learning algorithm for generating raw audio waveforms from raw data 1,330
mrugalla/nel-19 An open-source plugin for generating vibrato effects in audio signals 75
vincentherrmann/pytorch-wavenet An implementation of WaveNet for generating audio using PyTorch 975
nicolas-van/sonant-x A JavaScript synthesizer library used to generate sound effects and music for video games and small applications. 235
mfcc64/youtube-musical-spectrum An audio visualizer extension that displays spectrograms of YouTube videos and microphone input on web pages. 175
ghostnan/revidia-audio-visualizer An audio visualizer software that generates real-time visuals based on incoming audio streams 23
dasetwas/enginesound A software tool that generates synthetic engine sounds with customizable parameters and real-time preview capabilities. 312
staskobzar/vue-audio-visual An audio visualization plugin for the VueJS framework using HTML5 Web Audio API. 722
viktornova/nausea A music visualizer that uses audio spectrum analysis to create real-time visualizations of an audio stream. 10
jonathanzwhite/audio-visualizer A program for visualizing audio signals in real-time. 19