Project author: mazzzystar

Project description:
PyTorch implementation of "Synthesizing Audio with Generative Adversarial Networks"
Language: Python
Repository: git://github.com/mazzzystar/WaveGAN-pytorch.git
Created: 2018-07-16T12:03:08Z
Project page: https://github.com/mazzzystar/WaveGAN-pytorch

WaveGAN-pytorch

PyTorch implementation of Synthesizing Audio with Generative Adversarial Networks (Chris Donahue, Feb 2018).

Before running, make sure you have the SC09 dataset and place it under your current project directory.

Quick Start:

  1. Installation

    1. sudo apt-get install libav-tools
  2. Download dataset

  3. Run

For the SC09 task, make sure the SC09 dataset is under your current project directory before you run the code.

    1. $ python train.py
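
Since training expects the dataset to already be in place, a quick check before launching `train.py` can save a failed run. The following is a minimal sketch, not part of this repo: the helper names `find_wav_files` and `check_dataset` are hypothetical, and the default directory name `sc09` is an assumption about how the dataset folder is named.

```python
import os


def find_wav_files(root):
    """Return sorted paths of all .wav files found under root (recursively)."""
    wav_paths = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if name.lower().endswith(".wav"):
                wav_paths.append(os.path.join(dirpath, name))
    return sorted(wav_paths)


def check_dataset(root="sc09"):
    """Return the number of .wav files under root.

    Raises FileNotFoundError if the directory is missing or holds no .wav
    files. The default name "sc09" is an assumption, not taken from train.py.
    """
    if not os.path.isdir(root):
        raise FileNotFoundError("dataset directory not found: %s" % root)
    count = len(find_wav_files(root))
    if count == 0:
        raise FileNotFoundError("no .wav files found under: %s" % root)
    return count
```

Calling `check_dataset()` from the project directory either returns the number of audio files found or fails early with a readable message.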

Training time

  • For the SC09 dataset, 4 x Tesla P40 take nearly 2 days to reach reasonable results.
  • For the piano dataset, 2 x Tesla P40 take 3-6 hours to reach reasonable results.
  • Increasing BATCH_SIZE from 10 to 32 or 64 shortens the per-epoch time on multiple GPUs, but gradient descent then learns more slowly.
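
One plausible reading of the batch-size trade-off is that a larger batch means fewer parameter updates per epoch. The arithmetic can be sketched as follows; `NUM_EXAMPLES` is a hypothetical dataset size chosen for illustration, not a figure from this repo, and `steps_per_epoch` is a helper introduced here.

```python
import math


def steps_per_epoch(num_examples, batch_size):
    """Number of gradient updates needed to see every example once."""
    return math.ceil(num_examples / batch_size)


# Hypothetical dataset size, for illustration only.
NUM_EXAMPLES = 16000

for batch_size in (10, 32, 64):
    print(batch_size, steps_per_epoch(NUM_EXAMPLES, batch_size))
# 10 1600
# 32 500
# 64 250
```

With these assumed numbers, going from a batch size of 10 to 64 cuts the updates per epoch from 1600 to 250, so each epoch finishes sooner but moves the weights fewer times.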

Results

Generated “0-9”: https://soundcloud.com/mazzzystar/sets/dcgan-sc09

Generated piano: https://soundcloud.com/mazzzystar/sets/wavegan-piano

Loss curve:

Architecture

TODO

  • Add some evaluation experiments, e.g. Inception Score.
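
For reference, the Inception Score is commonly defined as exp(E_x[KL(p(y|x) || p(y))]), where p(y|x) are class probabilities predicted for each generated sample and p(y) is their average. The sketch below is not part of this repo; it assumes the per-sample probabilities have already been produced by some pretrained classifier (not included here), and the function name `inception_score` is chosen for illustration.

```python
import math


def inception_score(probs, eps=1e-12):
    """Compute exp(mean KL(p(y|x) || p(y))) from per-sample class probabilities.

    probs: list of rows, one per generated sample, each row summing to 1.
    eps guards against log(0); it is an implementation detail of this sketch.
    """
    n = len(probs)
    k = len(probs[0])
    # Marginal class distribution p(y), averaged over all samples.
    marginal = [sum(row[j] for row in probs) / n for j in range(k)]
    total_kl = 0.0
    for row in probs:
        total_kl += sum(
            p * (math.log(p + eps) - math.log(m + eps))
            for p, m in zip(row, marginal)
            if p > 0
        )
    return math.exp(total_kl / n)
```

Under this definition, samples that are all classified identically give a score near 1, while confident predictions spread evenly over k classes approach a score of k.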

Contributions

This repo is based on chrisdonahue’s and jtcramer’s implementations.