Generative Audio (3)

Abstract

In this project, a machine critic is built to give comments on several pictures. The critic is not intelligent enough to surpass describing the contents of pictures. However, it performs excellent tongue-twisters to beat human beings. The voice of the critic comes from two models, DeepVoice3 and Tacotron2, respectively.

Results

ex.106

106 - DeepVoice3

106 - Tacotron2

ex.8

8 - DeepVoice3

8 - Tacotron2

ex.101

101 - DeepVoice3

101 - Tacotron2

Tongue-twister 1: Peter Piper picked a peck of pickled peppers. How many pickled peppers did Peter Piper pick?

DeepVoice3

Tacotron2

Tongue-twister 2: A skunk sat on a stump and thunk the stump stunk, but the stump thunk the skunk stunk.
DeepVoice3

Tacotron2

Code

Technical Notes

  • All the packages are included when running ipynb files.
  • Tested on Google Colab.

Reference