asteroza + learning + tts   8

DeepZen
Trying to put audiobook narrators out of business, by automating narration of e-books for conversion into audiobooks. Probably not better than real pro voice actors organized specifically for an audiobook, but for random small e-books, this is probably good enough (though that will put struggling unknown voice actors out of work...)
deep  machine  learning  ebook  e-book  voice  narration  automation  audiobook  conversion  audio  generator  service  speech  synthesis  TTS  text-to-speech 
june 2019 by asteroza
NVIDIA/waveglow: A Flow-based Generative Network for Speech Synthesis
25x realtime, so you can respond in realtime if your text generating NN can keep up with the speech-to-text...
Nvidia  GPU  CUDA  deep  machine  learning  human  speech  synthesis  synthetic  voice  artificial  generative  network  software  audio  TTS 
november 2018 by asteroza
ObamaNet: Photo-realistic lip-sync from text
Lyrebird moves to video. Only a few more steps before the average person can't distinguish from real video. Which leads to the endgame of never trusting any video, ever.
deep  learning  TTS  text-to-speech  sppech  machine  generated  video  lipsync  voice  emulation 
december 2017 by asteroza

Copy this bookmark:



description:


tags: