asteroza + voice   156

Trying to put audiobook narrators out of business, by automating narration of e-books for conversion into audiobooks. Probably not better than real pro voice actors organized specifically for an audiobook, but for random small e-books, this is probably good enough (though that will put struggling unknown voice actors out of work...)
deep  machine  learning  ebook  e-book  voice  narration  automation  audiobook  conversion  audio  generator  service  speech  synthesis  TTS  text-to-speech 
12 weeks ago by asteroza
NVIDIA/waveglow: A Flow-based Generative Network for Speech Synthesis
25x realtime, so you can respond in realtime if your text generating NN can keep up with the speech-to-text...
Nvidia  GPU  CUDA  deep  machine  learning  human  speech  synthesis  synthetic  voice  artificial  generative  network  software  audio  TTS 
november 2018 by asteroza
BabbleLabs Clear Cloud: Speak your mind
voice cleanup service for audio from noisy environments,free due to beta status
audio  voice  noise  removal  cleanup  cloud  service 
october 2018 by asteroza
Voicery Speech Synthesis
Using deep learning to make artificial voices for voice assistants
artificial  voice  synthesizer  service  TTS  text-to-speech  speech  audio 
march 2018 by asteroza
using audio not recognized by humans but recognized by voice recognition systems to execute commands in voice command systems. basically embedding trigger words in songs. kinda evil, such as disabling airplane mode...
security  research  adversarial  audio  voice  command  recognition  embedded  trigger 
february 2018 by asteroza
RaspberryPi shield with far field mics for voice recognition, with a FPGA to accelerate things
RaspberryPi  shield  voice  recognition  far  field  microphone  array  FPGA  IoT  hardware  electronics  devices 
january 2018 by asteroza
tazti | Voice Recognition Software | Speech Recognition Software
Voice control software for windows. Just control, not dictation like Dragon Naturally Speaking
windows  voice  recogntion  control  software  assistive 
january 2018 by asteroza
ObamaNet: Photo-realistic lip-sync from text
Lyrebird moves to video. Only a few more steps before the average person can't distinguish from real video. Which leads to the endgame of never trusting any video, ever.
deep  learning  TTS  text-to-speech  sppech  machine  generated  video  lipsync  voice  emulation 
december 2017 by asteroza
Selling HR analysis services via logging sociometric badge interactions. Used in that HBR study that women interact the same at work mostly, so wage gaps are more likely to be bias based...
sociometric  interaction  logging  badge  hardware  electronics  devices  social  analytics  HR  proximity  networking  MIT  audio  frequency  voice  analysis  conversation 
october 2017 by asteroza
AI Stars | Building the world's first Celebrity AI platform
Looks like they are starting with voice synthesis to sell voice talent packages to services, so you can get famous actors/musicians to voice your GPS navigation directions. Seems to be building on existing celebrity IP/brand, but the obvious endgame is a from-scratch virtual idol...
virtual  idol  voice  talent  actor  branding  AI  artificial 
july 2017 by asteroza
Alexa Skill Testing Tool - Welcome
Browser based interface to Alexa, ostensibly for testing
amazon  alexa  voice  input  virtual  assistant  simulator  browser  interface  test  testing  Delicious 
may 2016 by asteroza
Area Code Changes
useful for looking up which new area codes will exist, for choosing a google voice number that has no bad history.
north  america  telephone  area  code  change  list  listing  reference  information  google  voice  number  selection  tips  tricks  Delicious 
july 2015 by asteroza
キーワード頭出し ボイスレコーダー
Casio has an app for search audio recordings for words (currently japanese version only, so keyword input is in hiragana/katakana), so good for voice recorder/dictation work where you need to seek/search for a particular point in time on the recording but don't remember where. English version should be coming along soonish.
keyword  voice  audio  search  iPhone  iOS  app  software  japan  japanese  Delicious 
april 2015 by asteroza
Bluetooth call recorder, which doubles as a headset in a pinch. By pairing this a s aheadset to your phone, and optionally then pairing your own personal headset or car audio system to this device, it acts as a "bump in the wire" recorder, proxying audio data as a dual mode bluetooth piconet master and slave. It's basically a MitM attack of sorts, but a necessary evil as most smartphone apps can't capture both endpoints in call audio due to security restrictions to prevent unauthorized phone tapping by apps.
bluetooth  call  voice  audio  recorder  headset  hardware  electronics  devices  indiegogo  Delicious 
january 2015 by asteroza
Hardware inline headset voice encryption/scrambling using a one time password. No jacks for a fixed phone though. All external and OTP, so harder to compromise. Your phone/phone company will hate you for using scrambled audio as the codecs hate that.
inline  headset  voice  encryption  audio  sound  external  OTP  hardware  electronics  devices  smartphone  iPhone  android  Delicious 
september 2014 by asteroza
Oh hey, natural language processing as a service...
NLP  natural  language  processing  web  API  service  english  speech-to-text  voice  recognition  SaaS  Delicious 
september 2014 by asteroza
Dumb Store
Text/SMS/voice dumbphone information gateway service. because sometimes you don't need an app for that, but you do need that info.
dumbphone  SMS  voice  access  text  service  services  mobile  app  cellphone  phone  minimal  minimalist  information  Delicious 
june 2014 by asteroza
HC1 Headset Computer - Motorola Solutions USA
Still haven't solved the problem of these things being large and fugly...
computer  control  AR  hardware  portable  voice  tilt  camera  gesture  motorola  display  computing  WinCE  HUD  head  mounted  devices  HMD  video  wearable  electronics  Delicious 
october 2012 by asteroza
Robin - your personal eyes-free assistant on the road!
A voice activated eyes free virtual driving assistant app. Not meant to be a direct competitor to Siri, more of a competitor to various navigation apps/services.
Robin  android  app  navigation  driving  virtual  assistant  voice  command  assist  search  engine  service  software 
september 2012 by asteroza
Another potential Siri competitor, focused on API connectivity. They should get together with IFTTT though.
android  Maluuba  app  software  virtual  assistant  natural  language  audio  analays  search  engine  API  voice  command  Delicious 
september 2012 by asteroza
Expect Labs - MindMeld Coming Soon
Basically a kind of recommendation engine, analyzing the last 10 minutes of conversation to predict which topics you might need to know more about, then present them to you. This might have legs.
iPad  iPhone  app  software  MindMeld  voice  conversation  audio  monitoring  topic  suggestion  automated  search  recommendation  engine  prediction  analysis  predictive  query  Delicious 
september 2012 by asteroza
Herobutton – Alerts for the consumer!
A mobile sort of IFTTT service. Basically you can voice command it to do triggered alerts/notifications based on certain tiggers/preconditions. An example is to notify you if your favorite band is coming into town.
HeroButton  iPhone  android  app  software  voice  command  virtual  assistant  trigger  alert  notification  IFTTT  Delicious 
september 2012 by asteroza
VoiceBunny: Fast and professional voice overs.
Pretty cheap. I wonder if people might take to it for custom alert messages and voice prompts?
crowdsourcing  professional  sound  recording  voice  actor  actress  service  audio  custom  message  voiceover  Delicious 
july 2012 by asteroza
Zypr | Tight
Basically, like Siri, but the backend is a stable buffer API web platform overlaid on third party web services API's, allowing you to do stuff like Siri without being aware of the true underlying API's being used. So, if some app developer wants to use Zypr, and use facebook integration services, if facebook changes their underlying API, you won't notice with Zypr as it only provides a normalized web API.
Pioneer  Zypr  natural  language  voice  speech  recognition  API  web  service  platform  Siri  cloud  mashup  webdev  programming  development  Delicious 
november 2011 by asteroza
App Store - Vocre
Apparently this is kinda expensive, in the sense that it costs $1 for 10 sentences.
Vocre  voice  dictation  iPhone  app  software  Nuance  language  translation  speech-to-text  text-to-speech  speech-to-speech  Delicious 
october 2011 by asteroza
[Wireless Japan] Docomo R&D Demoed Cloud Automatic Translation and Cloud “Butler” « Akihabara News
The cloud translation stuff from your phone is nice and all, but the killer point is this Cloud Butler. It's the beginnings of an AI personal assisant agent/virtual concierge service similar to CALO/Siri (Siri got bought by Apple!)
DoCoMo  cloud  translation  service  realtime  phone  voice  audio  android  virtual  concierge  butler  assistant  agent  AI  Delicious 
may 2011 by asteroza
« earlier      
per page:    204080120160

related tags

1.2  1.4  1.6  2.0  4.5  9V  050  802.11  access  accessibility  actions  activated  actor  actress  addon  adhoc  ads  adversarial  advertising  age  agent  aggregator  AGprojects  AI  ajax  alert  alexa  alternative  amazon  america  analays  analog  analysis  analytics  android  annotation  anonymity  anonymous  API  app  application  applications  appswift  app_flite  app_swift  AR  area  array  Art  artificial  ASP  assist  assistance  assistant  assistive  asterisk  asynchronous  ATA  Audeo  audio  audiobook  authentication  Authentify  automated  automation  autotune  Autotune.NET  AWS  b-mobile  badge  battery  beta  binaural  bing  biometric  bitrate  blackberry  blind  block  blocker  blog  blogging  blooger  bluetooth  bowser  brain  branding  bridge  browser  buffer  burner  business  butler  button  C#  cabling  calendar  call  CallerID  camera  camrivox  captcha  card  Carnegie  CDR  cellphone  center  cepstral  change  changer  changing  channel  chat  chatbot  Chaufr  chip  chrome  cleanup  click2call  client  closed  clothing  cloud  CMU  coach  coaching  code  codec  Codec2  Collaboration  command  comment  communication  communications  communicator  company  computer  computing  concierge  conference  conferencing  connector  context  continuous  control  controlled  conversation  conversion  converter  copying  correction  cortana  crackberry  creation  credit  CRM  crowdsourcing  cryptography  CSipSImple  CUDA  custom  customization  Dalek  DARPA  data  deep  Delicious  delivery  demo  detection  development  devices  dial  dialer  dictation  dictionary  DID  digital  disability  disinformation  display  disposable  distro  DIY  DoCoMo  docs  download  dragon  driving  DSP  DSTAR  dumbphone  e-book  ebook  editor  Edwin  electronics  email  embedded  emulation  encrypted  encryption  engine  engineering  english  evesdropping  evices  exchange  experiment  experimental  extension  external  factor  fake  far  fashion  feature  festival  FestVox  field  file  firmware  flikr  flite  food  form  forwarding  FPGA  free  FreePBX  frequency  gadgets  game  gargoyle  gateway  generated  generative  generator  gesture  gizmoproject  glogger  google  googletalk  GPRS  GPU  GrandCentral  GSM  Gtalk  GTD  guide  gun  GVdialer  hacking  HAM  handicapped  handsfree  haptic  hardware  hatsune  HCI  head  headset  health  helmet  HeroButton  HID  hidden  HMC  HMD  hospital  hosted  howto  HR  HUD  human  humor  hybrid  hysteria  IBM  IDE  idea  identification  identity  idol  IFTTT  IM  image  images  impaired  impulse  incredible  indiegogo  Inferret  infinitalk  information  inline  input  installation  instant  integrated  interaction  intercept  interface  internet  internet-of-things  interoperability  iOS  IoT  iotum  IP  IP-PBX  iPad  iPhone  IPPBX  Iris  IVR  jabber  Jabbin  japan  japanese  java  javascript  Jeannie  JIbbigo  jingle  Jott  JR  keyword  kickstarter  kit  koe-tan  labs  language  laser  launcher  learning  Lebedev  Levelator  lifehacks  line  linguistics  linux  lipsync  list  listing  live  livecast  local  lock  logging  lookup  loop  low  mac  machine  macro  magic  Maluuba  management  manipulation  marketing  mashup  mass  media  Mellon  memory  MERL  mesh  message  messaging  mic  microblogging  microphone  microsoft  miku  military  MindMeld  minimal  minimalist  mining  MIT  mobile  model  module  monitoring  mosophre  motorola  mount  mounted  movable  MSN  multifactor  multiple  mumble  music  narration  natural  navigation  NEC  neckband  nerdvittles  nerve  network  networking  NICT  NLP  noise  normalization  north  note  notetaking  notification  Novauris  now  NSX-1  Nuance  number  Nvidia  OCS  office  OK  online  OOBA  open  opensource  organization  OS  OSX  OTP  out-of-band  outsourcing  package  paging  patch  PBX  pebble  personal  personalized  phone  phonecall  photos  phreaking  pictures  Pioneer  pitch  plasma  platform  plugin  podcast  popcorn  POPINATOR  portable  powered  prank  pranking  prediction  predictive  presence  presentation  privacy  private  processing  productivity  professional  programming  project  prompt  prompts  Promptu  proof-of-concept  proximity  proxy  PSTN  PTT  public  push-to-talk  python  Qualcomm  query  question  radio  railroad  railway  RaspberryPi  ray  realtime  recall  recognition  recogntion  recommendation  recongition  recorded  recorder  recording  recordings  redirect  redirector  RedPhone  reference  relevant  reminder  remote  removal  replication  reQall  research  response  responsepoint  RIM  Robin  routing  RP  s60  SaaS  say2go  scifi  SDR  SEAL  search  security  selection  sensor  server  service  services  setup  ShadowNumber  sharepoint  shield  shifter  shifting  short  SightSpeed  signal  silent  SIM  SIMPLE  simulation  simulator  Simulscribe  singing  SIP  Siri  skill  skills  skype  small  smartphone  sms  SNS  social  sociometric  softphone  software  sound  source  spam  spanish  speaking  speech  speech-to-speech  speech-to-text  Speereo  sphinx  sphinx-4  Sphynix  spinvox  SPIT  spoofing  sppech  SRM  SRTP  SSB  standard  standards  stickie  storage  streaming  subway  suggestion  swift  Swype  symbian  synthesis  synthesizer  synthetic  sysadmin  talent  Talk-Now  TalkPlus  teamspeak  technology  telepathy  telephone  telephoney  telephony  Teleport  terminal  terrorism  test  testing  text  text-to-speech  ThePudding  thought  tilt  timeline  timetable  tips  tokyo  tools  toolsuite  topic  toys  train  training  transcription  transit  translate  translater  translation  translator  travel  tree  tricks  trigger  TTS  turing  turnkey  tutorial  Twilio  twitter  type  typing  Ubi  ubiquitous  UC  UI  unified  update  uploading  usage  USB  use  utilities  utter  Utterz  VDS  ventrillo  verification  video  virtual  vision  vlog  vlogger  vlogging  vocal  vocaloid  Vocera  vocoder  Vocre  voice  voice-to-text  VoiceForge  Voicelok  voicemail  voiceover  voicexml  voip  volume  w3c  walkietalkie  wallplug  wavenet  wear  wearable  web  webdev  webRTC  widget  wifi  WinCE  windows  wireless  wiretapping  WLAN  WVoIP  XCAP  xml  XMPP  XT9  yahoo  Yamaha  YIM  Zello  ZRTp  Zypr 

Copy this bookmark: