speech-to-text   156

« earlier    

Targeted Audio Adversarial Examples
This is phenomenal:
We have constructed targeted audio adversarial examples on speech-to-text transcription neural networks: given an arbitrary waveform, we can make a small perturbation that when added to the original waveform causes it to transcribe as any phrase we choose.

In prior work, we constructed hidden voice commands, audio that sounded like noise but transcribed to any phrases chosen by an adversary. With our new attack, we are able to improve this and make an arbitrary waveform transcribe as any target phrase.

The audio examples on this page are impressive -- a little bit of background noise, such as you might hear on a telephone call with high compression, hard to perceive if you aren't listening out for it.

Paper here: https://arxiv.org/abs/1801.01944

(Via Parker Higgins, https://twitter.com/xor )
papers  audio  adversarial-classification  neural-networks  speech-to-text  speech  recognition  voice  attacks  exploits  via:xor 
january 2018 by jm
Speech API - Speech Recognition  |  Google Cloud Platform
Inexpensive (at least for my personal use) and probably pretty damn accurate speech-to-text from Google. Could be useful for transcribing my voice logs. Doesn't look like there's a web interface for it, but I should be able to hack together something pretty quickly to transcribe mp3 files (hopefully).
speech-to-text  google-cloud  google 
january 2018 by docwlad
Sample Applications  |  Google Cloud Speech API Documentation  |  Google Cloud Platform
Sample applications for the Google speech-to-text. Python's probably the easiest to get up and running quickly. Check out the API too.
speech-to-text  google-cloud  google  documentation 
january 2018 by docwlad

« earlier    

related tags

_share  academic_software  accessibility  acoustic  adobe  adversarial-classification  ai  alignment  an  android  api  app  applications  apps  asr  assistive-technology  assistive  attacks  audio  automatic  bbc  boris-soundbite  boris  browser  c#  c-print  captioning  captions  cart  chrome  cloud  code  cognitive-services  cognitive  comprehension  computer  conference  conferencecalls  david_pogue  deaf  dealhacker  delicious  dev  developers  dialoog  diarization  dictation  digital  documentation  downloads  dragon  dragon_dictation  education  education_tech  efficiency  en  engine  english  evernote  exploits  final-cut-pro  for  forced-aligner  forced-alignment  free  funnelback  funny  github  go  google-cloud  google  google_chrome  hci  howto  html5  human  ibm  icommunicator  ifttt  improved  interface  internet-of-things  ios  ios_assistant  iphone  japan  japanese  jarvis  jasper  java  jm  kaldi  language  lanugage  launches  learning  lens  liberated-learning  liberated  library  lifehacker  linguistics  linux  live  localisation  mac  machine  machinelearning  macrumors  manual-dictation  marketing  media  meeting  microphone  microsoft  mobile  mobiledev  mozilla  natural  network  neural-networks  neural  nlp  notetaking  nuance  nytimes  online  open-source  opensource  papers  paranoia  pepnet  perception  phonetics  pi  pocketsphinx  podcast  podcasting  preferences  premiere-pro  privacy  processing  productivity  programming  python  raspberry-pi  raspberry  realtime  recognition  recorder  reddit  remoteworking  research  respeaking  saas  sdk  search  search_engine  service...  service  services  siri  social  software  sound  soundbite  spanish  speaking  speech-recognition  speech-to-speech  speech  speech_recognition  speechkit  speechrecognition  speed  speex  sst  stenography  stt  subtitles  supercut  techcrunch  technology  tensorflow  text-processing  text-to-speech  text  textrecognition  thunderblog  to  tool  tools  transcribe  transcript  transcription  translation  tts  typewell  unsorted_bookmarks  unsorted_bookmarks​​​​​​​  unsorted_bookmarks​​​​​​​​​​​​​​​​  utilities  utility  vendor  vfx  video-making-tools-accessories  video  vista  vocre  voice-recognition  voice  voip  watson  web  webapps  webdev  webservices  windows  windows_downloads  wired  word  writing  youtube     

Copy this bookmark:



description:


tags: