speech-to-text   167

« earlier    

CMUSphinx Open Source Speech Recognition
CMUSphinx is an open source speech recognition system for mobile and server applications. Supported languages: C, C++, C#, Python, Ruby, Java, Javascript. Supported platforms: Unix, Windows, IOS, Android, hardware.
voice  transcription  opensource  speech-recognition  speech-to-text  voice-recognition 
12 weeks ago by lucaswilric
Eva by Voicera | Artificial Intelligence Assistant
Eva is an artificial intelligence assistant that listens, takes notes and helps capture the important moments in your meetings. Activate your meetings. Try it now.
meeting  speech-to-text  transcription  productivity 
june 2018 by shoesiq
Targeted Audio Adversarial Examples
This is phenomenal:
We have constructed targeted audio adversarial examples on speech-to-text transcription neural networks: given an arbitrary waveform, we can make a small perturbation that when added to the original waveform causes it to transcribe as any phrase we choose.

In prior work, we constructed hidden voice commands, audio that sounded like noise but transcribed to any phrases chosen by an adversary. With our new attack, we are able to improve this and make an arbitrary waveform transcribe as any target phrase.

The audio examples on this page are impressive -- a little bit of background noise, such as you might hear on a telephone call with high compression, hard to perceive if you aren't listening out for it.

Paper here: https://arxiv.org/abs/1801.01944

(Via Parker Higgins, https://twitter.com/xor )
papers  audio  adversarial-classification  neural-networks  speech-to-text  speech  recognition  voice  attacks  exploits  via:xor 
january 2018 by jm
Speech API - Speech Recognition  |  Google Cloud Platform
Inexpensive (at least for my personal use) and probably pretty damn accurate speech-to-text from Google. Could be useful for transcribing my voice logs. Doesn't look like there's a web interface for it, but I should be able to hack together something pretty quickly to transcribe mp3 files (hopefully).
speech-to-text  google-cloud  google 
january 2018 by docwlad
Sample Applications  |  Google Cloud Speech API Documentation  |  Google Cloud Platform
Sample applications for the Google speech-to-text. Python's probably the easiest to get up and running quickly. Check out the API too.
speech-to-text  google-cloud  google  documentation 
january 2018 by docwlad

« earlier    

related tags

_share  accessibility  acoustic  adobe  adversarial-classification  ai  alignment  an  android  api  app  applications  apps  archive  asr  assistive-technology  assistive  attacks  audio  automatic  bbc  boris-soundbite  boris  browser  c-print  captcha  captioning  captions  cart  chrome  cloud  code  cognitive-services  cognitive  comprehension  computer  conference  conferencecalls  deaf  dealhacker  delicious  dev  developers  dialoog  diarization  dictation  digital  diigo  documentation  downloads  dragon  dragon_dictation  education  efficiency  engine  english  evernote  exploits  facebook  final-cut-pro  for  forced-aligner  forced-alignment  free  funnelback  funny  github  go  google-cloud  google  google_chrome  hci  howto  html5  human  ibm  icommunicator  ifttt  improved  interface  internet-of-things  ios  ios_assistant  iphone  japanese  jarvis  jasper  java  jm  kaldi  language  lanugage  launches  learning  lens  liberated-learning  liberated  library  lifehacker  linguistics  live  localisation  mac  machine  machinelearning  macrumors  manual-dictation  marketing  meeting  microphone  microsoft  mobile  mobiledev  mozilla  natural  network  neural-networks  neural  neuralnetworks  nlp  nuance  online  open-source  opensource  papers  pepnet  perception  phonetics  pi  pocketsphinx  podcast  podcasting  preferences  premiere-pro  processing  productivity  programming  python  raspberry-pi  raspberry  realtime  recognition  recorder  remoteworking  research  respeaking  saas  sdk  search  search_engine  security  service...  service  services  siri  software  sound  soundbite  spanish  speaking  speech-recognition  speech-to-speech  speech  speech_recognition  speechkit  speechrecognition  speed  speex  sst  stenography  stt  subtitles  supercut  techcrunch  technology  tensorflow  text-processing  text-to-speech  text  textrecognition  thunderblog  to  tool  toolkit  tools  transcribe  transcript  transcription  translation  tts  tv  typewell  unsorted_bookmarks  unsorted_bookmarks​​​​​​​  unsorted_bookmarks​​​​​​​​​​​​​​​​  utilities  utility  vendor  vfx  video  vocre  voice-recognition  voice  watson  wavenet  web  webapps  webdev  webservices  windows  windows_downloads  wired  word  writing  youtube     

Copy this bookmark: