speech-recognition   337


GitHub - intel/rtos-drv-intel-s1000
"MCU/RTOS driver for Sue Creek speech and audio processor. This driver implements the APIs to applications and the underlying control mechanisms for managing audio capture, playback, wakeword notification, etc."
The Sue Creek S1000 processor, from Intel, with on-chip speech recognition. Part of the Quark family.
(Not sure what that means tho, is it a trained NN on the chip, or just a lot of fancy vector instructions?)
repo:github  intel  programming  speech-recognition  privacy  surveillance 
yesterday by mechazoidal
CMUSphinx Open Source Speech Recognition
CMUSphinx is an open source speech recognition system for mobile and server applications. Supported languages: C, C++, C#, Python, Ruby, Java, JavaScript. Supported platforms: Unix, Windows, iOS, Android, hardware.
voice  transcription  opensource  speech-recognition  speech-to-text  voice-recognition 
12 weeks ago by lucaswilric
Home - Vocalize
"We have leveraged decades of experience from the field of Audiology and created a software suite designed to evaluate the hearing capabilities of AI powered virtual assistants such as Alexa, Siri, Cortana, Bixby and Google Assistant. We apply our protocols across spoken language variables such as volume level, background noise, distance and cadence. This creates a detailed description of the virtual assistant hearing ability."
speech-recognition  voxable  add-to-list 
september 2018 by techpeace
These companies are shrinking the voice recognition 'accent gap' | VentureBeat
By the end of 2018, the Google Assistant will support over 30 languages. Qualcomm has developed on-device models that can recognize words and phrases with 95 percent accuracy. And Microsoft’s call center solution is able to transcribe conversations more accurately than a team of humans.

[Speechmatics'] language pack, dubbed Global English, is the result of thousands of hours of speech data from over 40 countries and “tens of billions” of words. It supports “all major” English accents for speech-to-text transcription, and it’s built on the back of Speechmatics’ Automatic Linguist, an AI-powered framework that learns the linguistic foundations of new languages by drawing on patterns identified in known ones.

Nuance says it employs several methods to ensure its voice recognition models understand speakers of the roughly 80 languages its products support equally well.
voice  accents  speech-recognition  speechmatics  nuance 
september 2018 by lavallee


