deepmind   611

Learning by playing | DeepMind
Our new paper proposes a new learning paradigm called ‘Scheduled Auxiliary Control (SAC-X)’ which seeks to overcome the issue of exploration in control tasks. SAC-X is based on the idea that to learn complex tasks from scratch, an agent has to learn to explore and master a set of basic skills first. Just as a baby must develop coordination and balance before she crawls or walks—providing an agent with internal (auxiliary) goals corresponding to simple skills increases the chance it c...
deepmind  neuralnetworks 
17 days ago by e2b

