reinforcement   765

« earlier    

Why we shouldn't like coffee, but we do: Weirdly, people with a higher sensitivity to bitter caffeine taste drink more coffee -- ScienceDaily
Hypothesizing as to why people sensitive to bitter taste should not like caffeine but oft do: they learn to associate caffeine’s bitter taste with energy
research  coffee  tea  wine  caffeine  taste  genetics  uk  behavior  modification  reinforcement  food  drink 
4 weeks ago by csrollyson
Key papers in deep RL
"What follows is a list of papers in deep RL that are worth reading. This is far from comprehensive, but should provide a useful starting point for someone looking to do research in the field."
machinelearning  deep  reinforcement  learning 
5 weeks ago by lucastheis
Atari - Reinforcement Learning in depth 🤖 (Part 1: DDQN)
In today’s article, I am going to show you how to implement one of the most groundbreaking Reinforcement Learning algorithms - DDQN (Double Q-Learning). After the end of this post, you will be able…
ai  atari  ml  reinforcement  learning 
6 weeks ago by tranqy

« earlier    

related tags

3d  abduction  accuracy  aesthetics  agent  ai  algorithmic_trading  alife  allocation  alpha-go-zero  alpha-go  alphago  architecture  art  artificial  atari  augmentation  automata  based  baseline  behavior  berkeley  best-practices  bestpractices  bias  bioshotcrete  blog:  caffeine  causal  cloud  cnn  coffee  cog-psych  construction  cooperation  course  ctf  curiosity  dark-arts  deep-learning  deep-q  deep  deeplearning  deepmimic  deepmind  diversity  dopamine  drink  drone  dual  dwango  error  evolution  evopsych  exposure  facebook  falsification  first  food  for  framework  game  gan  generation  genetics  go  google  gym  hierarchical  hmm  inference  intelligence  intricacy  intrinsic  inverse  join  learning  library  limit_order_model  logic  lstm  machiavelli  machine-learning  machine  machine_learning  machinelearning  manipulation  market_impact  marl  medium  meta  meta:prediction  ml  models  modification  molecule  motivation  movement  multi-agent  multi  multiagent  music  navigation  networks  neural  neural_networks  neuro-nitgrit  neuro  neurons  nlp  object  openai  opensource  optimal  pareto  person  policy  portfolio_algorithm  portfolio_strategy  predictive-processing  process  program  programming  prover  proving  psychology  python  pytorch  qlearning  reddit  reinforcement-learning  reinforcementlearning  relational  research  resource  reviews  rl  robotic...  robotic  robotics  roots  scalable  scipy  search  shiny  shotcrete  speculation  spray  ssc  structure  study  survey  synthesis  taste  tea  temporal  tensorflow  text-adventures  text  textworld  theorem  theory  tictactoe  training  transfer  tumblr  tutorial  uav  uk  wine  wire-guided  world_models  youtube  yvain  zork   

Copy this bookmark: