reinforcement   718

« earlier    

What’s New in Deep Learning Research: Microsoft’s TextWorld is the OpenAI Gym of Language Learning…
Conversational interfaces and natural language processing(NLP) are, arguably, the most widely adopted segment of modern artificial intelligence(AI). Despite the continuous progress in NLP research…
textworld  reinforcement  learning  text  based  game  zork 
11 days ago by tranqy
Reinforcement Learning: Basic Tic-Tac-Toe Implementation
Reinforcement learning puts idea of gaining maximum reward after performing some action from environment. This learning method is very different from common machine learning methods such as…
ml  ai  reinforcement  learning  tictactoe  python 
11 days ago by tranqy
Modern Game Theory and Multi-Agent Reinforcement Learning Systems
Most artificial intelligence(AI) systems nowadays are based on a single agent tackling a task or, in the case of adversarial models, a couple of agents that compete against each other to improve the…
marl  multi  agent  reinforcement  learning  game  theory 
11 days ago by tranqy

« earlier    

related tags

3d  abduction  accuracy  acmtariat  action  aesthetics  agent  ai-control  ai  algorithmic_trading  algorithms  alife  alignment  alpha-go-zero  alpha-go  alphago  animation  architecture  art  articulated  astonishing  augmentation  avoidance  backtracking  bandit  based  berkeley  blog:  bounded  brands  capitalism  character  clever-rats  clustering  cog-psych  commerce  communication  community  computer  connection  containers  course  ctf  curiosity  dark-arts  decision  deep-learning  deep  deepgoog  deeplearning  deepmind  devops  dotai  dotscale  driving  dwango  error  evolution  evopsych  experience  explanation  exposition  facebook  favorites  first  for  game  games  gaming  generation  geography  github  go  google-io  google  groups  gym  hacks  hierarchical  hmm  human-ml  hyperparameters  image  inference  influence  interests  intricacy  inverse  iteration-recursion  join  jupyter  kaggle  keras  kubernetes  language  learning  limit_order_model  logging  machiavelli  machine-learning  machine  machine_learning  machinelearning  macro  making  manipulation  market_impact  marl  membership  meta  meta:prediction  minute  ml  models  motion  multi  music  network  neural  neural_networks  neuro-nitgrit  neuro  neurons  nibble  nlp  notebook  object  openai  org:bleg  paper  papers  participation  person  portfolio_algorithm  portfolio_strategy  predictive-processing  probabilistic  program  prover  psychology  python  pytorch  rationality  ratty  reinforcement-learning  relational  relationships  representation  reproducibility  research-program  research  resources  restoration  review  reviews  rl  robotic...  robotic  roots  scalable  scipy  search  shiny  siggraph  simulation  speculation  ssc  study  successor  summary  synthesis  tax  taxonomy  tensorflow  text-adventures  text  textworld  theorem  theory  tictactoe  tilecoding  tradeoffs  training  transfer  tumblr  tutorial  two  values  verification  video  vision  volo-avolo  wire-guided  world_models  yvain  zork   

Copy this bookmark: