QuickTopic (SM) free message boards
Topic: Implicit Imitation in Multi-agent Reinforcement Learning
Greg Hamerly (message 3)
04-23-2002 02:43 PM ET (US)
One thing I liked about this paper is the experiment in figures 3 & 4, in which the mentor gives misleading information. Clearly, a training signal that is better than random (from a mentor with a correct policy) will improve performance -- but what about evil or error-prone mentors? This experiment speaks to those cases.

Obviously, the constraint that the mentor's state space be a subset of the observer's is a strict one; can anyone comment on how well this restriction can be overcome?
Copyright ©1999-2008 Internicity Inc. All rights reserved.