QuickTopic (SM) free message boards QuickTopic (SM) free message boards
Skip to Messages
  Sign In to access your topic list  |New Topic |My Topics|Profile
Upgrade to Pro   Customize, show pictures, add an intro, and more:   QuickTopic Pro...and check out QuickThreadSM
Topic: Implicit Imitation in Multi-agent Reinforcement Learning
Views: 258, Unique: 184 
Subscribers: 0
What's
this?
Printer-Friendly Page
Subscribe to get & post, or stop messages by email Subscribe
All messages    << 5-6  4-4 of 6  1-3 >>
About these ads
Who | When
Messagessort recent-top   
Post a new message
 
Dana Dahlstrom  4
04-23-2002 04:17 PM ET (US)
Dave:

I wholeheartedly agree the exposition in this paper is  far  from
an  ideal  elucidation  of  their  ideas.  Perhaps it's because I
lacked some of the necessary background, but I  really  struggled
to understand it in detail.

I don't think they've assumed the mentor and  the  observer  have
the same goal(s), however. As you point out, they stress that the
mentor and the observer  don't  need  to  have  the  same  reward
function  for their technique to work. The example illustrated in
figures 9 and 10 is their attempt to demonstrate this.

Greg:

This is sort of a fine point, but in figures  3  and  4  I  don't
think  the  mentor  is giving misleading information; rather, the
learner's   prior   beliefs   about   the   mentor's   transition
probabilities  are  doing the damage. I believe the mentor itself
is actually following an optimal policy.

This kind of misunderstanding is easy to make,  especially  being
that their explanation of how the priors are computed is (for me)
less than adequate.
RSS link What's this?
All messages    << 5-6  4-4 of 6  1-3 >>
QuickTopicSM message boards
Over 200,000 topics served
Learn more Frequently asked questions  Acknowledgements
What they're saying about QuickTopic
 Questions, comments, or suggestions? Contact Us
Read our use policy before beginning. We value your privacy; please read our privacy statement.
Copyright ©1999-2008 Internicity Inc. All rights reserved.