QuickTopic (SM) free message boards
Topic: Balancing multiple sources of reward in reinforcement learning
Yohan Kim  5
04-25-2002 03:29 PM ET (US)
I would have liked more on why the equation estimating the return (equation 2) takes the form it does. What guided this choice?

Question concerning equation 5:
Gradient descent was used to arrive at a solution for alpha_s(x). I was wondering whether the author could guarantee that the value arrived at is the global minimum.
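
To make the concern concrete, here is a minimal sketch on a made-up one-dimensional non-convex function. It is not the paper's equation 5, and the function, step size, and starting points are all assumptions chosen for illustration. It only shows that plain gradient descent can settle in different minima depending on where it starts, so by itself it gives no global guarantee unless the objective is convex.

```python
# Illustrative only: f is a hypothetical non-convex objective,
# not the objective from the paper under discussion.

def f(x):
    """Toy objective with one global and one local minimum."""
    return x**4 - 3 * x**2 + x


def grad_f(x):
    """Analytic derivative of f."""
    return 4 * x**3 - 6 * x + 1


def gradient_descent(x0, lr=0.01, steps=2000):
    """Run fixed-step gradient descent from x0 and return the final point."""
    x = x0
    for _ in range(steps):
        x -= lr * grad_f(x)
    return x


# Two runs that differ only in their starting point.
left = gradient_descent(-2.0)
right = gradient_descent(2.0)

print(f"start -2.0 -> x = {left:.4f}, f(x) = {f(left):.4f}")
print(f"start  2.0 -> x = {right:.4f}, f(x) = {f(right):.4f}")
```

Both runs stop at a point where the gradient vanishes, but the run started on the right ends at a higher value of f than the one started on the left. A common practical hedge is to restart from several initial points and keep the best result, though that still does not prove global optimality.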