QuickTopic (SM) free message boards QuickTopic (SM) free message boards
Skip to Messages
  Sign In to access your topic list  |New Topic |My Topics|Profile
Upgrade to Pro   Customize, show pictures, add an intro, and more:   QuickTopic Pro...and check out QuickThreadSM
Topic: Sequential cost-sensitive decision-making with reinforcement learning
Views: 332, Unique: 220 
Subscribers: 0
What's
this?
Printer-Friendly Page
Subscribe to get & post, or stop messages by email Subscribe
All messages    << 7-9  6-6 of 9  1-5 >>
About these ads
Who | When
Messagessort recent-bottom   
Post a new message
 
Dana Dahlstrom  6
04-04-2002 03:24 PM ET (US)
[follow-up to previous post]

Actually, looking forward, my proposed patch would
require modification to Equations 5 and 7 (the
Q-learning and sarsa update rules) as well. It's
probably better to revise the notation the other way:

1. Change the descriptions of the state, action, and
reward sequences to begin with zero elements.

2. Change the summations in Equations 1 and 2 to begin
with $t=0$.

3. Change $\gamma^{t-1}$ to $\gamma^t$ in Equation 1.

This way action $a_t$ is performed in state $s_t$ and
reward $r_t$ results; I believe this is conventional.
RSS link What's this?
All messages    << 7-9  6-6 of 9  1-5 >>
QuickTopicSM message boards
Over 200,000 topics served
Learn more Frequently asked questions  Acknowledgements
What they're saying about QuickTopic
 Questions, comments, or suggestions? Contact Us
Read our use policy before beginning. We value your privacy; please read our privacy statement.
Copyright ©1999-2008 Internicity Inc. All rights reserved.