QuickTopic (SM) free message boards QuickTopic (SM) free message boards
Skip to Messages
  Sign In to access your topic list  |New Topic |My Topics|Profile
Upgrade to Pro   Customize, show pictures, add an intro, and more:   QuickTopic Pro...and check out QuickThreadSM
Topic: CSE 151 in Fall 2008
Printer-Friendly Page
All messages    << 150-153  149-149 of 153  133-148 >>
About these ads
Who | When
Messagessort recent-top    (not accepting new messages)
Charles Elkan  149
12-09-2008 11:07 PM ET (US)
/m147, /m148: Evaluating the policy cannot be part of the agent's learning process. It is simply a way for you the programmer to measure the success of the learning process you design.

Since the agent could never evaluate a policy with policy iteration (PI), the learning process cannot decide to stop based on this.

However, after the agent stops learning using a heuristic, then you the programmer can run PI. Your goal is to invent a heuristic that the agent can use to stop as quickly as possible with a policy that is good as possible.
RSS link What's this?
All messages    << 150-153  149-149 of 153  133-148 >>
QuickTopicSM message boards
Over 200,000 topics served
Learn more Frequently asked questions  Acknowledgements
What they're saying about QuickTopic
 Questions, comments, or suggestions? Contact Us
Read our use policy before beginning. We value your privacy; please read our privacy statement.
Copyright ©1999-2008 Internicity Inc. All rights reserved.