QuickTopic (SM) free message boards
Topic: CSE 151 in Fall 2008
Charles Elkan  146
12-09-2008 06:20 PM ET (US)
You could use the definition V(start) = Q(start,a) where a is the action recommended by the final learned policy.

However, the final Q values may not be perfectly accurate, so it is better to do what the project description says: use policy evaluation to measure the goodness of the final learned policy.
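
To illustrate the difference between the two measures, here is a minimal sketch in Python. Everything in it is an assumption made for illustration and is not taken from the project description: the three-state MDP, the `transitions` table, the hand-written `Q` values, the discount factor of 0.9, and the helper names `greedy_policy` and `evaluate_policy`. It reads the greedy policy off an imperfect Q table, then runs iterative policy evaluation on the known model and compares V(start) with Q(start, a).

```python
# Hypothetical toy MDP: transitions[s][a] is a list of (prob, next_state, reward).
# State 2 is terminal (it loops to itself with zero reward).
transitions = [
    [[(1.0, 1, 0.0)], [(1.0, 2, 1.0)]],   # state 0 (start)
    [[(1.0, 2, 5.0)], [(1.0, 0, 0.0)]],   # state 1
    [[(1.0, 2, 0.0)], [(1.0, 2, 0.0)]],   # state 2 (terminal)
]

# Made-up "learned" Q values that are deliberately not fully accurate.
Q = [
    [3.0, 1.0],
    [4.0, 0.5],
    [0.0, 0.0],
]


def greedy_policy(q_table):
    """Pick, for each state, the action with the highest Q value."""
    return [max(range(len(row)), key=lambda a: row[a]) for row in q_table]


def evaluate_policy(transitions, policy, gamma=0.9, tol=1e-10):
    """Iterative policy evaluation: sweep the Bellman expectation
    backup for a fixed policy until the values stop changing."""
    values = [0.0] * len(transitions)
    while True:
        delta = 0.0
        for s in range(len(transitions)):
            a = policy[s]
            new_v = sum(p * (r + gamma * values[s2])
                        for p, s2, r in transitions[s][a])
            delta = max(delta, abs(new_v - values[s]))
            values[s] = new_v
        if delta < tol:
            return values


start = 0
policy = greedy_policy(Q)
V = evaluate_policy(transitions, policy)

print("Q(start, a) from the learned table:", Q[start][policy[start]])
print("V(start) from policy evaluation:   ", V[start])
```

In this toy case the learned table reports 3.0 for the start state, while policy evaluation gives 4.5 for the same policy, so the Q-based estimate understates how good the learned policy is. If the transition model is not available, averaging the discounted return over many simulated episodes that follow the policy would be one alternative way to estimate the same quantity.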