| Who | When |
Messages | |
|
|
|
| Erik Corona
|
147
|
 |
|
06-03-2007 03:00 AM ET (US)
|
|
I'm a bit confused on problem 10.1. I'm not 100% convinced I did it right. Is the format of the answer to the Result(a,s) function?
|
| Erik Corona
|
146
|
 |
|
06-03-2007 02:26 AM ET (US)
|
|
Yeah, I'm at around 580 spam and 290 ham/non-spam e-mails. It would be nice to have a standard way of splitting the messages so that I can output the correct number of 0's and 1's.
|
Stephen Boyd
|
145
|
 |
|
06-02-2007 05:48 PM ET (US)
|
|
I think that splitting on return-paths causes there to be problems. Consider the message on line 12569, in which the attached message has a return-path but no From line in the header. This is really one message with the bounced message attached but with the return-path technique it will be split into two messages. Doing a simple "[Ff]rom .*@" search I get 598 messages for the spam and 293 ham. Maybe it would be best if we were given a standard way to split messages, because I don't think 20 smaller chunks will help.
|
| Kei Shun Ma
|
144
|
 |
|
06-02-2007 05:10 PM ET (US)
|
|
Edited by author 06-02-2007 05:10 PM
I think one of the problem is that the email given is normally started with a sentence like "From abc@def WEEKDAY MONTH(one or two space)DATE TIME YEAR" and then the second line is "Return-Path: xxxx"
However in both spam and hams emails, some miss the first line and start with the second line "Return-Path: xxx" instead, so the numbers I found is by counting the lines start with "Return-Path: ", but I am not sure if there is any email missing both the first and second line....
|
mhtong
|
143
|
 |
|
06-02-2007 02:41 PM ET (US)
|
|
600 and 300 message files were used (+/- 1, I'd say). *Supposedly* breaking them apart is easy, but it's not something I've had to do, honestly. If people are having problems getting consistent separation, would it be helpful if the test set were in, say, 20 email chunks?
|
| Josh
|
142
|
 |
|
06-02-2007 12:21 AM ET (US)
|
|
Edited by author 06-02-2007 12:27 AM
for 11.4, i envision the problem a block world-like place, but where left position is A, middle position is B, and right position is C.
if so, shouldnt there also be an "At" fluent that is true if (box,monkey,banana) is at place A,B, or C? Thus we need a Move action for the monkey to go to and from positions?
|
| Kei Shun Ma
|
141
|
 |
|
06-01-2007 11:21 PM ET (US)
|
|
The numbers I get is 611 spams and 297 hams, but I am not sure it is correct or not...
|
| Steffan McMurrin
|
140
|
 |
|
06-01-2007 05:59 PM ET (US)
|
|
Anyone know yet how many spam/ham messages are in each of the files? Is there any best way to split up the emails other than trying to recognize what combination of strings a new email starts as? I'm sometimes having trouble figuring out when a new email starts and ends when I'm looking at the list... let alone abstracting the beginning or the end.
Right now i'm counting 608 spam, and 348 ham
|
mhtong
|
139
|
 |
|
06-01-2007 04:14 AM ET (US)
|
|
Just to be clear. Both the learner and the classifier take in the name of the email file (as text) and call the preprocessor to generate the features. The classifier does not call the learner, since it can be run on both the training and testing data.
|
| mhtong
|
138
|
 |
|
06-01-2007 01:01 AM ET (US)
|
|
For 11.4, there's also a "holding" fluent, which you should assume is false in the initial state. I'm not sure why it wasn't italicized in R&N.
|
| mhtong
|
137
|
 |
|
06-01-2007 12:59 AM ET (US)
|
|
Your answer isn't really what we're looking for. (Is "Water" a constant? What does it refer to?) Generally with logic we've done things like "forall x,y Dog(x) ^ Bone(y) -> Likes(x,y)". There's actually an example in the text of almost this exact same problem you ask about... Basically, use as basic and general propositions as possible. For (b) you shouldn't need to invent a proposition like IsLiquidBetween(...), but should identify the conditions under which something like IsLiquid(x) is true. I think it should be pretty obvious from the text what extensions are relevant.
|
mhtong
|
136
|
 |
|
05-31-2007 01:17 PM ET (US)
|
|
I'll be late for my office hours (probably at least half an hour). I'll still stay an hour from whenever I arrive.
|
| Josh
|
135
|
 |
|
05-31-2007 01:45 AM ET (US)
|
|
for problem 10.5 in the homework, it seems like there are a lot of "representations developed in the chapter" so I was wondering if something like this is what they are looking for--
b- "water boils at 100 degrees" answer- BoilsAt(Water, 100 degrees)
Also for the second part, i can't find any examples in the book of using the mereological approach. is it using the partof() and other like operators?
Thanks
|
mhtong
|
134
|
 |
|
05-30-2007 05:23 PM ET (US)
|
|
No, that simply refers to the report. I was pretty tired when I finalized it, I'll clarify the assignment. Thanks!
|
| Kei Shun Ma
|
133
|
 |
|
05-30-2007 02:17 PM ET (US)
|
|
I found that README/ README header were mentioned several times in PA4, does it mean besides the normal comment and the written report, we need to create a file README to describe how the program works?
|
mhtong
|
132
|
 |
|
05-30-2007 04:24 AM ET (US)
|
|
The ham training set was posted this afternoon.
|