[Ligncse256] POSTagger Trigrams
Brent Payne
brpayne at cs.ucsd.edu
Wed Feb 18 08:56:55 PST 2009
We smoothed ours so that every possibility had some probability. We still encounter -Infinity paths when decoding, so we might not have smoothed enough.
----- Original Message -----
From: "Andrea Biaggi" <abiaggi at cs.ucsd.edu>
To: ligncse256 at ling.ucsd.edu
Sent: Wednesday, February 18, 2009 12:16:11 AM GMT -08:00 US/Canada Pacific
Subject: [Ligncse256] POSTagger Trigrams
I have a question how to handle trigrams for P(t_i|t_{i-1}t_{i-2}).
It is always the case that this probability is >0 for every combinations
of tags in the validation/test set using counts of the trigram in the
training set?
Or we have to use a backoff/interpolation schema to handle all the
possible cases? Or use some other techniques? Or just saying that it is
= 0, therefore not a possible combination?
--
Andrea Biaggi abiaggi at cs.ucsd.edu
_______________________________________________
Ligncse256 mailing list
Ligncse256 at ling.ucsd.edu
http://pidgin.ucsd.edu/mailman/listinfo/ligncse256
More information about the Ligncse256
mailing list