For programming assignment 2, we are going to create an HMM-based part-of-speech tagger. I am planning to use a frequency-based approach (relative-frequency counts from the tagged training data) to estimate the HMM's transition and emission probabilities, along with some special handling for previously unseen words. Is there anything wrong with this approach?

Thanks,
Jacob
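
P.S. To make the plan concrete, here is a rough sketch of what I have in mind. It is only an illustration under assumptions: add-alpha (Laplace) smoothing and a single `<UNK>` pseudo-word stand in for the "special handling" of unseen words, which I have not settled on, and the names `train`, `p_trans`, `p_emit`, `START`, and `UNK` are placeholders of mine, not anything from the assignment.

```python
from collections import Counter, defaultdict

START, UNK = "<s>", "<UNK>"  # hypothetical marker names


def train(tagged_sentences, alpha=1.0):
    """Count-based HMM estimates with add-alpha smoothing (an assumed choice)."""
    trans = defaultdict(Counter)  # trans[prev_tag][tag] = count
    emit = defaultdict(Counter)   # emit[tag][word] = count
    vocab = set()
    for sent in tagged_sentences:
        prev = START
        for word, tag in sent:
            trans[prev][tag] += 1
            emit[tag][word] += 1
            vocab.add(word)
            prev = tag
    tags = sorted(emit)
    vocab_size = len(vocab) + 1  # +1 reserves probability mass for UNK

    def p_trans(prev, tag):
        # P(tag | prev), smoothed over the set of observed tags
        counts = trans[prev]
        return (counts[tag] + alpha) / (sum(counts.values()) + alpha * len(tags))

    def p_emit(tag, word):
        # P(word | tag); any word not seen in training is mapped to UNK
        w = word if word in vocab else UNK
        counts = emit[tag]
        return (counts[w] + alpha) / (sum(counts.values()) + alpha * vocab_size)

    return tags, p_trans, p_emit


# Toy corpus, made up purely for illustration
corpus = [
    [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")],
    [("the", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
]
tags, p_trans, p_emit = train(corpus)
print(p_trans(START, "DET"))     # (2 + 1) / (2 + 3) = 0.6
print(p_emit("NOUN", "dog"))     # (1 + 1) / (2 + 6) = 0.25
print(p_emit("NOUN", "zebra"))   # unseen word: (0 + 1) / (2 + 6) = 0.125
```

With plain relative frequencies (alpha = 0), an unseen word would get emission probability zero under every tag, so every tag sequence for that sentence would score zero; that is the case the smoothing is meant to cover.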