This page introduces you to Language Models. It consists of:
- three videos of short lectures. They cover:
- Probabilities of Word Sequences
- Maximum Likelihood Estimation (MLE) and the Sparse Data Problem
- Independence Assumptions
- some required reading from Jurafsky and Martin
- a quiz that tests your understanding of the material presented here.
Please do the required reading, and attempt the quiz. If there is anything you don't understand, then please ask questions in the lecture or on piazza.
Lecture 5 Slides: Whole!
5a: Language Models: Probabilities of Word Sequences
- Slides: 05a_slides.pdf
5b: Language Modes: Maximum Likelihood Estimation (MLE) and the Sparse Data Problem
- Slides: 05b_slides.pdf
5c: Language Models: Evaluation and Smoothing
- Slides: 05c_slides.pdf
Recommended Reading
J&M 3.intro-1 (2nd edition: 4.intro-4.3)
NOTE: The abbreviation J&M refers to the textbook:
Dan Jurafsky and James H. Martin, Speech and Language Processing.
When we specify 2nd edition, we are referring to the version of the book that was published by Pearson International in 2008.
When we specify 3rd edition, then we will supply links to the drafts of the relevant parts of that book (since the third editiion isn't published yet, but the current draft is available here).
Quiz 5: Language Models 1
These questions are designed to test your understanding of the above course content; doing this quiz does not contribute to your overall grade. Some questions require a text answer. You can ask for formative feedback on these from your tutor or on piazza. Other questions are multiple choice or they require a numeric answer: you will get immediate feedback for these. Please don't attempt this quiz until you have acquainted yourself with this lecture and the required reading.
You must be logged onto Learn to do this quiz.