Skip to main content

FNLP - top navigation

  • Learn
  • Piazza
  • DRPS

Breadcrumb

  1. Home
  2. FNLP: Foundations of Natural Language Processing
  3. FNLP: Course Materials

FNLP: Week 2: Annotation, Evaluation and Language Models

Last week, we talked about how NLP must meet the challenge of resolving ambiguity, and that machine learning applied to corpora can help to meet that challenge.  This week, we will talk some more about corpora and how one annotates it with the kind of (latent) information that you want your NLP model to acquire via machine learning methods.  We will also talk about various metrics that you can use to evaluate how well your NLP system is performing.  We will then focus on a particular task in NLP: language models.  A language model is a probability distribution over word sequences: it tells you how likely, or unlikely, a given sequence of words is to occur.  Language models form a part of many NLP applications.  We are going to focus in detail on how one can acquire a language model from corpus data, because it provides a forum in which we can introduce challenges that are common to all machine learning approaches to building NLP systems---in particular, the sparse data problem.

The content in this folder is structured as follows:

4: Methods in Annotation and Evaluation

5: N-gram Language Models

6: Evaluation and Smoothing

As always, each of the above includes videos, the slides that were used in the videos, required readings, and a post-lecture quiz.  The quiz is a chance for you to gauge your understanding of the material presented here, and so we strongly encourage you to review this content in the above order, and then complete the quiz.  If there is anything you don't understand, then you have several options: 
 
  1. Post a question on piazza;
  2. Ask a question at the in person lectures; and/or
  3. Ask your tutor.
License
All rights reserved The University of Edinburgh
  • FNLP: 4: Methods in Annotation and Evaluation
  • FNLP: 5: Ngram Language Models
  • FNLP: 6: Evaluation and Smoothing

Book traversal links for FNLP: Week 2: Annotation, Evaluation and Language Models

  • FNLP: 3: Corpora
  • Up
  • FNLP: 4: Methods in Annotation and Evaluation

Navigation links

  • FNLP: Resource List
  • FNLP: Assessment
  • FNLP: Course Materials
    • FNLP: Week 1: Overview, Ambiguity and Corpora
    • FNLP: Week 2: Annotation, Evaluation and Language Models
      • FNLP: 4: Methods in Annotation and Evaluation
      • FNLP: 5: Ngram Language Models
      • FNLP: 6: Evaluation and Smoothing
    • FNLP: Week 3: Important ML techniques for NLP
    • FNLP: Week 4: More ML methods, Morphology and POS tagging
    • FNLP: Week 5: POS Tagging, Context Free Grammars and Parsing
    • FNLP: Week 6: More Parsing and Compositional Semantics
    • FNLP: Week 7: Discourse Semantics and Lexical Semantics
    • FNLP: Week 8: Deep Learning for NLP
    • FNLP: Week 9: Neural Text Generation
    • FNLP Week 10: Transfer learning, Revision and Q&A
  • FNLP: Lab Exercises
  • FNLP: Tutorial Exercises
RSS feed

Opencourse privacy & accessibility statements; contact Informatics, ILTS.