FNLP: 26: Neural Attention

The vanilla encoder-decoder models discussed in the previous session are not effective at capturing interactions between the input text (e.g., a sentence in a source language) and the generated context (e.g., the sentence being generated); they are also prone to hallucination and hard to train. In this lecture, we discuss how to integrate attention into the neural text generator. With attention, at each step of generation the decoder selects the input tokens that are most relevant to the next prediction and relies on those tokens. We will discuss several variations of this idea and examine what the attention weights can capture.
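As a rough illustration of a single generation step, the sketch below computes attention weights over the input tokens and the resulting context vector. It assumes dot-product scoring, which is only one possible variant and not necessarily the one used in the lecture; the names attend, encoder_states and decoder_state, and the toy vectors, are made up for this example. Note that the "selection" is soft: every input token receives some weight, and the most relevant ones receive the most.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def attend(decoder_state, encoder_states):
    """One decoding step of dot-product attention (an assumed scoring choice).

    Returns (weights, context): one weight per input token, and the
    weighted average of the encoder states under those weights.
    """
    scores = [dot(decoder_state, h) for h in encoder_states]
    weights = softmax(scores)
    dim = len(encoder_states[0])
    context = [sum(w * h[i] for w, h in zip(weights, encoder_states))
               for i in range(dim)]
    return weights, context

# Toy, made-up encoder states for a three-token input.
encoder_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
decoder_state = [0.0, 2.0]

weights, context = attend(decoder_state, encoder_states)
print([round(w, 3) for w in weights])
print([round(c, 3) for c in context])
```

With these toy vectors, the second and third input tokens score equally and share most of the weight, while the first token receives the least. In a real model the context vector would then be combined with the decoder state to predict the next output token.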

The folder contains slides, required reading and a quiz.

Slides and reading

Slides

(The recorded video contains animations that are not visible in the PDF.)

Recommended reading: Jurafsky and Martin, 3rd edition (online), section 9.8. Note that the lecture includes material that is not covered in J&M.

Optionally, also study the language modeling and seq2seq sections in Lena Voita's NLP course:

  • https://lena-voita.github.io/nlp_course.html

Quiz 26: Attention 

These questions are designed to test your understanding of the above course content; doing this quiz does not contribute to your overall grade. Some questions require a text answer; you can ask for formative feedback on these from your tutor or on Piazza. Other questions are multiple choice or require a numeric answer; you will get immediate feedback on these. Please don't attempt this quiz until you have acquainted yourself with this lecture and the required reading.

You must be logged in to Learn to take this quiz.

License
All rights reserved, The University of Edinburgh
