Skip to main content

FNLP - top navigation

  • Learn
  • Piazza
  • DRPS

Breadcrumb

  1. Home
  2. FNLP: Foundations of Natural Language Processing
  3. FNLP: Course Materials
  4. FNLP: Week 9: Neural Text Generation

FNLP: 27: Transformers

This time, we will introduce the currently most popular architecture of neural models: Transformers. Transformers, and their variations, power the majority of modern large language models, such as OpenAI's ChatGPT or Google's BERT. In this section, we will consider them in the context of text generation, but next week, we will also look at the ways we can utilize them to extract knowledge from large collections and transfer it to specific tasks. The key component of Transformers is the multi-head attention module, which builds on the attention modeling idea we introduced last time. We also touch on interpretability and observe that some of the behavior of large transformer models is human interpretable.

The folder contains slides, required reading and a quiz.

Slides and reading

Slides

(The recorded video contains animations which are not visible in pdf)

Please refer to Jurafsky and Martin, 3rd edition (online), chapter 10. However, again notice that the material presented in the lecture differs from the text books in some aspects. Also optionally study relevant sections in Lena Voita's NLP course:

  • https://lena-voita.github.io/nlp_course.html

Quiz 27: Transformers

These questions are designed to test your understanding of the above course content; doing this quiz does not contribute to your overall grade.  Some questions require a text answer.  You can ask for formative feedback on these from your tutor or on piazza.  Other questions are multiple choice or they require a numeric answer: you will get immediate feedback for these. Please don't attempt this quiz until you have acquainted yourself with this lecture and the required reading.

You must be logged onto Learn to do this quiz.

License
All rights reserved The University of Edinburgh

Book traversal links for FNLP: 27: Transformers

  • FNLP: 26: Neural Attention
  • Up
  • FNLP Week 10: Transfer learning, Revision and Q&A

Navigation links

  • FNLP: Resource List
  • FNLP: Assessment
  • FNLP: Course Materials
    • FNLP: Week 1: Overview, Ambiguity and Corpora
    • FNLP: Week 2: Annotation, Evaluation and Language Models
    • FNLP: Week 3: Important ML techniques for NLP
    • FNLP: Week 4: More ML methods, Morphology and POS tagging
    • FNLP: Week 5: POS Tagging, Context Free Grammars and Parsing
    • FNLP: Week 6: More Parsing and Compositional Semantics
    • FNLP: Week 7: Discourse Semantics and Lexical Semantics
    • FNLP: Week 8: Deep Learning for NLP
    • FNLP: Week 9: Neural Text Generation
      • FNLP: 25: Sequence-to-Sequence Modeling
      • FNLP: 26: Neural Attention
      • FNLP: 27: Transformers
    • FNLP Week 10: Transfer learning, Revision and Q&A
  • FNLP: Lab Exercises
  • FNLP: Tutorial Exercises
RSS feed

Opencourse privacy & accessibility statements; contact Informatics, ILTS.