NLU-11: Week 9

This week, we we will have our two last examinable lectures on QA and RAG, and we have a non-examinable guest lecture on Fact Checking by Emily Allaway. 

As a reminder, Coursework 2 is due on Friday 21 March at 12:00 noon. When submitting please make sure you *tag your partner* on Gradescope when you do that, otherwise ITO loses track of who did the assignment with whom

This week's lab is on prompting with GPT-3.5. It even includes a small competition on who can develop the best performing prompt on a small data set! Please note you will need an OpenAI key to run this lab; please obtain this before you attend your lab session, instructions can be found in the lab itself.

Lab 4: Prompting with GPT-3.5

The solutions for last week's tutorial are now available:

Solutions for Tutorial 3

And here is a preview of next week's tutorial. Please work through this before attending your tutorial group next week:

Tutorial 4: Practise Exam Questions 

SlidesLectureCourse Content
Question Answering25

Required Reading:

Speech and Language Processing Ed. 3, Ch. 14 on RAG and QA 

Natural Questions: A Benchmark for Question Answering Research, https://aclanthology.org/Q19-1026/ 

Optional Reading:

Bidirectional Attention Flow for Machine Comprehension, https://arxiv.org/abs/1611.01603 (BiDAF)

SQuAD: 100,000+ Questions for Machine Comprehension of Text, https://arxiv.org/abs/1606.05250 (SQuAD)

(Optional) Know What You Don’t Know: Unanswerable Questions for SQuAD, https://arxiv.org/abs/1806.03822 (SQuAD v2)

Latent Retrieval for Weakly Supervised Open Domain Question Answering https://arxiv.org/abs/1906.00300

(Optional; worth a reading!) The Bitter Lesson: https://www.cs.utexas.edu/~eunsol/courses/data/bitter_lesson.pdf

Retrieval-Augmented Generation26

Required Reading:

ATLAS: Few-shot Learning with Retrieval Augmented Language Models, https://arxiv.org/abs/2208.03299

Optional Reading:

Reliable, Adaptable, and Attributable Language Models with Retrieval, https://arxiv.org/abs/2403.03187

REALM: Retrieval-Augmented Language Model Pre-Training, https://arxiv.org/abs/2002.08909

Reading Wikipedia to Answer Open-Domain Questions, https://arxiv.org/abs/1704.00051

Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets, https://arxiv.org/abs/2008.02637

Challenges in Generalisation in Open Domain Question Answering, https://arxiv.org/abs/2109.01156

Fact Checking27 

Guest Lecture by Emily Allaway

Not examinable

License
All rights reserved The University of Edinburgh