ANLP: Readings
Required readings
The main readings for ANLP derive from two editions of the Jurafsky and Martin Speech and Language processing textbook:
- Second edition (denoted JM2), available from the ANLP Resource List on Learn (which requires a login).
- Third edition, draft as of Aug 2024 (denoted JM3). We will always refer to chapters in this complete PDF. You can download individual chapters from the textbook website, but these can change without warning.
When we refer to sections of the textbook, section 0 refers to whatever introductory material comes before section 1.
Past students have requested that we prioritize readings, so high priority readings are marked with a (*). If you are really short on time you should focus on these and return to others later. But you are ultimately responsible for all material in the assigned reading, and keeping up is the best strategy!
Some students find that it is useful to do the reading before watching the lecture, while others prefer to do it afterward. Please do what works best for you, and if you aren’t sure, experiment.
Optional readings
Linguistics background
In previous years some Informatics students have asked for more background reading on linguistics. A good place to start might be this text, which is also on the resource list, and available online through the University library:
Linguistic Fundamentals for Natural Language Processing: 100 Essentials from Morphology and Syntax by Emily M Bender. Synthesis Lectures on Human Language Technologies, June 2013, Vol. 6, No. 3 , Pages 1-184.
Further mathematical details
Some students may want a more rigourous treatment of the models and machine learning methods we discuss. In that case I suggest the following textbook. It covers many of the same topics we do, but assumes somewhat more background and comfort with formal methods.
Introduction to Natural Language Processing by Jacob Eisenstein. MIT Press, 2019. (Draft version is available for free from author's github page here.)