Prerequisites
Beginner-to-intermediate Python. A Google account for Colab. No prior deep learning or NLP experience required; the course builds from mathematical foundations.
Preliminary notebooks →LLMs for Social Science
Language models are reshaping how social scientists collect, annotate, and analyze text. This course builds the conceptual depth and hands-on skills to use them critically. Beginner Python required; everything else is taught from first principles.
The course runs from mathematical foundations to autonomous research agents. Each module builds on the last; start at Module 1.
From bag-of-words to embeddings and tokenization, then contextual representations, language modeling, the Transformer, scaling laws, and how this stack shows up in social science workflows. The conceptual spine for everything that follows.
Open module →Post-training alignment (SFT, RLHF, DPO), prompting from zero-shot to chain-of-thought, reasoning techniques, and how to evaluate models and choose one that fits your task.
Open module →Fine-tuning (including LoRA-style adaptation and encoder heads), moving from notebooks to APIs and self-hosted inference, and rigorous validation — so a classifier works as a measurement instrument you can defend in publication.
Open module →Faithful extraction and summarization (hallucination, omission, and distortion), then retrieval-augmented generation: chunking, embeddings, retrieval, and grounded answers with provenance checks over your own corpora.
Open module →Building autonomous research agents with tool use, ReAct patterns, and multi-step orchestration. The frontier of what language models can do for your research pipeline.
Open module →Beginner-to-intermediate Python. A Google account for Colab. No prior deep learning or NLP experience required; the course builds from mathematical foundations.
Preliminary notebooks →All exercises are open-source Jupyter notebooks. Clone the repository, open in Colab, and follow along at your own pace.
GitHub repository →Begin with Module 1: the mathematical and conceptual foundations that everything else builds on.
Begin Module 1 →The course was delivered in person at three institutions in spring 2026. All materials remain freely available online.
23–27 March 2026
5-day intensive · DPIR, University of Oxford
7 April 2026
1-day workshop
20–21 April 2026
2-day workshop · European University Institute