13A: LLMs Introduction

Materials:

Date: Thursday, 07-Nov-2024, 11.30-1pm, IST.

LING571@UW Deep Learning For NLP, Prof. Shane at UW, Spring’24. Introduction, Word Vectors, Language Modeling
CIS7000@UPenn LLMs, by Prof. Mayur Naik at UPenn, Fall’24, Background, Language Modeling
AIL821@IIT-Delhi LLMs: Introduction and Recent Advances ELL881/AIL821, LLMs: Introduction and Advances @ IIT-Delhi, Fall’24.
Transformers
- LLMs @ UPenn Part-1, Part-2
- LLMs: Introduction and Recent Advances @ IIT Delhi Module-5 on RNNs, Module-6 on Attention and Transformers

We will follow the “Follow the data” approach to organize the content.

Quick review of NLP and Deep Learning for NLP, pre- and post-GPT world.
- Lecture 2 from AIL821 Introduction to NLP,
- Lecture 3.1 from AIL821 Introduction to Language Models
LLM Flow: (Quality) Datasets, Model Training (Pre-training, Alignment, Fine-tuning), Prompt Optimization, Constrained Language Generation, Evaluation.
Datasets and Tasks (to train LLMs)
- Lecture 7 from AIL821
Model Training
- Pre-training
  - Lecture 12.1 from AIL821
- Alignment
  - Direct Preference Optimization: Your Language Model is Secretly a Reward Model
- Fine-tuning
  - LoRA: Low-Rank Adaptation of Large Language Models
Prompt Optimization
- Chain-of-Thought
Constrained Language Generation
- Collection
Evaluation
- Scaling Evaluation of LLMs Yann Bubois, CIS 7000 LLM Course
Applications and Design Patterns
- Tools
  - Gorilla
- Agents
  - Lilian Wang’s blog on LLM Powered Autonomous Agents
  - Aman’s blog on Agents
- RAG
  - Paper from NVidia FACTS About Building Retrieval Augmented Generation-based Chatbots
LLMs can not reason & plan
- LLMs Can’t Plan, But Can Help Planning in LLM-Modulo Frameworks

CIS7000 LLM Course @ UPenn by Prof. Mayur Naik. Covers many advanced topics.
AIL821 LLMs Course @ IIT-D
Deep Learning For NLP @ UW LING 574, Deep Learning For NLP, Prof. Shane @ UW, Spring’24.
Walk through the book Building LLMs from Scratch