13A: LLMs Introduction
Materials:
Date: Thursday, 07-Nov-2024, 11.30-1pm, IST.
Pre-work:
- LING571@UW Deep Learning For NLP, Prof. Shane at UW, Spring’24. Introduction, Word Vectors, Language Modeling
- CIS7000@UPenn LLMs, by Prof. Mayur Naik at UPenn, Fall’24, Background, Language Modeling
- AIL821@IIT-Delhi LLMs: Introduction and Recent Advances ELL881/AIL821, LLMs: Introduction and Advances @ IIT-Delhi, Fall’24.
- Transformers
- LLMs @ UPenn Part-1, Part-2
- LLMs: Introduction and Recent Advances @ IIT Delhi Module-5 on RNNs, Module-6 on Attention and Transformers
In-Class
We will follow the “Follow the data” approach to organize the content.
- Quick review of NLP and Deep Learning for NLP, pre- and post-GPT world.
- Lecture 2 from AIL821 Introduction to NLP,
- Lecture 3.1 from AIL821 Introduction to Language Models
- LLM Flow: (Quality) Datasets, Model Training (Pre-training, Alignment, Fine-tuning), Prompt Optimization, Constrained Language Generation, Evaluation.
- Datasets and Tasks (to train LLMs)
- Model Training
- Prompt Optimization
- Constrained Language Generation
- Evaluation
- Scaling Evaluation of LLMs Yann Bubois, CIS 7000 LLM Course
- Applications and Design Patterns
- Tools
- Agents
- Lilian Wang’s blog on LLM Powered Autonomous Agents
- Aman’s blog on Agents
- RAG
- Paper from NVidia FACTS About Building Retrieval Augmented Generation-based Chatbots
- LLMs can not reason & plan
Post-class
- Datasets and Tasks (to train LLMs)
- LIMA: less is more for alignment
- Instruction Tuning for Large Language Models: A Survey
- OLMo @ Allen AI - if you are interesting in all aspects of open-source LLM development.
- Model Training
- Pre-training
- Alignment
- Fine-tuning
- Performance Efficient Fine-Tuning collection
- Lecture: PEFT
- Lecture: : Quantization and Pruning
- QLoRA: Efficient Finetuning of Quantized LLMs
- Prompt Optimization
- Constrained Language Generation
- Evaluation
- Scaling Evaluation of LLMs Yann Bubois, CIS 7000 LLM Course
- Applications and Design Patterns
- Tools
- [Lecture 18.2 from AIL821] LLMs and Tools: Function Calling
- Agents
- [Lecture 18.3 from AIL821] LLMs and Tools: Agentic
- AutoGen repo
- CrewAI repo
- LLM Agent papers collection
- Survey: The Rise and Potential of Large Language Model Based Agents: A Survey
- RAG
- Tools
- LLMs can not reason & plan
LLMs and Influence Functions
- Studying Large Language Model Generalization with Influence Functions
- Do Influence Functions Work on Large Language Models?
- TextGrad Automatic ‘’Differentiation’’ via Text, paper
Full Courses
- CIS7000 LLM Course @ UPenn by Prof. Mayur Naik. Covers many advanced topics.
- AIL821 LLMs Course @ IIT-D
- Deep Learning For NLP @ UW LING 574, Deep Learning For NLP, Prof. Shane @ UW, Spring’24.
- Walk through the book Building LLMs from Scratch