Training LLMs from Scratch

1

Introduction to the Course
- Course Overview
- Course Handouts
2

Getting Started with Training LLMs from Scratch
- Introduction to Training LLMs From Scratch
- What do you mean by Training LLMs from Scratch?
- Why train your own LLMs?
- Quiz
3

Important Concepts for Training LLMs from Scratch
- Chinchilla Scaling Laws
- Parallel and Distributed paradigm
- Data Parallelism
- Model Parallelism
- Pipeline Parallelism
- Fully Sharded Data Parellelism/ZeRO 3
- Tensor Parallelism
- 2D and 3D Parallelism
- Quiz
4

Steps involved in Training LLMs from Scratch
- Overview of Steps Involved in Training LLMs from Scratch
- Training Data Curation
- Data preprocessing
- Tokenization
- Model Architecture
- Model Evaluation
- Quiz
5

Training your own LLMs from Scratch
- Estimate Cost of Training LLMs from Scratch
- Hands on - Setting up runpod and intro
- Hands on - Dataset curation and preprocessing
- Hands on - Training BPE Tokenizer
- Hands on - Training LLMs from Scratch using DeepSpeed ZeRO 3/FSDP Distributed Training on 8 GPUs
- Quiz
- Assignment
6

Aligining LLM with Human Preferences
- Recap of Instruction Following LLM
- Introduction to Alignment Methods
- Reinforcement Learning from Human Feedback (RLHF)
- Aligning LLMs to Human Preferences with RLHF
- DPO and IPO
- Preference tuning LLMs with DPO
- Quiz
7

What Next?
- Recent Advancements
- Share your Experience!
- Gen AI Advanced Assessment– Now Open for Booking!