From the course: Advanced Guide to ChatGPT, Embeddings, and Other Large Language Models (LLMs)
Unlock this course with a free trial
Join today to access over 25,200 courses taught by industry experts.
Instruction alignment of LLMs: Reward modeling - ChatGPT Tutorial
From the course: Advanced Guide to ChatGPT, Embeddings, and Other Large Language Models (LLMs)
Contents
-
-
-
-
-
-
-
-
-
-
-
-
(Locked)
Topics46s
-
(Locked)
BERT for multilabel classification: Part 112m 36s
-
(Locked)
BERT for multilabel classification: Part 226m 59s
-
(Locked)
Writing LaTeX with GPT-221m 31s
-
(Locked)
Case study: Sinan’s attempt at wise yet engaging responses—SAWYER23m 54s
-
(Locked)
Instruction alignment of LLMs: Supervised fine-tuning28m 23s
-
(Locked)
Instruction alignment of LLMs: Reward modeling21m 49s
-
(Locked)
Instruction alignment of LLMs: RLHF26m 54s
-
(Locked)
Instruction alignment of LLMs: Using an instruction-aligned LLM19m 10s
-
(Locked)
-
-
-