What are Transformer Models and how do they work?

Serrano.Academy

135,000 Subscribers

52,210 views since Nov 26, 2023

This is the last in a series of three videos in which we demystify Transformer models and explain them with visuals and friendly examples.

Video 1: The attention mechanism at a high level • The Attention Mechanism in Large Lang...

Video 2: The attention mechanism with math • The math behind Attention: Keys, Quer...

Video 3 (This one): Transformer models

If you like this material, check out LLM University from Cohere!
https://llm.university

Get the Grokking Machine Learning book!
https://manning.com/books/grokking-ma...
Discount code (40%): serranoyt
(Use the discount code at checkout)

00:00 Introduction
01:50 What is a transformer?
04:35 Generating one word at a time
08:59 Sentiment Analysis
13:05 Neural Networks
18:18 Tokenization
19:12 Embeddings
25:06 Positional encoding
27:54 Attention
32:29 Softmax
35:48 Architecture of a Transformer
39:00 Fine-tuning
42:20 Conclusion
