We implement a multilayer perceptron (MLP) character-level language model. In this video we also introduce many basics of machine learning (e.g. model training, learning rate tuning, hyperparameters, evaluation, train/dev/test splits, under/overfitting, etc.).
Links:
- makemore on github:
- jupyter notebook I built in this video:
- collab notebook (new)!!!:
- Bengio et al. 2003 MLP language model paper (pdf):
Useful links:
- PyTorch internals ref
Exercises:
- E01: Tune the hyperparameters of the training to beat my best validation loss of 2.2
- E02: I was not careful with the intialization of the network in this video. (1) What is the loss you’d get if the predicted probabilities at initialization were perfectly uniform? What loss do we achieve? (2) Can you tune the initialization to get a starting loss that is much more similar to (1)?
- E03: Read the Bengio et al 2003 paper (link above), implement and try any idea from the paper. Did it work?
Chapters:
00:00:00 intro
00:01:48 Bengio et al. 2003 (MLP language model) paper walkthrough
00:09:03 (re-)building our training dataset
00:12:19 implementing the embedding lookup table
00:18:35 implementing the hidden layer internals of : storage, views
00:29:15 implementing the output layer
00:29:53 implementing the negative log likelihood loss
00:32:17 summary of the full network
00:32:49 introducing and why
00:37:56 implementing the training loop, overfitting one batch
00:41:25 training on the full dataset, minibatches
00:45:40 finding a good initial learning rate
00:53:20 splitting up the dataset into train/val/test splits and why
01:00:49 experiment: larger hidden layer
01:05:27 visualizing the character embeddings
01:07:16 experiment: larger embedding size
01:11:46 summary of our final code, conclusion
01:13:24 sampling from the model
01:14:55 google collab (new!!) notebook advertisement
15 views
5598
1994
13 years ago 00:03:25 32
CFCF - Exercise #3 (Building)
7 years ago 00:01:36 58
Building MOC #3
11 years ago 00:07:00 100
Malakas 636 Build (Part 3)
6 years ago 00:01:54 38
Derek Adams 4-3-3 - Building From The Back
2 years ago 00:01:31 1
Building 3 d print
9 years ago 00:11:13 58
Subnautica | Part 3 | BUILDING A BASE!!
8 years ago 00:01:31 1K
System can 3-D print an entire building
12 years ago 00:09:45 91
The sims 3 house building - Ziptonhome 66
9 years ago 00:26:16 13
Part 3: Make Building Envelope
8 years ago 00:50:32 1
Warcraft 3: Last One Building
11 years ago 00:04:40 62
Boat building part
11 years ago 00:05:54 212
#3 DIALux evo - Building Construction
9 years ago 00:19:19 8
Blender 2.7 Tutorial #3 : Building a Snowman #b3d
12 years ago 00:10:59 25
The Sims 3: Building a Modern Condo
6 years ago 00:04:23 10
Timelaps #3 (Building Bridge)
12 years ago 00:14:25 43
Let’s build #3.1(3.3)
11 years ago 00:11:20 18
Building The Batmobiles (Part 3)
12 years ago 00:02:09 207
Mass Effect 3 Build Alliances
14 years ago 00:10:00 18
Sims 3 - Modern Building
12 years ago 00:09:20 79
Building Neverland in the Sims 3
9 years ago 00:07:05 11
Steady vloggin 3: Building new jumps
5 years ago 00:16:37 12
Building 3 New BMX Bikes!
12 years ago 00:07:16 48
ROD BUILDING PERFECT GUIDE 3
8 years ago 00:17:28 34
СОЗДАНИЕ ВИДЕО РОЛИКА (ЧАСТЬ 3) Создание интро (Building a Reel Part 3)