| .. |
|
01_main-chapter-code
|
bcfdbd7008
Fix some wording issues in the notes (#695)
|
5 months ago |
|
02_alternative_weight_loading
|
3f93d73d6d
Alt weight loading code via PyTorch (#585)
|
7 months ago |
|
03_bonus_pretraining_on_gutenberg
|
15fa6a84f6
fixed plot_losses (#677)
|
5 months ago |
|
04_learning_rate_schedulers
|
cf39abac04
Add and link bonus material (#84)
|
1 year ago |
|
05_bonus_hparam_tuning
|
8b3e4b24b0
Remove unused params for hparam script (#710)
|
4 months ago |
|
06_user_interface
|
c21bfe4a23
Add PyPI package (#576)
|
8 months ago |
|
07_gpt_to_llama
|
80d4732456
add HF equivalency tests for standalone nbs (#774)
|
3 months ago |
|
08_memory_efficient_weight_loading
|
3233ddc475
get rid of redundant memory profiler import (#744)
|
4 months ago |
|
09_extending-tokenizers
|
c21bfe4a23
Add PyPI package (#576)
|
8 months ago |
|
10_llm-training-speed
|
83c76891fc
Fix issue 724: unused args (#726)
|
4 months ago |
|
11_qwen3
|
80d4732456
add HF equivalency tests for standalone nbs (#774)
|
3 months ago |
|
12_gemma3
|
80d4732456
add HF equivalency tests for standalone nbs (#774)
|
3 months ago |
|
README.md
|
f92b40e4ab
Qwen3 Coder Flash & MoE from Scratch (#760)
|
3 months ago |