Sebastian Raschka 1b635f760e fix misplaced parenthesis and update license (#466) vor 10 Monaten
..
README.md 283397aaf2 add main and optional sections vor 1 Jahr
ch05.ipynb bfa4215774 llama note vor 1 Jahr
exercise-solutions.ipynb 9f0bda7af5 add note about duplicated cell vor 1 Jahr
gpt_download.py 8d02cb1cee Add download help message (#274) vor 1 Jahr
gpt_generate.py f61c008c5d Add missing device transfer in gpt_generate.py (#436) vor 1 Jahr
gpt_train.py 1b635f760e fix misplaced parenthesis and update license (#466) vor 10 Monaten
previous_chapters.py bbb2a0c3d5 fixed num_workers (#229) vor 1 Jahr
tests.py bcccda728b check gpt files (#208) vor 1 Jahr

README.md

Chapter 5: Pretraining on Unlabeled Data

Main Chapter Code

  • ch05.ipynb contains all the code as it appears in the chapter
  • previous_chapters.py is a Python module that contains the MultiHeadAttention module and GPTModel class from the previous chapters, which we import in ch05.ipynb to pretrain the GPT model
  • gpt_download.py contains the utility functions for downloading the pretrained GPT model weights
  • exercise-solutions.ipynb contains the exercise solutions for this chapter

Optional Code

  • gpt_train.py is a standalone Python script file with the code that we implemented in ch05.ipynb to train the GPT model (you can think of it as a code file summarizing this chapter)
  • gpt_generate.py is a standalone Python script file with the code that we implemented in ch05.ipynb to load and use the pretrained model weights from OpenAI