Sebastian Raschka 68e2efe1c9 Mention small discrepancy due to Dropout non-reproducibility in PyTorch (#519) 9 月之前
..
README.md 283397aaf2 add main and optional sections 1 年之前
ch05.ipynb 68e2efe1c9 Mention small discrepancy due to Dropout non-reproducibility in PyTorch (#519) 9 月之前
exercise-solutions.ipynb 9f0bda7af5 add note about duplicated cell 1 年之前
gpt_download.py 701090815e Add backup URL for gpt2 weights (#469) 10 月之前
gpt_generate.py 25ea71e713 Alternative weight loading via .safetensors (#507) 9 月之前
gpt_train.py 1b635f760e fix misplaced parenthesis and update license (#466) 10 月之前
previous_chapters.py bbb2a0c3d5 fixed num_workers (#229) 1 年之前
tests.py 701090815e Add backup URL for gpt2 weights (#469) 10 月之前

README.md

Chapter 5: Pretraining on Unlabeled Data

Main Chapter Code

  • ch05.ipynb contains all the code as it appears in the chapter
  • previous_chapters.py is a Python module that contains the MultiHeadAttention module and GPTModel class from the previous chapters, which we import in ch05.ipynb to pretrain the GPT model
  • gpt_download.py contains the utility functions for downloading the pretrained GPT model weights
  • exercise-solutions.ipynb contains the exercise solutions for this chapter

Optional Code

  • gpt_train.py is a standalone Python script file with the code that we implemented in ch05.ipynb to train the GPT model (you can think of it as a code file summarizing this chapter)
  • gpt_generate.py is a standalone Python script file with the code that we implemented in ch05.ipynb to load and use the pretrained model weights from OpenAI