Sebastian Raschka bcfdbd7008 Fix some wording issues in the notes (#695) 5 months ago
README.md 283397aaf2 add main and optional sections 1 year ago
ch05.ipynb bcfdbd7008 Fix some wording issues in the notes (#695) 5 months ago
exercise-solutions.ipynb 02779f5e35 updated exercise 5.3 (#615) 7 months ago
gpt_download.py 86b714a5e0 Specify UTF-8 encoding in the json load command explicitely (#557) 8 months ago
gpt_generate.py 25ea71e713 Alternative weight loading via .safetensors (#507) 9 months ago
gpt_train.py 1b635f760e fix misplaced parenthesis and update license (#466) 10 months ago
previous_chapters.py bbb2a0c3d5 fixed num_workers (#229) 1 year ago
tests.py 701090815e Add backup URL for gpt2 weights (#469) 10 months ago

README.md

Chapter 5: Pretraining on Unlabeled Data

Main Chapter Code

  • ch05.ipynb contains all the code as it appears in the chapter
  • previous_chapters.py is a Python module that contains the MultiHeadAttention module and GPTModel class from the previous chapters, which we import in ch05.ipynb to pretrain the GPT model
  • gpt_download.py contains the utility functions for downloading the pretrained GPT model weights
  • exercise-solutions.ipynb contains the exercise solutions for this chapter
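
When the notebook runs from the chapter folder, a plain `from previous_chapters import GPTModel` is usually enough. If the working directory is somewhere else, the module can be loaded by file path instead. The following is a minimal sketch under that assumption; `load_module_from_path` is a hypothetical helper name and is not part of this repository.

```python
import importlib.util
from pathlib import Path


def load_module_from_path(path):
    """Load a Python source file as a module object without touching sys.path.

    Hypothetical helper for illustration; not part of the chapter code.
    """
    path = Path(path)
    if not path.is_file():
        raise FileNotFoundError(f"{path} not found; check the chapter folder location")
    # Build an import spec from the file location, then execute the module body
    spec = importlib.util.spec_from_file_location(path.stem, path)
    module = importlib.util.module_from_spec(spec)
    spec.loader.exec_module(module)
    return module


# Example usage (assumes previous_chapters.py sits at the given path):
# prev = load_module_from_path("previous_chapters.py")
# GPTModel = prev.GPTModel
```

The class names accessed on the loaded module (`GPTModel`, `MultiHeadAttention`) are the ones named in the list above; everything else about the module's contents should be checked against the file itself.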

Optional Code

  • gpt_train.py is a standalone Python script containing the code we implemented in ch05.ipynb to train the GPT model (you can think of it as a single-file summary of this chapter)
  • gpt_generate.py is a standalone Python script containing the code we implemented in ch05.ipynb to load and use the pretrained model weights from OpenAI