Matthew Hernandez 6f12edb0cc Fix issue: 731 by resolving semantic error (#738) vor 4 Monaten
..
README.md a22d612be6 Bonus material: extending tokenizers (#496) vor 10 Monaten
bpe-from-scratch.ipynb 6f12edb0cc Fix issue: 731 by resolving semantic error (#738) vor 4 Monaten
tests.py 6f12edb0cc Fix issue: 731 by resolving semantic error (#738) vor 4 Monaten

README.md

Byte Pair Encoding (BPE) Tokenizer From Scratch

  • bpe-from-scratch.ipynb contains optional (bonus) code that explains and shows how the BPE tokenizer works under the hood.