Build an LLM from Scratch 3: Coding attention mechanisms | Sebastian Raschka Transcripts