Building LLMs from Scratch

Following Sebastian Raschka's comprehensive guide with insights from music technology and MIR

View Repository Get the Book

Learning Progress

Overall Completion 0 / 7 chapters

Chapter 1 Upcoming

Understanding LLMs

Introduction to large language models and transformer architecture

Chapter 2 Upcoming

Working with Text Data

Tokenization, embeddings, and data preprocessing

Chapter 3 Upcoming

Attention Mechanisms

Self-attention, multi-head attention, and causal masking

Chapter 4 Upcoming

Implementing GPT

Building a GPT model from scratch in PyTorch

Chapter 5 Upcoming

Pretraining

Pretraining on unlabeled data at scale

Chapter 6 Upcoming

Classification Finetuning

Finetuning for downstream classification tasks

Chapter 7 Upcoming

Instruction Following

Finetuning models to follow instructions

Published Content

Deep dives, experiments, and connections to music technology

Journey Starting Soon

I'm just getting started with this learning journey. Check back soon for detailed notebooks and insights!

About This Learning Journey

I'm working through "Build a Large Language Model (From Scratch)" by Sebastian Raschka, documenting my learning process with a unique perspective from my background in music technology and MIR.

What makes this series different:

🎵 Music Tech Connections - Drawing parallels between LLMs and audio processing
🔬 Deep Experiments - Going beyond the book with additional explorations
📊 Visualizations - Interactive plots and diagrams to build intuition
💭 Honest Reflections - Documenting challenges and "aha" moments

All code, experiments, and detailed notes are available in my GitHub repository.