CS336 Assignment 1: Building a Transformer Language Model from Scratch
A comprehensive reflection on implementing a complete Transformer language model pipeline from scratch — including BPE tokenizer with parallel pre-tokenization, decoder-only Transformer with RMSNorm/RoPE/SwiGLU, AdamW optimizer, and autoregressive text generation. Trained on TinyStories and OpenWebText with extensive experiments on learning rate sweeps, batch size studies, and architectural ablations.
15645 Database systems: Hash Tables
Notes and summaries for CMU 15-645 Database Systems.
15645 Database systems: Storage
Notes and summaries for CMU 15-645 Database Systems.
11711 Advanced NLP: Learning & Inference
Notes on learning algorithms and inference methods from CMU 11-711 Advanced NLP.
11711 Advanced NLP: Architectures
Notes on RNN architectures, encoder-decoder models, and attention mechanisms from CMU 11-711 Advanced NLP.
11868 LLM Sys & 15642 ML Sys: DL Frameworks and Auto Differentiation
Comprehensive notes on deep learning frameworks and automatic differentiation from CMU 11-868 LLM Systems and 15-642 ML Systems courses.
11868 LLM Sys: GPU Programming & Acceleration
Notes and summaries for CMU 11-868 LLM Systems: GPU Programming & Acceleration.
11711 Advanced NLP: Fundamentals
Notes and summaries for CMU 11-711 Advanced NLP.
15618 Assignment 1 Report
Assignment 1 report for CMU 15-618, covering pthreads speedup analysis and SIMD vectorization.
15645 Database systems: Relational Model and SQL
Notes and summaries for CMU 15-645 Database Systems.




