15645 Database Systems: Index Concurrency Control
Notes and summaries for CMU 15-645 Database Systems on index concurrency control.
11711 Advanced NLP: Evaluation & Experimental Design
Notes on NLP evaluation, benchmarks, and experimental design from CMU 11-711 Advanced NLP, covering loss-based metrics, task metrics, LLM-as-judge, statistical testing, and annotation quality.
11711 Advanced NLP: Multimodal Modeling
Notes on multimodal modeling from CMU 11-711 Advanced NLP, covering ViT, CLIP, LLaVA-style integration, token-based image generation, diffusion, and flow matching.
11711 Advanced NLP: Retrieval and RAG
Notes on retrieval systems, retrievers, and retrieval-augmented generation (RAG) from CMU 11-711 Advanced NLP.
15642 Machine Learning Systems: Distributed Training and Parallelization
Notes and summaries for CMU 15-642 Machine Learning Systems on distributed training techniques including data parallelism, model parallelism, pipeline parallelism, and optimization strategies like ZeRO for training large-scale models.
11868 LLM Sys: Decoding
Notes on decoding strategies for LLM inference from CMU 11-868, covering efficient sampling, beam search algorithms, speculative decoding, and EAGLE.
11868 LLM Sys: Tokenization and Embedding
Comprehensive notes on tokenization techniques and vocabulary learning for Large Language Models, covering BPE, VOLT, and practical considerations in multilingual LLMs.
15642 Machine Learning Systems: Transformer, Attention, and Optimizations
Notes and summaries for CMU 15-642 Machine Learning Systems on Transformer architecture, attention mechanisms, and GPU optimizations including FlashAttention.
15645 Database Systems: Index & Filter
Notes and summaries for CMU 15-645 Database Systems on indexes and filters.
TinyKV: A Raft-Based KV Storage System
Implementation and architecture of TinyKV, a Raft-based distributed key-value storage system covering Multi-Raft, MVCC transactions, and the scheduler.




