LLM :: SAO Blog

From RL to RLHF

May 8, 2025, 02:15 PM 50 min read

This article is primarily based on Umar Jamil's course for learning and recording purposes. Our goal is to align LLM behavior with our desired outputs, and RLHF is one of the most famous techniques for this.

Deep LearningRLHFLLM

Implementing Simple LLM Inference in Rust

Feb 7, 2025, 02:48 PM 40 min read

I stumbled upon the 'Large Model and AI System Training Camp' hosted by Tsinghua University on Bilibili and signed up immediately. I planned to use the Spring Festival holiday to consolidate my theoretical knowledge of LLM Inference through practice. Coincidentally, the school VPN was down, preventing me from doing research, so it was the perfect time to organize my study notes.

LLMRustmlsys

History of LLM Evolution (6): Unveiling the Mystery of Tokenizers

Jul 4, 2024, 04:42 PM 50 min read

Deeply understand how tokenizers work, learning about the BPE algorithm, the tokenization strategies of the GPT series, and implementation details of SentencePiece.

LLMAITokenizerBPENLP

History of LLM Evolution (5): Building the Path of Self-Attention — The Future of Language Models from Transformer to GPT

Mar 20, 2024, 08:49 AM 60 min read

Building the Transformer architecture from scratch, deeply understanding core components like self-attention, multi-head attention, residual connections, and layer normalization.

LLMGPTDeep LearningTransformer

The Way of Fine-Tuning

Mar 15, 2024, 02:46 PM 20 min read

Learn how to fine-tune large language models under limited VRAM conditions, mastering key techniques like half-precision, quantization, LoRA, and QLoRA.

AILLMFine-tuning

History of LLM Evolution (4): WaveNet — Convolutional Innovation in Sequence Models

Mar 9, 2024, 04:01 PM 30 min read

Learn the progressive fusion concept of WaveNet and implement a hierarchical tree structure to build deeper language models.

AIDeep LearningLLM

The State of GPT

Feb 18, 2024, 08:16 PM 30 min read

A structured overview of Andrej Karpathy's Microsoft Build 2023 talk, deeply understanding GPT's training process, development status, the current LLM ecosystem, and future outlook.

AIChatGPTLLMGPTNLP

History of LLM Evolution (2): Embeddings — MLPs and Deep Language Connections

Feb 17, 2024, 09:48 PM 25 min read

Exploring Bengio's classic paper to understand how neural networks learn distributed representations of words and how to build a Neural Probabilistic Language Model (NPLM).

AILLMDeep LearningEmbeddingsNeural Networks

History of LLM Evolution (1): The Simplicity of Bigram

Feb 17, 2024, 11:05 AM 20 min read

Starting with the simplest Bigram model to explore the foundations of language modeling. Learn how to predict the next character through counting and probability distributions, and how to achieve the same effect using a neural network framework.

AIDeep LearningLLMLanguage Models

#LLM