#AI
10 posts

Reinforcement Learning Basics and Q-Learning
Learn the fundamental concepts of Reinforcement Learning from scratch, and gain a deep understanding of the Q-Learning algorithm and how it applies in discrete action spaces.

Vector Add in Triton
Starting from simple vector addition, learn how to write Triton kernels and explore performance tuning techniques.

History of LLM Evolution (6): Unveiling the Mystery of Tokenizers
Gain a deep understanding of how tokenizers work, covering the BPE algorithm, the tokenization strategies of the GPT series, and the implementation details of SentencePiece.

The Way of Fine-Tuning
Learn how to fine-tune large language models under tight VRAM constraints, mastering key techniques such as half-precision training, quantization, LoRA, and QLoRA.

History of LLM Evolution (4): WaveNet — Convolutional Innovation in Sequence Models
Learn WaveNet's concept of progressively fusing context and implement a hierarchical tree structure to build deeper language models.

History of LLM Evolution (3): Batch Normalization — Statistical Harmony of Activations and Gradients
Develop a deep understanding of activation and gradient issues in neural network training, and learn how batch normalization addresses the challenges of training deep networks.

The State of GPT
A structured overview of Andrej Karpathy's Microsoft Build 2023 talk, covering GPT's training process and current development status, the LLM ecosystem, and the outlook for the future.

History of LLM Evolution (2): Embeddings — MLPs and Deep Language Connections
Explore Bengio's classic paper to understand how neural networks learn distributed representations of words and how to build a Neural Probabilistic Language Model (NPLM).

History of LLM Evolution (1): The Simplicity of Bigram
Start with the simplest Bigram model to explore the foundations of language modeling. Learn how to predict the next character using counts and probability distributions, and how to achieve the same effect within a neural network framework.

Building a Minimal Autograd Framework from Scratch
Follow Andrej Karpathy's micrograd project to build an automatic differentiation framework from scratch and deeply understand the core principles of backpropagation and the chain rule.