Quests
#GPT
2 posts

History of LLM Evolution (5): Building the Path of Self-Attention — The Future of Language Models from Transformer to GPT
Building the Transformer architecture from scratch and developing a deep understanding of its core components: self-attention, multi-head attention, residual connections, and layer normalization.

The State of GPT
A structured overview of Andrej Karpathy's Microsoft Build 2023 talk, covering GPT's training pipeline, the current state of LLM development, the broader ecosystem, and the future outlook.