Analysis updated 2026-06-24
Self-study the technical foundations of large language models chapter by chapter
Use as a Chinese-language textbook for a university course on LLMs
Find curated reading lists of key papers grouped by LLM subtopic
| zju-llms/foundations-of-llms | gustavoguanabara/html-css | steven-tey/novel | |
|---|---|---|---|
| Stars | 16,262 | 16,262 | 16,261 |
| Language | — | HTML | TypeScript |
| Setup difficulty | easy | easy | moderate |
| Complexity | 2/5 | 1/5 | 3/5 |
| Audience | researcher | general | developer |
Figures from each repo's GitHub metadata at analysis time.
Material is in Chinese, so non-Chinese readers will need translation help.
This repository is a free, open-access textbook on the foundations of large language models (LLMs), the AI systems that power tools like ChatGPT and Claude. Written in Chinese by a research team, it is designed for readers who want to understand how LLMs work at a technical level, from the basics up to cutting-edge methods. The book covers six main areas: how traditional and modern language models work at their core, how the architecture of large AI models has evolved over time, how to write effective prompts (instructions given to an AI model) to get better results, how to efficiently fine-tune (customize and adapt) a pre-existing model without retraining it from scratch, how to edit or correct specific pieces of knowledge stored inside a model, and how retrieval-augmented generation works, a technique where the model pulls in relevant information from an external source before responding. Each chapter is downloadable as a PDF, and the repo also provides curated lists of academic papers linked to each chapter's topic. The team plans monthly updates to keep the content current. This material is aimed at students, researchers, and practitioners who want a rigorous yet readable guide to LLM technology. No specific prior coding skill is assumed, but some familiarity with machine learning concepts would help.
Free open-access textbook in Chinese on the foundations of large language models, covering architectures, prompting, fine-tuning, knowledge editing, and RAG.
Setup difficulty is rated easy, with roughly 5min to a first successful run.
Mainly researcher.
This repo across BitVibe Labs
Verify against the repo before relying on details.