hiyouga/chatglm-efficient-tuning

Analysis updated 2026-07-03

★ 3,722PythonAudience · researcherComplexity · 4/5Setup · hard

Mindmap

mindmap
  root((ChatGLM Tuning))
    What it does
      Fine-tune ChatGLM-6B
      Instruction following
      Domain adaptation
    Training methods
      LoRA
      P-Tuning V2
      Freeze layers
      Full fine-tuning
      RLHF
    Tech stack
      Python
      PyTorch
      Gradio UI
      Hugging Face
    Datasets
      English and Chinese
      Custom JSON format
      Hugging Face datasets
    Output
      OpenAI-compatible API
      Checkpoint files

mindmap root((ChatGLM Tuning)) What it does Fine-tune ChatGLM-6B Instruction following Domain adaptation Training methods LoRA P-Tuning V2 Freeze layers Full fine-tuning RLHF Tech stack Python PyTorch Gradio UI Hugging Face Datasets English and Chinese Custom JSON format Hugging Face datasets Output OpenAI-compatible API Checkpoint files

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Fine-tune ChatGLM-6B on a custom Chinese or English dataset to create a domain-specific AI assistant without retraining from scratch

USE CASE 2

Use the Gradio web interface to run model training and test responses without writing any command-line code

USE CASE 3

Deploy the fine-tuned model as an API server that accepts the same request format as ChatGPT, so it works in existing ChatGPT-based apps

USE CASE 4

Apply RLHF to align the model's responses with human preferences using your own comparison data

What is it built with?

PythonPyTorchGradioHugging FaceLoRARLHF

How does it compare?

	hiyouga/chatglm-efficient-tuning	llm-workflow-engine/llm-workflow-engine	insanum/gcalcli
Stars	3,722	3,722	3,721
Language	Python	Python	Python
Setup difficulty	hard	moderate	moderate
Complexity	4/5	3/5	2/5
Audience	researcher	developer	developer

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · hard Time to first run · 1day+

Requires a GPU with significant VRAM, project is archived and superseded by LLaMA-Factory.

In plain English

ChatGLM-6B is an open-source AI language model developed by Tsinghua University, capable of conducting conversations in both Chinese and English. ChatGLM Efficient Tuning is a Python toolkit for adjusting, or fine-tuning, this model on your own data, so it learns to follow your specific instructions or domain knowledge rather than its general training. Fine-tuning a large language model from scratch is expensive because it requires updating all the model's parameters. This project supports several techniques that make the process much cheaper. The main method is LoRA, which adds small trainable weight matrices on top of the frozen model, updating only those instead of the full set of billions of parameters. The project also supports P-Tuning V2, a method that adjusts how the model reads its input context, and a simpler freeze approach that only updates the final layers. Full tuning of all parameters is available for cases where compute is not a constraint. A web browser interface is included, built with Gradio, which means you can run training, evaluation, and inference without writing command-line instructions. The project also supports Reinforcement Learning with Human Feedback (RLHF), a technique used to align model output with human preferences by training a reward model on human comparisons. A demo API server is included that matches the OpenAI chat format, so the fine-tuned model can be dropped into existing ChatGPT-based applications. The project includes a large collection of supported datasets in English and Chinese, covering general chat, math, code, and web content. Some datasets require a Hugging Face account and access approval before downloading. Custom datasets can be added by providing a JSON file in the expected format and updating a dataset registry file. The maintainer has since moved active development to a successor project called LLaMA-Factory, which covers a broader range of models. This repository is no longer being updated. Users who want to fine-tune ChatGLM2-6B or later models are directed to that newer project.

Copy-paste prompts

Prompt 1

Using chatglm-efficient-tuning with LoRA, fine-tune ChatGLM-6B on a JSON dataset of customer support conversations so the model answers in the style of my support team

Prompt 2

Show me how to set up the Gradio web interface for chatglm-efficient-tuning and run an inference test on a fine-tuned checkpoint

Prompt 3

How do I format a custom dataset as the JSON structure expected by chatglm-efficient-tuning and register it in the dataset registry?

Prompt 4

What LoRA rank and alpha settings should I start with when fine-tuning ChatGLM-6B on a dataset of 5000 instruction-response pairs?

Frequently asked questions

What is chatglm-efficient-tuning?

Python toolkit for teaching the ChatGLM-6B AI language model to follow your custom instructions or domain knowledge, using efficient training techniques that don't require massive computing resources.

What language is chatglm-efficient-tuning written in?

Mainly Python. The stack also includes Python, PyTorch, Gradio.

How hard is chatglm-efficient-tuning to set up?

Setup difficulty is rated hard, with roughly 1day+ to a first successful run.

Who is chatglm-efficient-tuning for?

Mainly researcher.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub hiyouga on gitmyhub

Verify against the repo before relying on details.