Analysis updated 2026-07-03
Fine-tune ChatGLM-6B on a custom Chinese or English dataset to create a domain-specific AI assistant without retraining from scratch
Use the Gradio web interface to run model training and test responses without writing any command-line code
Deploy the fine-tuned model as an API server that accepts the same request format as ChatGPT, so it works in existing ChatGPT-based apps
Apply RLHF to align the model's responses with human preferences using your own comparison data
| hiyouga/chatglm-efficient-tuning | llm-workflow-engine/llm-workflow-engine | insanum/gcalcli | |
|---|---|---|---|
| Stars | 3,722 | 3,722 | 3,721 |
| Language | Python | Python | Python |
| Setup difficulty | hard | moderate | moderate |
| Complexity | 4/5 | 3/5 | 2/5 |
| Audience | researcher | developer | developer |
Figures from each repo's GitHub metadata at analysis time.
Requires a GPU with significant VRAM, project is archived and superseded by LLaMA-Factory.
ChatGLM-6B is an open-source AI language model developed by Tsinghua University, capable of conducting conversations in both Chinese and English. ChatGLM Efficient Tuning is a Python toolkit for adjusting, or fine-tuning, this model on your own data, so it learns to follow your specific instructions or domain knowledge rather than its general training. Fine-tuning a large language model from scratch is expensive because it requires updating all the model's parameters. This project supports several techniques that make the process much cheaper. The main method is LoRA, which adds small trainable weight matrices on top of the frozen model, updating only those instead of the full set of billions of parameters. The project also supports P-Tuning V2, a method that adjusts how the model reads its input context, and a simpler freeze approach that only updates the final layers. Full tuning of all parameters is available for cases where compute is not a constraint. A web browser interface is included, built with Gradio, which means you can run training, evaluation, and inference without writing command-line instructions. The project also supports Reinforcement Learning with Human Feedback (RLHF), a technique used to align model output with human preferences by training a reward model on human comparisons. A demo API server is included that matches the OpenAI chat format, so the fine-tuned model can be dropped into existing ChatGPT-based applications. The project includes a large collection of supported datasets in English and Chinese, covering general chat, math, code, and web content. Some datasets require a Hugging Face account and access approval before downloading. Custom datasets can be added by providing a JSON file in the expected format and updating a dataset registry file. The maintainer has since moved active development to a successor project called LLaMA-Factory, which covers a broader range of models. This repository is no longer being updated. Users who want to fine-tune ChatGLM2-6B or later models are directed to that newer project.
Python toolkit for teaching the ChatGLM-6B AI language model to follow your custom instructions or domain knowledge, using efficient training techniques that don't require massive computing resources.
Mainly Python. The stack also includes Python, PyTorch, Gradio.
Setup difficulty is rated hard, with roughly 1day+ to a first successful run.
Mainly researcher.
This repo across BitVibe Labs
Verify against the repo before relying on details.