hiyouga/llamafactory

Analysis updated 2026-05-18

★ 70,974PythonAudience · developerComplexity · 3/5LicenseSetup · moderate

Mindmap

mindmap
  root((LlamaFactory))
    What it does
      Fine-tune LLMs
      Train vision models
      Deploy with API
    Training methods
      LoRA and QLoRA
      Reward modeling
      PPO and DPO
    Interfaces
      Web UI LLaMA Board
      Command-line CLI
      OpenAI-compatible API
    Tech stack
      Python and PyTorch
      Hugging Face
      Docker support
    Use cases
      Domain-specific experts
      Customer support bots
      Medical Q&A systems

mindmap root((LlamaFactory)) What it does Fine-tune LLMs Train vision models Deploy with API Training methods LoRA and QLoRA Reward modeling PPO and DPO Interfaces Web UI LLaMA Board Command-line CLI OpenAI-compatible API Tech stack Python and PyTorch Hugging Face Docker support Use cases Domain-specific experts Customer support bots Medical Q&A systems

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Fine-tune a general chatbot into a specialized customer support agent for your company's products.

USE CASE 2

Train a medical question-answering model on your hospital's internal documentation and case studies.

USE CASE 3

Adapt a pre-trained model to understand domain-specific jargon in legal, financial, or technical fields.

USE CASE 4

Create a smaller, quantized model that runs efficiently on consumer GPUs for local deployment.

What is it built with?

PythonPyTorchHugging FaceGradiovLLMDocker

How does it compare?

	hiyouga/llamafactory	binary-husky/gpt_academic	pallets/flask
Stars	70,974	70,590	71,494
Language	Python	Python	Python
Setup difficulty	moderate	moderate	easy
Complexity	3/5	3/5	2/5
Audience	developer	researcher	developer

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · moderate Time to first run · 30min

Requires PyTorch and CUDA/GPU setup, plus downloading a model checkpoint from Hugging Face.

Use freely for any purpose including commercial. Keep the notice and disclose changes to the patent grant.

In plain English

LlamaFactory is a Python toolkit for fine-tuning large language models (LLMs) and vision-language models (VLMs). Fine-tuning means taking a pre-trained AI model that has already learned from massive amounts of text or data and further training it on your specific dataset so it becomes specialized for your use case, for example making a general model become an expert at customer support conversations or medical question answering. LlamaFactory makes this process easier by providing a unified interface that supports over 100 different models with minimal or no coding required. The toolkit supports a range of training approaches beyond basic fine-tuning, including LoRA and QLoRA (which are parameter-efficient techniques that only update a small fraction of the model's weights to save memory and compute), reward modeling, and reinforcement learning from human feedback methods like PPO and DPO. It handles models in quantized 2 to 8-bit formats, which allows large models to be fine-tuned on consumer-grade GPUs with less memory. A web-based graphical interface called LLaMA Board, built with Gradio, lets users configure and launch training runs without writing any code. The command-line interface serves more advanced users. After training, models can be deployed with a vLLM-powered API that follows the OpenAI API format, making integration straightforward. You would use LlamaFactory if you are a researcher or developer who wants to customize a pre-trained LLM for a specific task, dataset, or domain without building all the training infrastructure from scratch. It is also suitable for cloud-based training via Google Colab or similar services for people who do not have local GPU hardware. The tech stack is Python, PyTorch, and Hugging Face libraries, with Docker support for reproducible environments. It was published as a paper at ACL 2024.

Copy-paste prompts

Prompt 1

I want to fine-tune a Llama model on my custom dataset using LlamaFactory. Walk me through the steps to set up training with LoRA on a consumer GPU.

Prompt 2

How do I use LlamaFactory's web UI to configure and launch a training run without touching the command line?

Prompt 3

Show me how to deploy a fine-tuned model from LlamaFactory using the vLLM API so it's compatible with OpenAI client libraries.

Prompt 4

I have a small dataset and limited GPU memory. How should I use QLoRA in LlamaFactory to train efficiently?

Prompt 5

What's the process for using LlamaFactory to train a reward model for reinforcement learning from human feedback?

Frequently asked questions

What is llamafactory?

A Python toolkit for fine-tuning large language models on your own data with minimal coding, supporting 100+ models and memory-efficient training techniques.

What language is llamafactory written in?

Mainly Python. The stack also includes Python, PyTorch, Hugging Face.

What license does llamafactory use?

Use freely for any purpose including commercial. Keep the notice and disclose changes to the patent grant.

How hard is llamafactory to set up?

Setup difficulty is rated moderate, with roughly 30min to a first successful run.

Who is llamafactory for?

Mainly developer.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub hiyouga on gitmyhub

Verify against the repo before relying on details.