huggingface/peft

Analysis updated 2026-06-21

★ 21,070PythonAudience · researcherComplexity · 3/5Setup · moderate

Mindmap

mindmap
  root((peft))
    What It Does
      Cheap model fine-tuning
      Adapter training only
      Memory efficient
    Supported Methods
      LoRA
      Adapters
      Soft prompts
      IA3
    Tech Stack
      Python
      Transformers
      Diffusers
      Accelerate
    Use Cases
      Custom LLM tasks
      Single GPU training
      Image model adapters
    Audience
      ML researchers
      AI developers

mindmap root((peft)) What It Does Cheap model fine-tuning Adapter training only Memory efficient Supported Methods LoRA Adapters Soft prompts IA3 Tech Stack Python Transformers Diffusers Accelerate Use Cases Custom LLM tasks Single GPU training Image model adapters Audience ML researchers AI developers

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Fine-tune a large language model on your own dataset using a single consumer GPU with LoRA instead of full retraining.

USE CASE 2

Add task-specific behavior to a Hugging Face Transformers model while keeping the base model frozen and unchanged.

USE CASE 3

Save small adapter checkpoints per task instead of storing many full model copies.

USE CASE 4

Combine PEFT with quantization techniques like QLoRA to fit large model fine-tuning into minimal GPU memory.

What is it built with?

PythonPyTorchTransformersDiffusersAccelerate

How does it compare?

	huggingface/peft	zeyi-lin/hivisionidphotos	lilittlecat/awesome-free-chatgpt
Stars	21,070	21,059	21,039
Language	Python	Python	Python
Setup difficulty	moderate	moderate	easy
Complexity	3/5	2/5	1/5
Audience	researcher	developer	general

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · moderate Time to first run · 1h+

Requires a CUDA-capable GPU for practical use, full fine-tuning fits in ~14GB with LoRA vs ~47GB without for a 3B model.

The explanation does not specify the license terms for this repository.

In plain English

PEFT, short for Parameter-Efficient Fine-Tuning, is a Python library from Hugging Face for adapting very large pretrained AI models to new tasks without retraining the whole thing. Fine-tuning a modern large language model or image-generation model normally means updating billions of parameters, which is slow, expensive, and hungry for GPU memory. PEFT instead trains only a small number of extra parameters on top of the frozen base model, which the README says can match the quality of full fine-tuning while using a fraction of the compute and storage. The library packages several PEFT methods, adapters, LoRA, soft prompts, and IA3 are the ones called out in the README, with conceptual guides for each. You install it with pip install peft and use it by wrapping a regular Transformers model with a configuration object such as LoraConfig and a helper called get_peft_model. The example in the README adapts the Qwen2.5-3B-Instruct model with LoRA at rank 16, ends up training only about 0.12% of the model's parameters, and saves a small adapter checkpoint that can be reloaded for inference with PeftModel.from_pretrained. PEFT is designed to plug into the rest of the Hugging Face stack: it works with Transformers for training and inference, Diffusers for managing adapters in image models like Stable Diffusion, and Accelerate for distributed training. The README also shows memory tables where a 3B-parameter model that needs about 47GB of GPU memory for full fine-tuning fits in roughly 14GB with LoRA, and notes that PEFT pairs well with quantization techniques like QLoRA to fine-tune big LLMs on a single consumer GPU. This is useful for anyone who wants to specialize a large open model on their own data, researchers, hobbyists with a single GPU, or teams that need many task-specific variants without storing many giant checkpoints.

Copy-paste prompts

Prompt 1

I want to fine-tune a Llama model using LoRA on my custom dataset. Show me a complete PEFT setup with LoraConfig, get_peft_model, and a training loop.

Prompt 2

Using PEFT and Hugging Face Transformers, help me adapt a pretrained model for text classification on my own labeled data from start to finish.

Prompt 3

I have a Stable Diffusion model and want to add a LoRA adapter for a custom style. Walk me through doing this with PEFT and Diffusers.

Prompt 4

Help me load a saved PEFT LoRA adapter with PeftModel.from_pretrained and run inference combining the adapter with the base model.

Prompt 5

I want to use QLoRA to fine-tune a 7B-parameter model on a single consumer GPU. Walk me through the quantization and PEFT setup.

Frequently asked questions

What is peft?

PEFT is a Python library that lets you fine-tune large AI models cheaply by training only a tiny fraction of their parameters, so you can customize a model on your own data without needing expensive multi-GPU hardware.

What language is peft written in?

Mainly Python. The stack also includes Python, PyTorch, Transformers.

What license does peft use?

The explanation does not specify the license terms for this repository.

How hard is peft to set up?

Setup difficulty is rated moderate, with roughly 1h+ to a first successful run.

Who is peft for?

Mainly researcher.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub huggingface on gitmyhub

Verify against the repo before relying on details.