Highlights: QLoRA, the IA3 PEFT method, support for QA and feature-extraction tasks, AutoPeftModelForxxx for a simplified UX, and LoRA for custom models with newly added utils.
QLoRA uses 4-bit quantization to compress a pretrained language model. The LM parameters are then frozen, and a relatively small number of trainable parameters are added to the model in the form of low-rank adapters. During fine-tuning, QLoRA backpropagates gradients through the frozen 4-bit quantized pretrained language model into the low-rank adapters, which are the only parameters updated during training. For more details, read the blog post "Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA".
To make fine-tuning more efficient, IA3 (Infused Adapter by Inhibiting and Amplifying Inner Activations) rescales inner activations with learned vectors. These learned vectors are injected into the attention and feedforward modules of a typical transformer-based architecture. They are the only trainable parameters during fine-tuning, so the original weights remain frozen. Dealing with learned vectors (as opposed to learned low-rank updates to a weight matrix, like LoRA) keeps the number of trainable parameters much smaller. For more details, read the paper "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning".
Addition of PeftModelForQuestionAnswering and PeftModelForFeatureExtraction classes to support QA and Feature Extraction tasks, respectively. This enables exciting new use-cases with PEFT, e.g., LoRA for semantic similarity tasks.
Introduces new AutoPeftModelForxxx classes, intended for users who want to quickly load and run PEFT models:
```python
from peft import AutoPeftModelForCausalLM

peft_model = AutoPeftModelForCausalLM.from_pretrained("ybelkada/opt-350m-lora")
```
Not a transformers model? No problem, we have got you covered: PEFT now enables the use of LoRA with custom models.
Improvements to the add_weighted_adapter method to support SVD for combining multiple LoRAs into a new LoRA adapter.
New utils such as unload and delete_adapter give users much better control over how they manage adapters.
PEFT is very extensible and easy to use for performing DreamBooth fine-tuning of Stable Diffusion. The community has added conversion scripts to convert PEFT models to the Civitai/webui format and vice versa.
- [CI] Fix CI - pin urlib by @younesbelkada in https://github.com/huggingface/peft/pull/402
- [Tests] Add soundfile to docker images by @younesbelkada in https://github.com/huggingface/peft/pull/401
- [core] Protect 4bit import by @younesbelkada in https://github.com/huggingface/peft/pull/480
- [core] Raise warning on using prepare_model_for_int8_training by @younesbelkada in https://github.com/huggingface/peft/pull/483
- [core] Add gradient checkpointing check by @younesbelkada in https://github.com/huggingface/peft/pull/404
- [LoRA] Allow applying LoRA at different stages by @younesbelkada in https://github.com/huggingface/peft/pull/429
- [Llama-Adapter] fix half precision inference + add tests by @younesbelkada in https://github.com/huggingface/peft/pull/456
- [core] Add safetensors integration by @younesbelkada in https://github.com/huggingface/peft/pull/553
- [core] Fix config kwargs by @younesbelkada in https://github.com/huggingface/peft/pull/561
- openai/whisper-large-v2 by @alvarobartt in https://github.com/huggingface/peft/pull/563
- get_peft_model by @samsja in https://github.com/huggingface/peft/pull/566
- [core] Correctly passing the kwargs all over the place by @younesbelkada in https://github.com/huggingface/peft/pull/575
- [test] Adds more CI tests by @younesbelkada in https://github.com/huggingface/peft/pull/586
- [tests] Fix dockerfile by @younesbelkada in https://github.com/huggingface/peft/pull/608
- [core] Add adapter_name in get_peft_model by @younesbelkada in https://github.com/huggingface/peft/pull/610
- [core] Stronger import of bnb by @younesbelkada in https://github.com/huggingface/peft/pull/605
- [Adalora] Add adalora 4bit by @younesbelkada in https://github.com/huggingface/peft/pull/598
- [AdaptionPrompt] Add 8bit + 4bit support for adaption prompt by @younesbelkada in https://github.com/huggingface/peft/pull/604
- PeftModel.disable_adapter by @ain-soph in https://github.com/huggingface/peft/pull/644
- AutoPeftModelForxxx by @younesbelkada in https://github.com/huggingface/peft/pull/694
- [Feature] Save only selected adapters for LoRA by @younesbelkada in https://github.com/huggingface/peft/pull/705
- [Auto] Support AutoPeftModel for custom HF models by @younesbelkada in https://github.com/huggingface/peft/pull/707
- [core] Better hub kwargs management by @younesbelkada in https://github.com/huggingface/peft/pull/712

Full Changelog: https://github.com/huggingface/peft/compare/v0.3.0...v0.4.0
The following contributors have made significant changes to the library over the last release:
@TimDettmers
@SumanthRH
@kovalexal
@sywangyi
@aarnphm
@martin-liu
@thomas-schillaci