This is a small patch release of PEFT.
Full Changelog: https://github.com/huggingface/peft/compare/v0.8.0...v0.8.1
Parameter-efficient fine-tuning (PEFT) for cross-task generalization consists of pre-training adapters on a multi-task training set before few-shot adaptation to test tasks. Polytropon [Ponti et al., 2023] (`Poly`) jointly learns an inventory of adapters and a routing function that selects a (variable-size) subset of adapters for each task during both pre-training and few-shot adaptation. Put simply, you can think of it as a mixture of expert adapters. `MHR` (Multi-Head Routing) combines subsets of adapter parameters and outperforms `Poly` under a comparable parameter budget; by fine-tuning only the routing function and not the adapters (`MHR-z`), it achieves competitive performance with extreme parameter efficiency.
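As a rough illustration of the routing idea, here is a minimal plain-Python sketch (the function name and shapes are made up for illustration; the actual Poly/MHR implementation routes over adapter parameters, not outputs):

```python
def route_adapters(x, adapters, routing_weights):
    """Mix the outputs of several expert adapters with learned per-task
    routing weights: y = sum_i w_i * adapter_i(x).

    `adapters` is a list of callables (one per expert adapter) and
    `routing_weights` the learned mixture weights for the current task.
    """
    outputs = [adapter(x) for adapter in adapters]
    return sum(w * out for w, out in zip(routing_weights, outputs))
```

With a variable-size subset, some routing weights are simply zero, so the corresponding adapters are skipped for that task.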
Now, you can pass `all-linear` to the `target_modules` parameter of `LoraConfig` to target all the linear layers, which the QLoRA paper showed performs better than targeting only the query and value attention layers.
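Conceptually, such a flag has to enumerate the linear modules of the model while skipping the output head. The sketch below is illustrative (the function name and the class-name matching are assumptions, not PEFT's actual implementation):

```python
def find_all_linear_names(named_modules, exclude=("lm_head",)):
    """Collect the names of every Linear-type module, skipping the output
    head. `named_modules` is a list of (name, class_name) pairs, mimicking
    what iterating over a model's modules would yield.
    """
    return [
        name
        for name, class_name in named_modules
        if class_name in ("Linear", "Linear4bit", "Linear8bitLt")
        and not any(skip in name for skip in exclude)
    ]
```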
Embedding layers of base models are now automatically saved when they have been resized while fine-tuning with PEFT approaches like LoRA. This enables extending the tokenizer's vocabulary to include special tokens, which is a common use case.
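A minimal sketch of the resizing step itself (plain-Python stand-in for illustration; in practice this is what transformers' `model.resize_token_embeddings(len(tokenizer))` does for you):

```python
def resize_embedding_table(table, new_vocab_size, init_row):
    """Grow an embedding table to cover newly added special tokens.

    `table` is a list of embedding rows; new rows are initialized from
    `init_row`. Because these rows did not exist in the base checkpoint,
    they must be saved alongside the adapter weights.
    """
    extra = new_vocab_size - len(table)
    return table + [list(init_row) for _ in range(extra)]
```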
New option `use_rslora` in `LoraConfig`. Use it for ranks greater than 32 and see an increase in fine-tuning performance (with the same or better performance for ranks below 32 as well).
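The change rsLoRA makes is confined to the scaling factor applied to the low-rank update, so that the update's magnitude does not shrink as the rank grows (`lora_scaling` below is an illustrative helper, not a PEFT function):

```python
import math

def lora_scaling(lora_alpha, r, use_rslora=False):
    """Standard LoRA scales the low-rank update by alpha / r; rank-stabilized
    LoRA (rsLoRA) uses alpha / sqrt(r) instead, which is why the benefit
    shows up at higher ranks.
    """
    return lora_alpha / math.sqrt(r) if use_rslora else lora_alpha / r
```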
* `all-linear` flag by @SumanthRH in https://github.com/huggingface/peft/pull/1357
* [Tests] Add bitsandbytes installed from source on new docker images by @younesbelkada in https://github.com/huggingface/peft/pull/1275
* [bnb] Add bnb nightly workflow by @younesbelkada in https://github.com/huggingface/peft/pull/1282
* [bnb-nightly] Address final comments by @younesbelkada in https://github.com/huggingface/peft/pull/1287
* `prepare_inputs_for_generation` logic for Prompt Learning methods by @pacman100 in https://github.com/huggingface/peft/pull/1352

Full Changelog: https://github.com/huggingface/peft/compare/v0.7.1...v0.8.0
This is a small patch release of PEFT that should handle:
Full Changelog: https://github.com/huggingface/peft/compare/v0.7.0...v0.7.1
* merge (#1132)
* "gaussian" (#1189)
* Instead of `adapter_model.bin`, calling `save_pretrained` now creates `adapter_model.safetensors`. Safetensors have numerous advantages over pickle files (the PyTorch default format) and are well supported on the Hugging Face Hub.
* When using `add_weighted_adapter` with the option `combination_type="linear"`, the scaling of the adapter weights is now performed differently, leading to improved results.
* `peft.lora.Linear` is no longer a subclass of `nn.Linear`, so `isinstance` checks may need updating. Also, to retrieve the original weight of an adapted layer, now use `self.get_base_layer().weight`, not `self.weight` (same for the bias).

As always, a bunch of small improvements, bug fixes and doc improvements were added. We thank all the external contributors, both new and recurring. Below is the list of all changes since the last release.
* [Docker] Update Dockerfile to force-use transformers main by @younesbelkada in https://github.com/huggingface/peft/pull/1085
* [core] Fix safetensors serialization for shared tensors by @younesbelkada in https://github.com/huggingface/peft/pull/1101
* `id_tensor_storage` by @younesbelkada in https://github.com/huggingface/peft/pull/1116
* `ModulesToSaveWrapper` when using Low-level API by @younesbelkada in https://github.com/huggingface/peft/pull/1112
* `adapter_names` when calling merge by @younesbelkada in https://github.com/huggingface/peft/pull/1132
* [Tests] Fix daily CI by @younesbelkada in https://github.com/huggingface/peft/pull/1136
* [core / LoRA] Add `adapter_names` in bnb layers by @younesbelkada in https://github.com/huggingface/peft/pull/1139
* [Tests] Do not stop tests if a job failed by @younesbelkada in https://github.com/huggingface/peft/pull/1141
* Use `huggingface_hub.file_exists` instead of custom helper by @Wauplin in https://github.com/huggingface/peft/pull/1145
* `add_weighted_adapter` method by @pacman100 in https://github.com/huggingface/peft/pull/1169
* [Tests] Migrate to AWS runners by @younesbelkada in https://github.com/huggingface/peft/pull/1185
* `modules_to_save` is specified and multiple adapters are being unloaded by @pacman100 in https://github.com/huggingface/peft/pull/1137

Full Changelog: https://github.com/huggingface/peft/compare/v0.6.2...v0.7.0
The following contributors have made significant changes to the library over the last release:
@alexrs
@callanwu
@elyxlz
@lukaskuhn-lku
@okotaku
@yxli2123
@zhangsheng377
This patch release refactors the adapter deletion API and fixes issues with `ModulesToSaveWrapper` when using the low-level API.
* `ModulesToSaveWrapper` when using Low-level API by @younesbelkada in https://github.com/huggingface/peft/pull/1112
* `id_tensor_storage` by @younesbelkada in https://github.com/huggingface/peft/pull/1116

Full Changelog: https://github.com/huggingface/peft/compare/v0.6.1...v0.6.2
This patch release fixes the compatibility issues with Adaption Prompt that users faced with transformers 4.35.0. Moreover, it fixes an issue with token classification PEFT models when saving them using safetensors.
* [core] Fix safetensors serialization for shared tensors by @younesbelkada in https://github.com/huggingface/peft/pull/1101
* [Docker] Update Dockerfile to force-use transformers main by @younesbelkada in https://github.com/huggingface/peft/pull/1085

Full Changelog: https://github.com/huggingface/peft/compare/v0.6.0...v0.6.1
🧨 Diffusers now leverages PEFT as a backend for LoRA inference for Stable Diffusion models (#873, #993, #961). Relevant PRs on 🧨 Diffusers are https://github.com/huggingface/diffusers/pull/5058, https://github.com/huggingface/diffusers/pull/5147, https://github.com/huggingface/diffusers/pull/5151 and https://github.com/huggingface/diffusers/pull/5359. This unlocks a vast number of practically demanding use cases around adapter-based inference 🚀. You can now do all of this with easy-to-use APIs, with support for different checkpoint formats (Diffusers format, Kohya format, ...).
For details, refer to the documentation at Inference with PEFT.
`r=0`). This used to be possible, in which case the adapter was ignored.

As always, a bunch of small improvements, bug fixes and doc improvements were added. We thank all the external contributors, both new and recurring. Below is the list of all changes since the last release.
* [CI] Pin diffusers by @younesbelkada in https://github.com/huggingface/peft/pull/936
* [LoRA] Add scale_layer / unscale_layer by @younesbelkada in https://github.com/huggingface/peft/pull/935
* [tests] add transformers & diffusers integration tests by @younesbelkada in https://github.com/huggingface/peft/pull/962
* `safe_merge` option in merge by @younesbelkada in https://github.com/huggingface/peft/pull/1001
* [core / LoRA] Add `safe_merge` to bnb layers by @younesbelkada in https://github.com/huggingface/peft/pull/1009
* [LoRA] Revert original behavior for scale / unscale by @younesbelkada in https://github.com/huggingface/peft/pull/1029
* [LoRA] Raise error when adapter name not found in `set_scale` by @younesbelkada in https://github.com/huggingface/peft/pull/1034
* [core] Fix `use_reentrant` issues by @younesbelkada in https://github.com/huggingface/peft/pull/1036
* [tests] Update Dockerfile to use cuda 12.2 by @younesbelkada in https://github.com/huggingface/peft/pull/1050

Full Changelog: https://github.com/huggingface/peft/compare/v0.5.0...v0.6.0
Now, you can finetune GPTQ quantized models using PEFT. Here are some examples of how to use PEFT with a GPTQ model: colab notebook and finetuning script.
Enables users and developers to use PEFT as a utility library, at least for injectable adapters (LoRA, IA3, AdaLoRA). It exposes an API to modify the model in place to inject the new layers into the model.
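In-place injection boils down to swapping targeted submodules for adapter-wrapped versions. A toy sketch of the idea (the names and the dict-based model are illustrative; the real API is `inject_adapter_in_model`):

```python
def inject_adapters(modules, target_names, wrap):
    """Replace targeted submodules with adapter-wrapped versions, in place.

    `modules` maps module names to layers (a stand-in for a model's module
    tree) and `wrap` builds an adapter layer around a base layer.
    """
    for name in target_names:
        modules[name] = wrap(modules[name])
    return modules
```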
* [core] PEFT refactor + introducing `inject_adapter_in_model` public method by @younesbelkada in https://github.com/huggingface/peft/pull/749
* [Low-level-API] Add docs about LLAPI by @younesbelkada in https://github.com/huggingface/peft/pull/836

Leverage the support for more devices for loading and fine-tuning PEFT adapters.
Stable support and new ways of merging multiple LoRAs. There are currently three supported ways of merging LoRAs: `linear`, `svd`, and `cat`.
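The `linear` combination type amounts to a weighted elementwise sum of the adapters' delta weights. A plain-Python sketch (illustrative, not PEFT's implementation; `svd` instead decomposes the combined update, and `cat` concatenates the low-rank factors):

```python
def combine_linear(adapter_deltas, weights):
    """Weighted elementwise sum of per-adapter delta-weight matrices,
    given as nested lists: result[i][j] = sum_k weights[k] * deltas[k][i][j].
    """
    n_rows = len(adapter_deltas[0])
    n_cols = len(adapter_deltas[0][0])
    return [
        [
            sum(w * delta[i][j] for w, delta in zip(weights, adapter_deltas))
            for j in range(n_cols)
        ]
        for i in range(n_rows)
    ]
```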
* [Llama2] Add disabling TP behavior by @younesbelkada in https://github.com/huggingface/peft/pull/728
* [Patch] patch trainable params for 4bit layers by @younesbelkada in https://github.com/huggingface/peft/pull/733
* [AdaLora] Fix adalora inference issue by @younesbelkada in https://github.com/huggingface/peft/pull/745
* [ModulesToSave] add correct hook management for modules to save by @younesbelkada in https://github.com/huggingface/peft/pull/755
* [core] PEFT refactor + introducing `inject_adapter_in_model` public method by @younesbelkada in https://github.com/huggingface/peft/pull/749
* [Docker] Fix gptq dockerfile by @younesbelkada in https://github.com/huggingface/peft/pull/835
* [Tests] Add 4bit slow training tests by @younesbelkada in https://github.com/huggingface/peft/pull/834
* [Low-level-API] Add docs about LLAPI by @younesbelkada in https://github.com/huggingface/peft/pull/836

Full Changelog: https://github.com/huggingface/peft/compare/v0.4.0...v0.5.0
QLoRA uses 4-bit quantization to compress a pretrained language model. The LM parameters are then frozen and a relatively small number of trainable parameters are added to the model in the form of Low-Rank Adapters. During finetuning, QLoRA backpropagates gradients through the frozen 4-bit quantized pretrained language model into the Low-Rank Adapters. The LoRA layers are the only parameters being updated during training. For more details read the blog Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA
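The mechanics can be sketched with plain lists (illustrative only; real code uses a frozen, quantized torch weight for `W` and trainable tensors for `A` and `B`):

```python
def lora_forward(x, w_base, a, b, lora_alpha=16, r=2):
    """y = W x + (alpha / r) * B (A x).

    The frozen (quantized) base weight `w_base` is used as-is; only the
    small low-rank matrices `a` (r x d_in) and `b` (d_out x r) are trained,
    which is why so few parameters receive gradients.
    """
    def matvec(m, v):
        return [sum(mi * vi for mi, vi in zip(row, v)) for row in m]

    base = matvec(w_base, x)
    update = matvec(b, matvec(a, x))
    scale = lora_alpha / r
    return [y + scale * u for y, u in zip(base, update)]
```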
* [core] Protect 4bit import by @younesbelkada in https://github.com/huggingface/peft/pull/480
* [core] Raise warning on using `prepare_model_for_int8_training` by @younesbelkada in https://github.com/huggingface/peft/pull/483

To make fine-tuning more efficient, IA3 (Infused Adapter by Inhibiting and Amplifying Inner Activations) rescales inner activations with learned vectors. These learned vectors are injected into the attention and feedforward modules in a typical transformer-based architecture. These learned vectors are the only trainable parameters during fine-tuning, and thus the original weights remain frozen. Dealing with learned vectors (as opposed to learned low-rank updates to a weight matrix like LoRA) keeps the number of trainable parameters much smaller. For more details, read the paper Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning.
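The core IA3 operation is just an elementwise rescaling of activations by a learned vector, sketched below (illustrative helper name, not a PEFT function):

```python
def ia3_rescale(activations, learned_vector):
    """IA3: elementwise rescaling of inner activations by a learned vector.

    The vector is the only trainable parameter for this position in the
    network; the base weights that produced `activations` stay frozen.
    """
    return [l * h for l, h in zip(learned_vector, activations)]
```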
Addition of PeftModelForQuestionAnswering and PeftModelForFeatureExtraction classes to support QA and Feature Extraction tasks, respectively. This enables exciting new use-cases with PEFT, e.g., LoRA for semantic similarity tasks.
Introduces a new paradigm, `AutoPeftModelForxxx`, intended for users who want to rapidly load and run PEFT models.
```python
from peft import AutoPeftModelForCausalLM

peft_model = AutoPeftModelForCausalLM.from_pretrained("ybelkada/opt-350m-lora")
```
* `AutoPeftModelForxxx` by @younesbelkada in https://github.com/huggingface/peft/pull/694

Not a transformers model? No problem, we have got you covered. PEFT now enables the usage of LoRA with custom models.
Improvements to the `add_weighted_adapter` method to support SVD for combining multiple LoRAs when creating a new LoRA.
New utils such as `unload` and `delete_adapter`, giving users much better control over how they manage adapters.
PEFT is very extensible and easy to use for performing DreamBooth with Stable Diffusion. The community has added conversion scripts to use PEFT models with the Civitai/webui format and vice versa.
* [CI] Fix CI - pin urlib by @younesbelkada in https://github.com/huggingface/peft/pull/402
* [Tests] Add soundfile to docker images by @younesbelkada in https://github.com/huggingface/peft/pull/401
* [core] Protect 4bit import by @younesbelkada in https://github.com/huggingface/peft/pull/480
* [core] Raise warning on using `prepare_model_for_int8_training` by @younesbelkada in https://github.com/huggingface/peft/pull/483
* [core] Add gradient checkpointing check by @younesbelkada in https://github.com/huggingface/peft/pull/404
* [LoRA] Allow applying LoRA at different stages by @younesbelkada in https://github.com/huggingface/peft/pull/429
* [Llama-Adapter] fix half precision inference + add tests by @younesbelkada in https://github.com/huggingface/peft/pull/456
* [core] Add safetensors integration by @younesbelkada in https://github.com/huggingface/peft/pull/553
* [core] Fix config kwargs by @younesbelkada in https://github.com/huggingface/peft/pull/561
* `openai/whisper-large-v2` by @alvarobartt in https://github.com/huggingface/peft/pull/563
* `get_peft_model` by @samsja in https://github.com/huggingface/peft/pull/566
* [core] Correctly passing the kwargs all over the place by @younesbelkada in https://github.com/huggingface/peft/pull/575
* [test] Adds more CI tests by @younesbelkada in https://github.com/huggingface/peft/pull/586
* [tests] Fix dockerfile by @younesbelkada in https://github.com/huggingface/peft/pull/608
* [core] Add `adapter_name` in `get_peft_model` by @younesbelkada in https://github.com/huggingface/peft/pull/610
* [core] Stronger import of bnb by @younesbelkada in https://github.com/huggingface/peft/pull/605
* [Adalora] Add adalora 4bit by @younesbelkada in https://github.com/huggingface/peft/pull/598
* [AdaptionPrompt] Add 8bit + 4bit support for adaption prompt by @younesbelkada in https://github.com/huggingface/peft/pull/604
* `PeftModel.disable_adapter` by @ain-soph in https://github.com/huggingface/peft/pull/644
* `AutoPeftModelForxxx` by @younesbelkada in https://github.com/huggingface/peft/pull/694
* [Feature] Save only selected adapters for LoRA by @younesbelkada in https://github.com/huggingface/peft/pull/705
* [Auto] Support AutoPeftModel for custom HF models by @younesbelkada in https://github.com/huggingface/peft/pull/707
* [core] Better hub kwargs management by @younesbelkada in https://github.com/huggingface/peft/pull/712

Full Changelog: https://github.com/huggingface/peft/compare/v0.3.0...v0.4.0
The following contributors have made significant changes to the library over the last release:
@TimDettmers
@SumanthRH
@kovalexal
@sywangyi
@aarnphm
@martin-liu
@thomas-schillaci
With task guides, conceptual guides, integration guides, and code references all available at your fingertips, 🤗 PEFT's docs (found at https://huggingface.co/docs/peft) provide an insightful and easy-to-follow resource for anyone looking to learn how to use 🤗 PEFT. Whether you're a seasoned pro or just starting out, PEFT's documentation will help you get the most out of it.
Comprising both unit and integration tests, the test suite rigorously exercises core features, examples, and various models on different setups, including single and multiple GPUs. This commitment to testing helps ensure that PEFT maintains the highest levels of correctness, usability, and performance, while continuously improving in all areas.
* [CI] Add ci tests by @younesbelkada in https://github.com/huggingface/peft/pull/203
* [CI] Add more ci tests by @younesbelkada in https://github.com/huggingface/peft/pull/223
* [tests] Adds more tests + fix failing tests by @younesbelkada in https://github.com/huggingface/peft/pull/238
* [tests] Adds GPU tests by @younesbelkada in https://github.com/huggingface/peft/pull/256
* [tests] add slow tests to GH workflow by @younesbelkada in https://github.com/huggingface/peft/pull/304
* [core] Better log messages by @younesbelkada in https://github.com/huggingface/peft/pull/366

PEFT just got even more versatile with its new Multi Adapter Support! Now you can train and infer with multiple adapters, or even combine multiple LoRA adapters in a weighted combination. This is especially handy for RLHF training, where you can save memory by using a single base model with multiple adapters for the actor, critic, reward, and reference models. And the icing on the cake? Check out the LoRA DreamBooth inference example notebook to see this feature in action.
PEFT just got even better, thanks to the contributions of the community! The AdaLoRA method is one of the exciting new additions. It takes the highly regarded LoRA method and improves it by allocating trainable parameters across the model to maximize performance within a given parameter budget. Another standout is the Adaption Prompt method, which enhances the already popular Prefix Tuning by introducing zero init attention.
Good news for LoRA users! PEFT now allows you to merge LoRA parameters into the base model's parameters, giving you the freedom to remove the PEFT wrapper and apply downstream optimizations related to inference and deployment. Plus, you can use all the features that are compatible with the base model without any issues.
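Merging folds the adapter into the base weight so the PEFT wrapper can be dropped. A plain-list sketch of the per-layer computation (illustrative; the real code operates on torch tensors):

```python
def merge_lora(w_base, a, b, lora_alpha=16, r=2):
    """Return W + (alpha / r) * B @ A.

    `w_base` is the frozen base weight (d_out x d_in), `a` the low-rank
    down-projection (r x d_in), `b` the up-projection (d_out x r). After
    this merge, inference needs no extra adapter computation at all.
    """
    scale = lora_alpha / r
    rows, cols = len(w_base), len(w_base[0])
    return [
        [
            w_base[i][j] + scale * sum(b[i][k] * a[k][j] for k in range(len(a)))
            for j in range(cols)
        ]
        for i in range(rows)
    ]
```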
* [utils] add merge_lora utility function by @younesbelkada in https://github.com/huggingface/peft/pull/227
* [core] Fix peft multi-gpu issue by @younesbelkada in https://github.com/huggingface/peft/pull/145
* [CI] Add ci tests by @younesbelkada in https://github.com/huggingface/peft/pull/203
* `main` by @younesbelkada in https://github.com/huggingface/peft/pull/224
* [CI] Add more ci tests by @younesbelkada in https://github.com/huggingface/peft/pull/223
* [core] Fix offload issue by @younesbelkada in https://github.com/huggingface/peft/pull/248
* [Automation] Add stale bot by @younesbelkada in https://github.com/huggingface/peft/pull/247
* [Automation] Update stale.py by @younesbelkada in https://github.com/huggingface/peft/pull/254
* [tests] Adds more tests + fix failing tests by @younesbelkada in https://github.com/huggingface/peft/pull/238
* [tests] Adds GPU tests by @younesbelkada in https://github.com/huggingface/peft/pull/256
* [test] Add Dockerfile by @younesbelkada in https://github.com/huggingface/peft/pull/278
* [tests] add CI training tests by @younesbelkada in https://github.com/huggingface/peft/pull/311
* `merge_and_unload` when having additional trainable modules by @pacman100 in https://github.com/huggingface/peft/pull/322
* Add `pip` caching to CI by @SauravMaheshkar in https://github.com/huggingface/peft/pull/314
* [tests] add slow tests to GH workflow by @younesbelkada in https://github.com/huggingface/peft/pull/304
* [core] Better log messages by @younesbelkada in https://github.com/huggingface/peft/pull/366
* `try` and `finally` in `disable_adapter()` to catch exceptions by @mukobi in https://github.com/huggingface/peft/pull/368
* [CI] Fix nightly CI issues by @younesbelkada in https://github.com/huggingface/peft/pull/375

The following contributors have made significant changes to the library over the last release:
@QingruZhang
@yeoedward
@Splo2t
We tested PEFT on @OpenAI's Whisper Large model and got: (i) 5x larger batch sizes, (ii) less than 8GB of GPU VRAM, and (iii) best of all, almost no degradation in WER 🤯
Without PEFT:
With PEFT:
The `prepare_for_int8_training` utility enables preprocessing the base model to make it ready for INT8 training.
* [core] add `prepare_model_for_training` by @younesbelkada in https://github.com/huggingface/peft/pull/85
* [core] Some changes with `prepare_model_for_training` & few fixes by @younesbelkada in https://github.com/huggingface/peft/pull/105

The `disable_adapter()` context manager makes it possible to disable adapter layers and get outputs from the frozen base model. An exciting application of this feature is that only a single model copy is needed for both policy model and reference model generations in RLHF.
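A toy sketch of the disable-and-restore pattern behind such a context manager (illustrative stand-in; the real API is `PeftModel.disable_adapter()`):

```python
from contextlib import contextmanager

@contextmanager
def adapters_disabled(layers):
    """Temporarily bypass adapter layers so the frozen base model's outputs
    come through, then restore the previous state on exit (even if the body
    raises). `layers` is a list of dicts with an "adapter_enabled" flag.
    """
    previous = [layer["adapter_enabled"] for layer in layers]
    for layer in layers:
        layer["adapter_enabled"] = False
    try:
        yield
    finally:
        for layer, state in zip(layers, previous):
            layer["adapter_enabled"] = state
```

The `try`/`finally` mirrors why restoring state robustly matters in RLHF loops: the reference generations run inside the context, and the policy's adapters must come back on afterwards no matter what.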
* [core] add `prepare_model_for_training` by @younesbelkada in https://github.com/huggingface/peft/pull/85
* [bnb] add flan-t5 example by @younesbelkada in https://github.com/huggingface/peft/pull/86
* Make `prepare_model_for_training` flexible by @pacman100 in https://github.com/huggingface/peft/pull/90
* Make `bnb` optional by @pacman100 in https://github.com/huggingface/peft/pull/97
* [core] Some changes with `prepare_model_for_training` & few fixes by @younesbelkada in https://github.com/huggingface/peft/pull/105
* Add `EleutherAI/gpt-neox-20b` to support matrix by @pacman100 in https://github.com/huggingface/peft/pull/109
* [core] Fix autocast issue by @younesbelkada in https://github.com/huggingface/peft/pull/121
* `prepare_for_int8_training` by @pacman100 in https://github.com/huggingface/peft/pull/127
* `pyproject.toml` by @SauravMaheshkar in https://github.com/huggingface/peft/pull/125

The following contributors have made significant changes to the library over the last release:
Full Changelog: https://github.com/huggingface/peft/compare/v0.1.0...v0.2.0
Initial release of 🤗 PEFT. Checkout the main README to learn more about it!