All Topics
All Topics
Technology
Technology
AI
AI
Business
Business
Entertainment
Entertainment
News
News
Programming
Programming
Security
Security
Science
Science
Design
Design
Environment
Environment
Finance
Finance
Crypto
Crypto
Politics
Politics
Sports
Sports
Education
Education
Gaming
Gaming
Art
Art
Music
Music
Health
Health
Books
Books
Food
Food
Travel
Travel
Personal
Personal
Bluesky
Twitter

PEFT-Arena: A Benchmark Evaluating Parameter-Efficient Finetuning Through the Stability-Plasticity Dilemma

By

[Submitted on 27 May 2026]

19d ago· 2 min readenInsight

Summary

This paper introduces PEFT-Arena, a benchmark for evaluating parameter-efficient finetuning (PEFT) methods for large language models through the lens of the stability-plasticity dilemma — the trade-off between adapting to new tasks (plasticity) and retaining pretrained capabilities (stability). The authors find that different PEFT methods exhibit distinct stability-plasticity profiles, with orthogonal finetuning achieving the best Pareto frontier under comparable parameter budgets. They analyze PEFT updates from geometric perspectives in weight space (spectral analysis) and activation space (representation distortion), and show that final SFT checkpoints often overshoot optimal target-retention trade-offs, suggesting post-hoc improvements like path-wise rewinding.

Source

bskyPEFT-Arena: A Benchmark Evaluating Parameter-Efficient Finetuning Through the Stability-Plasticity Dilemmaarxiv.org

Key quotes

· 4 pulled
We argue that PEFT should be assessed through the stability-plasticity dilemma: the trade-off between target-task adaptation and resistance to forgetting.
Across methods, we find distinct stability-plasticity profiles; under comparable parameter budgets, orthogonal finetuning achieves the most favorable Pareto frontier.
In activation space, retention metrics show whether finetuning preserves or distorts general-capability representations, with forgetting linked to non-isometric representation distortion.
An analysis shows that final SFT checkpoints often overshoot a better target-retention operating point.
Snippet from the RSS feed
Parameter-efficient finetuning (PEFT) has become the standard approach for adapting large language models, yet evaluations largely emphasize downstream accuracy while overlooking the retention of pretrained capabilities. We argue that PEFT should be asses

You might also wanna read

Comments

Sign in to join the conversation.

No comments yet. Be the first.