Conditional Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Wang, Kaiwen; Kidambi, Rahul; Sullivan, Ryan; Agarwal, Alekh; Dann, Christoph; Michi, Andrea; Gelmi, Marco; Li, Yunxuan; Gupta, Raghav; Dubey, Avinava; Ramé, Alexandre; Ferret, Johan; Cideron, Geoffrey; Hou, Le; Yu, Hongkun; Ahmed, Amr; Mehta, Aranyak; Hussenot, Léonard; Bachem, Olivier; Leurent, Edouard

Computer Science > Machine Learning

arXiv:2407.15762 (cs)

[Submitted on 22 Jul 2024 (v1), last revised 23 Oct 2024 (this version, v2)]

Title:Conditional Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Abstract:Reward-based finetuning is crucial for aligning language policies with intended behaviors (e.g., creativity and safety). A key challenge is to develop steerable language models that trade-off multiple (conflicting) objectives in a flexible and efficient manner. This paper presents Conditional Language Policy (CLP), a general framework for finetuning language models on multiple objectives. Building on techniques from multi-task training and parameter-efficient finetuning, CLP learn steerable models that effectively trade-off conflicting objectives at inference time. Notably, this does not require training or maintaining multiple models to achieve different trade-offs between the objectives. Through extensive experiments and ablations on two summarization datasets, we show that CLP learns steerable language models that outperform and Pareto-dominate the existing approaches for multi-objective finetuning.

Comments:	40 pages. Findings of EMNLP 2024
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2407.15762 [cs.LG]
	(or arXiv:2407.15762v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.15762

Submission history

From: Kaiwen Wang [view email]
[v1] Mon, 22 Jul 2024 16:13:38 UTC (8,453 KB)
[v2] Wed, 23 Oct 2024 17:42:39 UTC (8,581 KB)

Computer Science > Machine Learning

Title:Conditional Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Conditional Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators