Frictive Policy Optimization for LLM Agent Interactions

James Pustejovsky; Nikhil Krishnaswamy

Back

Frictive Policy Optimization for LLM Agent Interactions

Conference presentation

Open access

Frictive Policy Optimization for LLM Agent Interactions

James Pustejovsky and Nikhil Krishnaswamy

05/20/2025

Abstract

Epistemic Tracking, Friction, Policy Learning, LLM Alignment

ACM Reference Format:

Anonymous Author(s) 2025 Frictive Policy Optimization for LLM Agent

Recent advances in the alignment of large language models (LLMs) toward human preference and values have dramatically expanded the capabilities of artificial intelligence in natural language understanding and generation. However, despite their impressive performance , these models often lack the reflective and deliberative qualities necessary for effective human-AI collaboration. Traditional policy optimization methods, such as Reinforcement Learning from Human Feedback (RLHF), Proximal Policy Optimization (PPO), or Direct Preference Optimization (DPO), primarily focus on maximizing task-related rewards or aligning outputs with human preferences. These approaches, however, tend to neglect the critical epistemic dimension of alignment: the ability of an AI system to reason about, question, and update its underlying beliefs. In this paper, we propose a novel framework termed Frictive Policy Optimization (FPO), which explicitly incorporates " friction " as a desirable property in the policy optimization process for LLMs. Beyond fostering reflective deliberation, our approach also challenges the conventional expectation that autonomous agents must always comply with human commands. By integrating mechanisms that incentivize appropriate non-compliance, what we term " beneficial disobedience " , FPO equips AI systems with the capacity to question potentially harmful or ill-advised instructions. This dual focus on epistemic alignment and responsible disobedience paves the way for more robust, safe, and collaborative human-AI interactions.

Files and links (1)

pdf

RaD_AI_2025 (5)358.28 kBDownload View

Open Access

Metrics

1 Record Views

Details

Title: Frictive Policy Optimization for LLM Agent Interactions
Creators: James Pustejovsky - Brandeis University, Michtom School of Computer Science
Nikhil Krishnaswamy
Grants: TRACE, HR00112490377, Defense Advanced Research Projects Agency (United States, Arlington) - ARPA
Identifiers: 9924458563501921
Academic Unit: Benjamin and Mae Volen National Center for Complex Systems; Interdepartmental Program in Linguistics and Computational Linguistics; Michtom School of Computer Science
Language: English
Resource Type: Conference presentation

Frictive Policy Optimization for LLM Agent Interactions

Abstract

Files and links (1)

Metrics

Details

Brandeis University Social media