ProPara-CRTS: Canonical Referent Tracking for Reliable Evaluation of Entity State Tracking in Process Narratives

Bingyang Ye; Timothy Obiso; Jingxuan Tu; James Pustejovsky

Back

Conference proceeding

ProPara-CRTS: Canonical Referent Tracking for Reliable Evaluation of Entity State Tracking in Process Narratives

Bingyang Ye, Timothy Obiso, Jingxuan Tu and James Pustejovsky

PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SEMANTICS, pp.254-268

01/01/2025

Abstract

Computer Science

Computer Science, Artificial Intelligence

Computer Science, Theory & Methods

Language & Linguistics

Linguistics

Science & Technology

Social Sciences

Technology

Despite the abundance of datasets for procedural texts such as cooking recipes, resources that capture full process narratives, paragraph-long descriptions that follow how multiple entities evolve across a sequence of steps, remain scarce. Although synthetic resources offer useful toy settings, they fail to capture the linguistic variability of naturally occurring prose. ProPara remains the only sizeable, naturally occurring corpus of process narratives, yet ambiguities and inconsistencies in its schema and annotations hinder reliable evaluation of its core task Entity State Tracking (EST). In this paper, we introduce a Canonical Referent Tracking Schema (CRTS) that assigns every surface mention to a unique, immutable discourse referent and records that referent's existence and location at each step. Applying CRTS to ProPara, we release the re-annotated result as ProPara-CRTS. The new corpus resolves ambiguous participant mentions in ProPara and consistently boosts performance across a variety of models. This suggests that principled schema design and targeted re-annotation can unlock measurable improvements in EST, providing a sharper diagnostic of model capacity in process narratives understanding without any changes to model architecture.

Metrics

1 Record Views

Details

Title: ProPara-CRTS: Canonical Referent Tracking for Reliable Evaluation of Entity State Tracking in Process Narratives
Creators: Bingyang Ye - Brandeis University
Timothy Obiso - Brandeis Univ, 415 South St, Waltham, MA 02453 USA
Jingxuan Tu - Brandeis University
James Pustejovsky - Brandeis Univ, 415 South St, Waltham, MA 02453 USA
Contributors: K Evang (Editor)
L Kallmeyer (Editor)
S Pogodalla (Editor)
Publication Details: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SEMANTICS, pp.254-268
Publisher: Assoc Computational Linguistics-Acl
Number of pages: 15
Identifiers: 9924595811701921
Academic Unit: Michtom School of Computer Science; Benjamin and Mae Volen National Center for Complex Systems; Interdepartmental Program in Linguistics and Computational Linguistics
Language: English
Resource Type: Conference proceeding

ProPara-CRTS: Canonical Referent Tracking for Reliable Evaluation of Entity State Tracking in Process Narratives

Abstract

Metrics

Details

Brandeis University Social media