Towards Informative Few-Shot Prompt with Maximum Information Gain for In-Context Learning

Hongfu Liu; Ye Wang

doi:10.48550/arxiv.2310.08923

Back

Preprint

Towards Informative Few-Shot Prompt with Maximum Information Gain for In-Context Learning

Hongfu Liu and Ye Wang

arXiv.org

10/13/2023

DOI: https://doi.org/10.48550/arxiv.2310.08923

Abstract

Computer Science - Computation and Language

Large Language models (LLMs) possess the capability to engage In-context Learning (ICL) by leveraging a few demonstrations pertaining to a new downstream task as conditions. However, this particular learning paradigm suffers from high instability stemming from substantial variances induced by factors such as the input distribution of selected examples, their ordering, and prompt formats. In this work, we demonstrate that even when all these factors are held constant, the random selection of examples still results in high variance. Consequently, we aim to explore the informative ability of data examples by quantifying the Information Gain (IG) obtained in prediction after observing a given example candidate. Then we propose to sample those with maximum IG. Additionally, we identify the presence of template bias, which can lead to unfair evaluations of IG during the sampling process. To mitigate this bias, we introduce Calibration Before Sampling strategy. The experimental results illustrate that our proposed method can yield an average relative improvement of 14.3% across six classification tasks using three LLMs.

Metrics

53 Record Views

Details

Title: Towards Informative Few-Shot Prompt with Maximum Information Gain for In-Context Learning
Creators: Hongfu Liu
Ye Wang
Publication Details: arXiv.org
Identifiers: 9924292932601921
Academic Unit: Benjamin and Mae Volen National Center for Complex Systems; Michtom School of Computer Science
Language: English
Resource Type: Preprint

Towards Informative Few-Shot Prompt with Maximum Information Gain for In-Context Learning

Abstract

Metrics

Details

Brandeis University Social media