All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

RICP: A Teacher-Student Framework for Retrieved In-Context Principles from Mistakes in LLMs

By

@ai-firehose.column.social

4d ago· 2 min readenInsight

Summary

This paper introduces Retrieved In-Context Principles (RICP), a novel teacher-student framework for improving Large Language Models (LLMs) through in-context learning. Unlike existing approaches that lack customization and error coverage, RICP has the teacher model analyze mistakes from the student model, cluster them by underlying reasons to create task-level principles, and retrieve the most relevant mistakes per question for customized guidance. The framework is orthogonal to existing prompting methods and requires no teacher intervention during inference. Experimental results across seven reasoning benchmarks show RICP enhances performance when applied to various prompting strategies.

Key quotes

· 5 pulled
In RICP, the teacher model analyzes mistakes from the student model to generate reasons and insights for preventing similar mistakes.
These mistakes are clustered based on their underlying reasons for developing task-level principles, enhancing the error coverage of principles.
During inference, the most relevant mistakes for each question are retrieved to create question-level principles, improving the customization of the provided guidance.
RICP is orthogonal to existing prompting methods and does not require intervention from the teacher model during inference.
Experimental results across seven reasoning benchmarks reveal that RICP effectively enhances performance when applied to various prompting strategies.
Snippet from the RSS feed
In-context learning (ICL) has been instrumental in adapting Large Language Models (LLMs) to downstream tasks using correct input-output examples. Recent advances have attempted to improve model performance through principles derived from mistakes, yet the

You might also wanna read