All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

Unsupervised Algorithm for Language Model Fine-Tuning Introduced

By

kordlessagain

11mo ago· 2 min readenInsight

Summary

The article introduces an unsupervised algorithm, Internal Coherence Maximization (ICM), to fine-tune pretrained language models without external supervision. It shows that this method matches or outperforms training on human supervision in various tasks, including those where language models have superhuman capabilities. Additionally, the algorithm improves the training of advanced language models and assists in tasks like Haiku generation.

Key quotes

· 3 pulled
To steer pretrained language models for downstream tasks, today's post-training paradigm relies on humans to specify desired behaviors.
Our method matches the performance of training on golden supervision and outperforms training on crowdsourced human supervision.
On tasks where LMs' capabilities are strongly superhuman, our method can elicit those capabilities significantly better than training on human labels.
Snippet from the RSS feed
To steer pretrained language models for downstream tasks, today's post-training paradigm relies on humans to specify desired behaviors. However, for models with superhuman capabilities, it is difficult or impossible to get high-quality human supervision.

You might also wanna read