All Topics
All Topics
Technology
Technology
AI
AI
Business
Business
Entertainment
Entertainment
News
News
Programming
Programming
Security
Security
Science
Science
Design
Design
Environment
Environment
Finance
Finance
Crypto
Crypto
Politics
Politics
Sports
Sports
Education
Education
Gaming
Gaming
Art
Art
Music
Music
Health
Health
Books
Books
Food
Food
Travel
Travel
Personal
Personal
Bluesky
Twitter

Using Diffusion Models to Visualize What Self-Supervised Neural Networks Actually Learn

By

[Submitted on 16 Dec 2021 (v1), last revised 16 Aug 2022 (this version, v2)]

2d ago· 2 min readenInsight

Summary

This paper introduces the use of Representation Conditional Diffusion Models (RCDM) to visualize what self-supervised learning (SSL) models actually learn. The authors demonstrate that RCDMs can generate high-quality samples faithful to the representations they condition on. Using this visualization technique, they make four key findings: (1) SSL backbone representations are NOT invariant to data augmentations they were trained with, debunking a common misconception; (2) SSL post-projector embeddings do appear invariant to augmentations and other symmetries; (3) SSL representations are more robust to small adversarial perturbations than supervised representations; and (4) SSL representations have an inherent structure that enables image manipulation through RCDM visualization.

Source

bskyUsing Diffusion Models to Visualize What Self-Supervised Neural Networks Actually Learnarxiv.org

Key quotes

· 5 pulled
Discovering what is learned by neural networks remains a challenge.
In self-supervised learning, classification is the most common task used to evaluate how good a representation is.
SSL (backbone) representation are not invariant to the data augmentations they were trained with -- thus debunking an often restated but mistaken belief.
SSL post-projector embeddings appear indeed invariant to these data augmentation, along with many other data symmetries.
SSL-trained representations exhibit an inherent structure that can be explored thanks to RCDM visualization and enables image manipulation.
Snippet from the RSS feed
Discovering what is learned by neural networks remains a challenge. In self-supervised learning, classification is the most common task used to evaluate how good a representation is. However, relying only on such downstream task can limit our understandin

You might also wanna read

Comments

Sign in to join the conversation.

No comments yet. Be the first.