PICO: A Practical Learned Image Codec Optimized for Human Visual Perception
By
ksec
Toasted to a respectable shade. No regrets, no crumbs left.
Summary
The article introduces PICO (Perceptual Image Codec), a learned image compression codec optimized for the human visual system. It was developed through a comprehensive study of modeling choices and millions of model configurations, jointly optimizing for perceptual quality and on-device runtime. Based on large-scale subjective user studies, PICO achieves 2.3-3× bitrate savings compared to traditional codecs like AV1, AV2, VVC, ECM, and JPEG-AI, with an additional 20-40% bitrate savings.
Key quotes
· 2 pulledWe introduce PICO (Perceptual Image Codec) — the first learned codec that is both practical, and optimized directly for the human visual system.
PICO provides 2.3-3× bitrate savings against AV1, AV2, VVC, ECM and JPEG-AI, and 20-40% bitrate savings ag
You might also wanna read
PromptEmbedder: A Dual-LLM Framework for Efficient, Architecture-Agnostic Text Embedding
The article presents PromptEmbedder, a novel dual-LLM framework for efficient and transferable text embedding. It addresses the bottleneck o
Unified Framework for Variational Quantum Knowledge Graph Embeddings on NISQ Devices
This paper introduces a unified framework for variational quantum algorithms (VQAs) applied to knowledge graph embeddings on near-term NISQ
Contextual Rollout Bandits: A Neural Scheduling Framework for Efficient Reinforcement Learning with Verifiable Rewards
This paper introduces Contextual Rollout Bandits, a novel framework for Reinforcement Learning with Verifiable Rewards (RLVR) that addresses
Eureka: An LLM-Driven Framework for Automated Feature Engineering in Enterprise AI
This paper presents Eureka, an LLM-driven framework for automated feature engineering in machine learning. It treats feature engineering as
Sleep-Like Consolidation Mechanism Improves Long-Context Performance in Transformer Language Models
This paper proposes a sleep-like consolidation mechanism for transformer-based large language models to address the poor scaling of attentio
Multi-Stream LLMs: A Parallel Architecture to Overcome Single-Stream Bottlenecks in Language Models
This paper introduces "Multi-Stream LLMs," a novel approach to overcoming the limitations of current language model architectures that rely
