All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

AI-Generated Metal Kernels Accelerate PyTorch Inference by 87% on Apple Devices

By

nserrino

9mo ago· 14 min readenInsight

Summary

Researchers developed AI-generated Metal kernels that accelerate PyTorch inference on Apple devices by 87% across 215 modules. The study demonstrates that frontier AI models can effectively write optimized GPU kernels, with some workloads achieving hundreds of times speed improvement over baseline implementations.

Key quotes

· 3 pulled
Our lab investigated whether frontier models can write optimized GPU kernels for Apple devices to speed up inference
our AI-generated Metal kernels were 1.87x faster across 215 PyTorch modules
some workloads running hundreds of times faster than baseline
Snippet from the RSS feed
Our lab investigated whether frontier models can write optimized GPU kernels for Apple devices to speed up inference. We found that they can: our AI-generated Metal kernels were 1.87x faster across 215 PyTorch modules.

You might also wanna read