Intel and AMD release ACE CPU extensions specification for efficient AI processing on x86 processors
By
Bruno Ferreira
Summary
Intel and AMD have jointly released the full specification for new ACE (Advanced CPU Extensions) CPU extensions designed to make AI model inference more efficient on x86 processors. The extensions introduce a new instruction set that improves matrix multiplication operations, making them more power-efficient and density-efficient. This allows smaller AI models and latency-sensitive operations to run on CPUs rather than requiring GPU offloading, which avoids data transfer overhead. The development is particularly relevant for systems without dedicated GPUs or with limited integrated graphics capabilities.
Source
Key quotes
· 4 pulledMost all you hear about 'running an AI model' involves a GPU of some sort, but not every AI task is suited to that hardware.
Smaller models or single-user latency-sensitive operations can benefit from running on the CPU instead, as it avoids the overhead of shuffling data to and from the GPU.
There are also many situations where there is no GPU available to begin with, or it's a meek integrated affair with limited capabilities.
Intel and AMD have recently released the full specification for the ACE CPU extensions that make it easier and more power-efficient to run AI models on x86 CPUs.
You might also wanna read
Intel Announces Xeon 6+ Processors and AI Platform Expansions for Data Center and Edge
Intel announced new data center advancements at Computex Taipei, including the Xeon 6+ processor with 128 performance cores and 504MB of cac
AI-Driven Approach for Portable GPU Kernels in High-Performance Computing
This academic paper from North Carolina State University researchers presents an approach to leveraging AI ecosystems for creating portable
Developing a BLAS Library for the AMD AI Engine [pdf]

AMD Partners with OpenAI to Supply AI Processors in Challenge to Nvidia
AMD has announced a five-year partnership with OpenAI to supply six gigawatts worth of processors for AI data centers, challenging Nvidia's
Exploring Advanced Matrix Multiplication Engines for AI Applications
The article discusses the importance of matrix multiplication in modern computing, particularly in AI applications like neural networks and
Intel develops AI chip that bypasses expensive memory technology used by Nvidia
Intel has developed a new AI chip that avoids using the expensive high-bandwidth memory (HBM) that Nvidia's GPUs rely on. The article discus
