Introduction to Memory Subsystem Optimization Blog Series
By
mfiguiere
Front-window bakery material. Catches the eye, delivers the goods.
Summary
This blog post introduces a series of 18 articles focused on memory subsystem optimizations for software performance. The author explains that these optimizations aim to make software faster by better utilizing the memory subsystem, with most techniques applicable to software working with large datasets, though some apply to any data size. The post serves as an introduction to the blog series and includes contact information for performance consulting and training services.
Key quotes
· 4 pulledIn this blog I wrote 18 blog posts about memory subsystem optimizations.
By memory subsystem optimizations, I mean optimizations that aim at making software faster by better using the memory subsystem.
Most of them are applicable to software that works with large datasets; but some of them are applicable to software that works with any data regardless of its size.
Do you need to discuss a performance problem in your project? Or maybe you want a vectorization training for yourself or your team?
You might also wanna read
Performance Optimization: How a 185-Microsecond Type Hint Boosted Throughput 13× in Clojure Roughtime Implementation
The article describes a performance optimization in a Clojure implementation of the Roughtime protocol, where a seemingly trivial change to

Research-Driven Coding Agents Improve llama.cpp Performance with Literature Search Phase
The article discusses how coding agents that incorporate a research phase—reading academic papers and studying competing projects—before wri
Company Uses AI to Rewrite JSONata in Go, Achieving 1,000x Speedup and $500K Annual Savings
A company used AI to rewrite JSONata, a JSON transformation language, as a pure-Go library called gnata in just seven hours with $400 in AI
Optimizing the asin() Function: A Technical Follow-up on Performance Improvements
The author revisits their previous work on optimizing the asin() (arcsine) function in C/C++ after receiving feedback from online communitie
Learning from Redundant Optimization: The asin() Function Case Study
The article discusses the author's realization that a faster implementation of the asin() (arcsine) function was already available in standa
Optimizing C Software Performance with Dynamic CPU Feature Detection
The article discusses techniques for optimizing C software performance through dynamic feature detection, focusing on x86-64 processors. It
