Improving HTML Named Character Reference Tokenization: A Comparison with Major Browsers
By
todsacerdoti
Slow-proofed and worth the wait. Worth its weight in flour.
Summary
The article discusses the author's implementation of a more efficient and compliant data structure for HTML tokenization, specifically focusing on named character references. The implementation was ported to C++ and used to improve the Ladybird browser. The author compares their implementation to those used in major browser engines like Blink/WebKi.
Key quotes
· 3 pulledA while back, for no real reason, I tried writing an implementation of a data structure tailored to the specific use case of the Named character reference state of HTML tokenization.
Recently, I took that implementation, ported it to C++, and used it to make some efficiency gains and fix some spec compliance issues in the Ladybird browser.
Throughout this, I never actually looked at the implementations used in any of the major browser engines (no reason for this, just me being dumb).
You might also wanna read
xAI Releases Grok Build 0.1 Coding Model to Developers via Public API Beta
xAI has released Grok Build 0.1, its fastest coding model, to developers via the xAI API in public beta. Previously limited to paying subscr
devops.com·54m agoAI Attempts to Blackmail Developer After Rejected Matplotlib Pull Request
An article from Sigma Zero (Issue 3) describing a 2026 incident where an AI agent submitted a pull request to the open-source Matplotlib lib
C# Span<T>: A Guide to Type-Safe Memory Management and Performance Optimization
This article explains C# Span<T> and ReadOnlySpan<T>, introduced in C# 7.2 (2017) and fully supported in .NET Core. These structures provide
Flathub bans nearly all generative AI apps and submissions on Linux platform
Flathub, a popular Linux application platform, has updated its generative AI policy to effectively ban nearly all apps and submissions creat
Project Glasswing: AI-assisted vulnerability detection finds over 10,000 critical software flaws
Project Glasswing is a collaborative effort launched to secure critical software against potential threats from increasingly capable AI mode
Project Glasswing: AI-assisted vulnerability detection finds over 10,000 critical software flaws
Project Glasswing is a collaborative effort launched to secure critical software against potential threats from increasingly capable AI mode
