All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

New Generation LLMs Show Improved Character-Level Text Manipulation Capabilities

By

curioussquirrel

7mo ago· 7 min readenInsight

Summary

The article discusses how the latest generation of large language models (LLMs) like GPT-5 and Claude 4.5 have shown significant improvements in character-level text manipulation tasks, including character counting, character manipulation in sentences, and solving encoding and ciphers. This represents a notable advancement over previous LLM generations that struggled with granular character-level operations due to tokenization processes where text is encoded as tokens representing character clusters or full words rather than individual characters.

Key quotes

· 4 pulled
Surprisingly, the newest models were able to solve these kinds of tasks, unlike previous generations of LLMs.
LLMs handle individual characters poorly. This is due to all text being encoded as tokens via the LLM tokenizer and its vocabulary.
Individual tokens typically represent clusters of characters, sometimes even full words (especially in English and other common languages in the training dataset).
This makes any considerations on a more granular level than tokens fairly difficult, although LLMs have been capable of certain simple tasks (such as spelling out individual characters in a word) for a while.
Snippet from the RSS feed
Recently, I have been testing how well the newest generations of large language models (such as GPT-5 or Claude 4.5) handle natural language, specifically counting characters, manipulating characters in a sentences, or solving encoding and ciphers. Surpri

You might also wanna read