AI training data glitch causes nonsensical phrase "vegetative electron microscopy" to spread through scientific papers
By
Rayane El Masri
Baker's choice. Dense with flavour, light on filler.
Summary
Scientists discovered the nonsensical phrase "vegetative electron microscopy" appearing in published academic papers. The term originated as an error in AI training data and has become a "digital fossil" — a mistake preserved and reinforced by AI systems that is nearly impossible to remove from knowledge repositories. The article explores how AI systems can perpetuate errors once they enter the information ecosystem, drawing parallels between biological fossils trapped in rock and these persistent digital artefacts.
Key quotes
· 3 pulledThis phrase, which sounds technical but is actually nonsense, has become a 'digital fossil' – an error preserved and reinforced in artificial intelligence (AI) systems that is nearly impossible to remove from our knowledge repositories.
Like biological fossils trapped in rock, these digital artefacts may become permanent fixtures in our information ecosystem.
The case of vegetative electron microscopy offers a troubling glimpse into how AI systems can perpetuate an
You might also wanna read
Why supervised learning AI cannot make truly novel scientific discoveries
The article presents a recorded speech arguing that generative AI trained via supervised learning is fundamentally incapable of making truly
The Curious Case of AI Language Models' Obsession with Em-Dashes
The article explores the curious phenomenon of AI language models' excessive use of em-dashes in their writing. It examines why this punctua

Understanding AI-Generated 'Slop': The Problem of Low-Quality Automated Content
The article discusses the concept of 'slop' in AI-generated content, which refers to low-quality, spammy, or deceptive AI outputs that flood
awnist.com·1mo agoSemantic Ablation: How AI Writing Becomes Generic Through Information Erosion
The article introduces the concept of "semantic ablation" as a counterpart to AI hallucinations, describing it as the algorithmic erosion of

The Struggle to Label AI-Generated Content: Why Deepfake Detection Efforts Are Failing
The article examines the challenges of labeling AI-generated content (deepfakes) to protect shared reality, discussing how current labeling
The Challenges of Mistaking AI-Generated Text for Human Writing
The article discusses the discomfort and challenges of interacting with LLM-generated text, particularly when it is mistaken for human-writt
