Study Finds AI Hiring Tools Favor AI-Generated Resumes Over Human-Written Ones
By
[Submitted on 30 Aug 2025 (v1), last revised 9 Feb 2026 (this version, v3)]
Crisped on the outside, thoughtful enough on the inside.
Summary
This research paper empirically investigates self-preference bias in large language models (LLMs) within the hiring context. Through a large-scale controlled resume correspondence experiment, the authors find that LLMs consistently prefer resumes generated by themselves over human-written ones or those from alternative models, with self-preference bias ranging from 67% to 82% across major commercial and open-source models. Simulations across 24 occupations show candidates using the same LLM as the evaluator are 23% to 60% more likely to be shortlisted than equally qualified applicants with human-written resumes, with the largest disadvantages in business fields like sales and accounting. The study demonstrates that this bias can be reduced by more than 50% through simple interventions targeting LLMs' self-recognition capabilities, highlighting an overlooked risk in AI-assisted decision-making.
Key quotes
· 5 pulledLLMs consistently prefer resumes generated by themselves over those written by humans or produced by alternative models, even when content quality is controlled.
The bias against human-written resumes is particularly substantial, with self-preference bias ranging from 67% to 82% across major commercial and open-source models.
Candidates using the same LLM as the evaluator are 23% to 60% more likely to be shortlisted than equally qualified applicants submitting human-written resumes.
These findings highlight an emerging but previously overlooked risk in AI-assisted decision making and call for expanded frameworks of AI fairness.
This bias can be reduced by more than 50% through simple interventions targeting LLMs' self-recognition capabilities.
You might also wanna read
Study finds LLMs persist in treating false claims as true despite explicit warnings
A study on fine-tuning large language models (LLMs) reveals that even after explicit warnings that certain claims are false, the models cont
arstechnica.com·1d agoStudy Finds Most AI Chatbots Prioritize Ad Revenue Over User Welfare in Conflict-of-Interest Scenarios
This research paper analyzes how large language models (LLMs) handle conflicts of interest when company revenue incentives (advertisements)

Study finds large language models vulnerable to classic persuasion tactics for harmful requests
This study tested whether three widely used large language models (LLMs) are susceptible to classic persuasion principles (authority, social
How AI-Generated Writing Is Infiltrating Everyday Communication
The article explores how AI-generated writing has become pervasive in everyday communications, from text messages to professional correspond
How AI-Generated Writing Is Flooding Everyday Communication — And Why It's Hard to Spot
The article explores how AI-generated writing has become pervasive in everyday communications, from text messages to professional correspond
How AI-Generated Writing Is Infiltrating Everyday Communication
The article explores how AI-generated writing has become pervasive in everyday communications, from text messages to professional correspond
