All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

Systematic Analysis Reveals Widespread Information Leakage in Preprint Archives

By

oldfuture

7mo ago· 2 min readenInsight

Summary

This research paper presents a systematic security analysis of preprint archives like arXiv, revealing significant information leakage risks. The study analyzed 1.2 TB of source data from 100,000 arXiv submissions using the LaTeXpOsEd framework, which combines pattern matching, logical filtering, and large language models. The analysis uncovered thousands of PII leaks, GPS-tagged files, exposed cloud credentials, confidential author communications, and conference submission credentials. The researchers urge immediate action to address these security gaps while releasing their methods for open science.

Key quotes

· 4 pulled
In the absence of sanitization, submissions may disclose sensitive information that adversaries can harvest using open-source intelligence.
Our analysis uncovered thousands of PII leaks, GPS-tagged EXIF files, publicly available Google Drive and Dropbox folders, editable private SharePoint links, exposed GitHub and Google credentials, and cloud API keys.
We also uncovered confidential author communications, internal disagreements, and conference submission credentials, exposing information that poses serious reputational risks to both researchers and institutions.
We urge the research community and repository operators to take immediate action to close these hidden security gaps.
Snippet from the RSS feed
The widespread use of preprint repositories such as arXiv has accelerated the communication of scientific results but also introduced overlooked security risks. Beyond PDFs, these platforms provide unrestricted access to original source materials, includi

You might also wanna read