All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

x-ray: Python Library for Detecting Improper PDF Redactions

By

rendx

5mo ago· 5 min readenCode

Summary

x-ray is a Python library developed by Free Law Project to detect improperly redacted PDF documents. The tool addresses the common problem where people incorrectly redact sensitive information by simply drawing black rectangles or highlights over text, which can still be selected and read. The project was motivated by years of encountering poorly redacted documents, including one that exposed Taylor Swift's personal phone number.

Key quotes

· 4 pulled
x-ray is a Python library for finding bad redactions in PDF documents.
An ongoing problem is that people fail to properly redact things. Instead of doing it the right way, they just draw a black rectangle or a black highlight on top of black text and call it a day.
Well, when that happens you just select the text under the rectangle, and you can read it again. Not great.
After witnessing this problem for years (our favorite is the doc that shared Taylor Swift's personal phone number), we decided it would be good to do something about it.
Snippet from the RSS feed
A tool to detect whether a PDF has a bad redaction - freelawproject/x-ray

You might also wanna read