Skip to main content
AI Socratic
February 2026
Random

Recreating MIME Files from Redacted PDFs

image.png

Do you remember the Vesuvius Challenge? Ancient scrolls, carbonized by a volcano, unreadable for centuries, were decoded using ML. This is the same, but with screenshots of files that have been redacted (poorly). One of the flawed redaction includes screenshots of emails with attachment in MIME format, which means the file is in text form (base64) at the bottom of the email. It's possible to recreate that attachment by copy/pasting the text and renaming it into pdf, png, or other.

The challenge is that they used Courier New font so 1 and L look the same. One engineer experimented around this with OCR first and then training a CNN.

Sources: tweet, blog, Vesuvius Challenge

Federico UlfoFederico Ulfo
Random

Face Recognition in a Few Lines of Python

An engineer coded a face recognition app to recognize people who look similar from different pictures. The code is surprisingly easy, just use

from deepface import DeepFace
from sklearn.metrics.pairwise import cosine_distances
import sys

## Faces Cosine Distance 
faces = DeepFace.extract_faces(sys.argv[1], enforce_detection=False)

## get embedding for first two faces
e0 = DeepFace.represent(faces[0]["face"], model_name="ArcFace")[0]["embedding"]
e1 = DeepFace.represent(faces[1]["face"], model_name="ArcFace")[0]["embedding"]

## compute cosine distance
print(cosine_distances([e0], [e1])[0][0])

Sources: tweet

Federico UlfoFederico Ulfo
← NewerFebruary 2026Older →

Search

Search across events, members, and blog posts