All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

Burla demo: Analyzing 1.7M Airbnb photos and 50.7M reviews at scale using CLIP and Claude Haiku Vision

By

jmp1062

1mo ago· 2 min readen

Summary

A technical demonstration of processing all public Airbnb listings (119 cities, 4 quarterly snapshots) using CLIP to score 1.7M photos for suspicious content, with Claude Haiku Vision double-checking shortlists. Also scored 50.7M reviews and reranked the weirdest 12K. All processing was parallelized on Burla's platform using ~1.7K CPU workers and 20 A100 GPUs.

Key quotes

· 3 pulled
We scored 1.7M photos with CLIP (a model that turns an image into a vector you can compare to a text prompt), shortlisted the most suspicious ones, and had Claude Haiku Vision double-check each shortlist.
We also scored every review and reranked the weirdest 12K with Haiku.
Everything was parallelized on Burla, on a single dynamic cluster that scaled to ~1.7K CPU workers for photo download and CLIP, with 20 A100 GPUs.
Snippet from the RSS feed
119 cities x 4 quarterly snapshots of Inside Airbnb data. 1.7M photos through CLIP, the most suspicious shortlists double-checked with Claude Haiku Vision, 50.7M reviews scored through Haiku, all parallelized on Burla.

You might also wanna read