🍔🧠 How DropBox Built Multimedia Search for 700M Users

PLUS: Design a Rate Limiter ⏳, What is Cache Warming ♨️, Instagram Staff Career Gold 🏆

Alexandre Zajac

Aug 11, 2025

Today’s issue of Hungry Minds is brought to you by:

Happy Monday! ☀️

Welcome to the 489 new hungry minds who have joined us since last Monday!

If you aren’t subscribed yet, join smart, curious, and hungry folks by subscribing below.

📚 Software Engineering Articles

This product finally solved DevOPS
Learn how a rate limiter works with this practical system design guide
Build the world's fastest VIN decoder from scratch
Smart cache warming strategies to boost application performance
How to handle duplicates in retry mechanisms
N+1 query problem explained and solved in 4 minutes

🗞️ Tech and AI Trends

Reddit transforms into a search engine powerhouse
Anthropic blocks OpenAI from accessing Claude models
Google launches Gemini Deep Think for parallel reasoning

👨🏻‍💻 Coding Tip

Use partial indexes in PostgreSQL to optimize query performance and reduce storage

Time-to-digest: 5 minutes

Using Jenkins? Find a better workflow!

Bitbucket Pipelines is not just a tool, it’s a DevOps force multiplier.

Unlock cloud-native CI/CD with 70% reduction in operational costs, no installation or plugins to manage, and efficient autoscaling.

Let engineers focus on delivering value – not maintaining Jenkins.

Test BitBucket Pipelines today

How Dropbox Built Multimedia Search For Dash 🫨

Searching through thousands of unlabeled images and videos is a nightmare for knowledge workers. Dropbox tackled this by building a scalable multimedia search system that makes finding visual content as easy as finding text documents in their universal search product, Dash.

The challenge: Build a cost-effective system that can process and search through massive amounts of media files while maintaining low latency and high relevance, despite limited metadata and heavy compute requirements.

Implementation highlights:

Metadata-first indexing: Extract lightweight features first (file paths, EXIF data) to enable basic search with minimal overhead
Just-in-time previews: Generate previews on-demand rather than upfront to optimize storage and compute costs
Location-aware queries: Built custom geocoding logic to enable searching by location where photos were taken
Parallel processing: Run preview generation, ranking, and permission checks concurrently to minimize latency
Smart caching: Store previews for 30 days and leverage existing Dropbox infrastructure for efficiency

Results and learnings:

Scalable performance: Successfully processes 97% of media files while maintaining responsive search
Cost optimization: Achieved 3x storage efficiency for images and 13x for video compared to raw storage
Developer velocity: Parallel workflows and clear API boundaries enabled faster team execution

This solution shows how thoughtful architecture decisions around caching, parallel processing, and metadata extraction can make multimedia search both performant and cost-effective.

Design a Rate Limiter

Written by

Hello Interview

Design a Rate Limiter

Read the full breakdown on hellointerview.com…

10 days ago · 26 likes · Hello Interview

Bitbucket Pipelines

Bitbucket Pipelines brings continuous delivery to Bitbucket Cloud, empowering teams with full branching to deployment visibility and faster feedback loops

How to Optimize Performance with Cache Warming?

Written by

Sid

The Scalable Thread

How to Optimize Performance with Cache Warming?

What is Cache Warming…

15 days ago · 14 likes · Sid

How we built the world's fastest VIN decoder

Nine VS Code (or Cursor) Extensions That Make My Daily Work Much Easier

Written by

Petar Ivanov

The T-Shaped Dev

Nine VS Code (or Cursor) Extensions That Make My Daily Work Much Easier

As a Software Engineer, you read, update, and write code daily…

14 days ago · 41 likes · 2 comments · Petar Ivanov

Write a postmortem like a Senior Engineer

Written by

Fran Soto

Strategize Your Career

Write a postmortem like a Senior Engineer

Most people treat a postmortem as an exercise to find the root cause. Once they find it, it's finished…

7 days ago · 19 likes · Fran Soto

Instagram Staff (IC6) Promo Despite 10 Team Switches in 9 Years (Career Story)

Written by

Ryan Peterman

The Developing Dev

Instagram Staff (IC6) Promo Despite 10 Team Switches in 9 Years (Career Story)

Sash Zats grew to be a Staff Engineer (IC6) at Meta despite switching teams 10 times in 9 years. His career journey was a series of jumps to exciting projects and letting career growth happen as a byproduct. I interviewed him to show you how team switches can play out…

Listen now

8 days ago · 13 likes · Ryan Peterman

Career Frameworks 🪜 — part 2

Written by

Nicola Ballotta

and

Luca Rossi

Hybrid Hacker

Career Frameworks 🪜 — part 2

Hey! Today we are back with the second half of this two-part series about career frameworks…

10 days ago · 8 likes · Luca Rossi

ESSENTIAL (testing gone wrong)
The #1 Mistake in Unit Testing (and How to Fix It)

ESSENTIAL (retry, retry, oops)
Retries Have an Evil Twin: Duplicates

ESSENTIAL (metrics go brrr)
Essential System Design Performance Metrics

ARTICLE (vim-wizardry unlocked)
Why I Switched to Vim Keybindings

GITHUB REPO (trade like a boss)
A high-performance algorithmic trading platform and event-driven backtester

ARTICLE (database drama solved)
What is the N+1 Query Problem and How to Solve it?

ARTICLE (claude’s coding spree)
6 Weeks of Claude Code

ARTICLE (chrome goes zoom)
Making Sense of the Performance Extensibility API

ARTICLE (AI hype debunked)
No, AI is not Making Engineers 10x as Productive

ARTICLE (flashy list magic)
FlashList v2: A Ground-Up Rewrite for React Native's New Architecture

Want to reach 190,000+ engineers?

Let’s work together! Whether it’s your product, service, or event, we’d love to help you connect with this awesome community.

WORK WITH US

🔍 Reddit Aims to Unify Search Interface in Bid to Rival Major Search Engines (2 min)

Brief: Reddit is working on a unified search interface as part of its strategy to compete with dominant search engines, potentially leveraging its vast community-driven content for better results.

🤖 Anthropic Blocks OpenAI’s Access to Claude Models in Legal Dispute (2 min)

Brief: Anthropic has revoked OpenAI’s access to its Claude AI models amid an escalating legal dispute, leaving OpenAI to seek alternatives for its AI research and services.

🤖 Google Unveils Gemini DeepThink AI for Parallel Reasoning (2 min)

Brief: Google introduces Gemini DeepThink, an AI model designed to evaluate multiple ideas simultaneously, enhancing reasoning and problem-solving efficiency.

🤖 OpenAI Launches Customizable Open-Weight AI Models (2 min)

Brief: OpenAI releases gpt-oss-120b and gpt-oss-20b, two open-weight models under Apache 2.0 license, designed for agentic tasks, customization, and commercial use with full chain-of-thought reasoning.

🤖 GPT-5 Arrives: Faster, Cheaper, and More Reliable Than Ever (4 min)

Brief: OpenAI's GPT-5 offers three model tiers (regular, mini, nano) with competitive pricing and reduced hallucinations, making it a strong contender in the AI race.

🤖 OpenAI’s GPT-OSS Models Now Available on AWS (4 min)

Brief: AWS adds OpenAI’s open-weight GPT-OSS-120B and GPT-OSS-20B models to Bedrock and SageMaker, enabling developers to build AI applications with full infrastructure control and advanced reasoning capabilities.

This week’s coding challenge:

Build Your Own Interpreter

This week’s tip:

Create efficient database indices by using partial indexes with WHERE clauses to significantly reduce index size and improve query performance on specific data subsets. PostgreSQL's partial indices allow you to create smaller, more focused indexes that only include rows matching certain conditions.

Wen?

Large tables with skewed data: When certain status values represent a small, frequently-queried subset of all rows.
Time-series data management: Index recent data more aggressively while maintaining lighter indices for historical data.
Multi-tenant systems: Create focused indices for high-traffic customers while keeping the overall index footprint manageable.

“The harder you work, the harder it is to surrender.”
Vince Lombardi

That’s it for today! ☀️

Enjoyed this issue? Send it to your friends here to sign up, or share it on Twitter!

If you want to submit a section to the newsletter or tell us what you think about today’s issue, reply to this email or DM me on Twitter! 🐦

Thanks for spending part of your Monday morning with Hungry Minds.
See you in a week — Alex.

Icons by Icons8.

*I may earn a commission if you get a subscription through the links marked with “aff.” (at no extra cost to you).

🍔🧠 How DropBox Built Multimedia Search for 700M Users

PLUS: Design a Rate Limiter ⏳, What is Cache Warming ♨️, Instagram Staff Career Gold 🏆

Discussion about this post