The System Design Interview Process

Table of Contents +

Picture this. You sit down in a system design interview, feeling pretty good. Then the interviewer leans back and says, “Design YouTube for me.”

And just like that, your mind goes blank, right?
Where do you even start? The video player? The database? The servers?
It feels like they handed you a whole ocean and asked you to draw a map of it.

Here’s the good news. There’s a clear, repeatable way to handle this. You don’t need to memorize one perfect answer. You just need a process you can follow every single time, no matter what they throw at you. That’s exactly what we’ll build in this lesson.

🎯 Why There’s No Single Right Answer

The first thing to get straight is what the interviewer is actually after. Because it’s not what most people think:

They’re not checking whether you’ve memorized how YouTube is built.
They’re watching how you think when you face a big, fuzzy problem.
They want to see you break a huge problem into smaller pieces and reason through each one.

So a system design question is really a conversation, not a quiz. Here’s what they’re quietly grading you on:

Can you ask good questions instead of guessing?
Can you explain your choices out loud so they follow your thinking?
Can you spot where your design might break, and say how you’d fix it?

That’s why two people can give totally different designs and both pass. The “right answer” is a solid, well-explained design with sensible trade-offs, not one secret blueprint.

It's a conversation, not a test

Treat the interviewer as a teammate you’re designing with, not an examiner watching you sweat. Talk to them. Ask them things. The more you bring them along, the better it goes.

🗺️ The Process at a Glance

Before we dig into each step, here’s the whole flow. Every system design interview, whatever the topic, follows roughly this same path.

Keep this map in your head. When you feel lost mid-interview, just ask yourself “which box am I in?” and move to the next one. Now let’s walk through each step.

1️⃣ Clarify the Requirements

This is the step everyone wants to skip, and skipping it is the fastest way to fail. So slow down here:

Do not start drawing boxes the second they finish talking.
“Design YouTube” could mean a hundred different things. Your first job is to shrink it down to something you can actually build.
You do that by asking questions, like a real engineer would on day one of a project.

The questions you ask fall into two buckets. First, the functional requirements. These are the features, what the system actually does for the user:

Can users upload videos? Can they watch them?
Do we need comments, likes, search, recommendations?
Are we building all of it, or just the core “upload and watch” flow?

Then the non-functional requirements. These aren’t features, they’re qualities the system must have, like how fast or reliable it is:

How many users are we expecting? A thousand, or a billion?
Is it okay if a video takes a moment to appear after upload, or must it be instant?
How important is it that the site never goes down?

After a few questions, agree on the scope out loud. Say something like, “Okay, so let’s focus on uploading and watching videos for a large number of users, and skip comments for now.” That one sentence keeps you and the interviewer pointed at the same target.

Don't assume, ask

If you assume the scope and start building, you might design the wrong system for twenty minutes. A single clarifying question up front saves you from that. When in doubt, ask.

2️⃣ Estimate the Scale

Now that you know what you’re building, get a rough feel for how big it is. This is called back-of-the-envelope estimation, which just means quick math you could do on the back of an envelope, no calculator needed:

The point is not to be exact. Nobody expects perfect numbers.
The point is to figure out whether you’re building for a small crowd or a massive one, because that changes everything.
These rough numbers guide your real decisions later, like whether you need a cache or many database copies.

Keep the math simple. A few things are worth a quick estimate:

Users. Roughly how many people use it per day? This sets the overall size.
Reads vs writes. Are people mostly reading data or writing it? On YouTube, way more people watch videos than upload them, so reads massively outnumber writes. That’s a big hint, it tells you to optimize for fast reading.
Storage. Roughly how much data piles up over time? If millions of videos get uploaded, you’ll need serious storage, and that shapes your design.

Here’s the thing, don’t get lost in the numbers. Round everything to easy figures, say it out loud, and move on. The estimate is a compass, not a destination.

Reads vs writes, in one line

A read is when the system fetches data to show you, like watching a video. A write is when it stores new data, like uploading one. Most big systems read far more than they write, and knowing the ratio drives a lot of your design.

3️⃣ Define the API

Before you draw any servers, nail down what the system actually offers. The cleanest way to do that is to list the main API endpoints. An API is just the set of requests a client can send to your system, the menu of things it can ask for:

This step turns fuzzy features into concrete actions.
Each endpoint maps to one thing a user can do.
It keeps everyone honest about what the system does and doesn’t do.

For our trimmed-down YouTube, the core API might look like this.

uploadVideo(userId, videoFile, title)   → returns videoId
watchVideo(videoId)                      → returns video stream + details
searchVideos(query)                      → returns list of videos

Let’s read what we just wrote:

uploadVideo is the write path, a user sends a video and we hand back an ID for it.
watchVideo is the read path, the one most people will hit, so it has to be fast.
searchVideos lets people find things, which hints we’ll need some kind of search later.

See how just listing three endpoints already shapes the whole design? Now you know the system has a heavy read path and a lighter write path, and you’ve barely drawn anything yet.

4️⃣ Draw the High-Level Design

Now you finally get to draw the boxes. This is the high-level design, a simple diagram of the main pieces and how a request flows through them. Start basic, you can always add detail later:

Don’t try to draw the perfect, final system in one go.
Put down the core components first, then walk a request through them out loud.
Keep talking as you draw, so the interviewer follows your reasoning.

Most web systems share the same handful of building blocks. Here’s the cast:

Client. The user’s phone or browser, where requests start.
Load balancer. A traffic cop that spreads incoming requests across many servers so no single one gets overwhelmed.
App servers. The machines that run your actual logic, like handling an upload or fetching a video.
Database. The organized store that holds your data, like video details and user info.
Cache. A small, super-fast store that keeps popular data handy so you don’t hit the slow database every time.

Here’s how they connect for a typical request.

Now walk the main flow out loud. For watching a video: the client asks the load balancer, which picks an app server, which checks the cache first for the video details, and only goes to the database if the cache doesn’t have them. Narrating the path like this is half the interview.

5️⃣ Deep Dive

Your high-level design is on the board. Now the interviewer usually says, “Tell me more about X.” This is the deep dive, where you zoom into one or two pieces and get specific:

You won’t have time to go deep on everything, and that’s fine.
Pick the parts that matter most for this system, or the part the interviewer points at.
Going deep on one thing shows real understanding, far more than staying shallow on ten things.

What you choose to dive into depends on the system. Some common deep-dive topics:

The database choice. Will you use a SQL database, which keeps data in strict tables with clear relationships, or a NoSQL one, which is more flexible and often scales out more easily? Say why you picked one for this case.
Caching. What exactly do you cache, and when do you clear it out? For YouTube, caching popular videos’ details makes a lot of sense, since the same hits get watched over and over.
A tricky component. For video, storing and streaming the actual files is the hard part. You might mention storing videos in a special storage service and serving them through a CDN, a network of servers spread worldwide that keeps copies close to users.

The key move here is to justify every choice. Don’t just say “I’ll use NoSQL.” Say “I’ll use NoSQL here because our data is simple and we need it to scale to huge numbers easily.” The reason is what they’re listening for.

6️⃣ Discuss Trade-offs and Bottlenecks

No design is perfect, and good engineers know it. So in this step you point out where your own system would struggle at scale, then say how you’d fix it. A bottleneck is the part that gets overwhelmed first as traffic grows, like a narrow doorway in a crowded hall:

Showing you can find your own weak spots is a strong signal.
For each weak spot, name a fix, and be honest that the fix has a cost too.
There’s no free lunch in system design, every fix trades one thing for another.

Here are the usual tools you reach for, and what each one buys you:

Caching. Keep hot data in fast memory so the database isn’t overloaded. The trade-off is that cached data can go stale, showing slightly old info.
Replicas. Keep extra copies of the database so reads can spread across them. Great for read-heavy systems like ours, but keeping copies in sync adds complexity.
Sharding. Split one giant database into smaller chunks, each holding part of the data, so no single machine holds it all. It scales well but makes some queries trickier.
Queues. Put slow jobs, like processing an uploaded video, into a waiting line handled in the background, so the user isn’t stuck waiting. It adds moving parts, but keeps the app fast.

The pattern is always the same. Name the bottleneck, name the fix, name the cost. Do that a couple of times and you’ve shown exactly the kind of thinking they came to see.

Say the cost out loud

Anyone can say “add a cache.” What sets you apart is adding “but then I have to handle stale data.” Naming the downside of your own fix is what makes you sound senior.

⏱️ A Rough Time Budget

A system design interview is usually around forty-five minutes, and running out of time is a real risk. So keep a rough clock in your head. Here’s a sensible split.

Step	Time	What you’re doing
Clarify requirements	~5 min	Ask questions, pin the scope
Estimate the scale	~5 min	Quick math on users, reads, storage
Define the API	~5 min	List the main endpoints
High-level design	~10 min	Draw the boxes, walk the flow
Deep dive	~10 min	Go deep on one or two pieces
Trade-offs & wrap up	~10 min	Bottlenecks, fixes, summary

Don’t treat these as hard rules, treat them as guardrails. If you notice you’ve spent twenty minutes still clarifying requirements, that’s your signal to move on. Managing the clock is part of the test.

💡 Tips That Actually Help

A few habits make a huge difference once you’re in the room. Keep these close:

Think out loud. Silence is your enemy here. Even when you’re unsure, narrate your thinking so the interviewer can follow and nudge you.
Communicate constantly. Check in with them. Ask “does that make sense?” or “should I go deeper here?” It turns a monologue into a conversation.
Start simple, then scale. Get a basic working design first, then add caching, replicas, and the rest. A simple thing that works beats a fancy thing that’s half-drawn.
Manage your time. Glance at the clock now and then. Don’t blow your whole budget on one corner of the system.
Remember it’s a conversation. The interviewer often drops hints. Listen for them, and follow where they lead.

⚠️ Common Mistakes

Most people who stumble in these interviews trip on the same handful of things. Watch out for these:

Jumping straight to a solution. Drawing boxes before asking questions means you might solve the wrong problem. Clarify first.
Going silent. If you stop talking and just think, the interviewer can’t see your reasoning. To them, silence looks like a blank. Keep narrating.
Over-engineering. Throwing in every fancy component “just in case” makes the design messy and hard to follow. Add complexity only when a real need calls for it.
Ignoring the requirements. Designing features nobody asked for wastes your precious time. Stick to the scope you agreed on.
Losing track of time. Spending half the interview on one piece means you never get to the rest. Pace yourself.

🛠️ Practice Challenge

Take a simple prompt, “Design a link-in-bio page like a simple Linktree, where a user has one page that lists their links.” Walk through every step in the same order we just learned, working each part below.

Clarify. What questions would you ask?

Show the answer

Estimate. Is this read-heavy or write-heavy?

Show the answer

API. What endpoints do you need?

Show the answer

High-level design. Draw client, load balancer, app servers, database, cache. Walk the “view a page” flow.

Show the answer

Deep dive. Pick one piece, like caching, since pages are read far more than they’re edited.

Show the answer

Trade-offs. Where would it strain if one page went viral, and how would you handle the sudden traffic?

Show the answer

🧩 What You’ve Learned

You now have a repeatable game plan for any system design question. Here’s what you’ve picked up.

✅ The interview tests your thinking and communication, not a memorized answer.
✅ Always clarify requirements first, both functional and non-functional, and pin the scope.
✅ Do quick estimation on users, reads vs writes, and storage to guide your choices.
✅ Define the API to turn fuzzy features into concrete actions.
✅ Draw a simple high-level design, then deep dive on one or two key pieces.
✅ Discuss trade-offs and bottlenecks honestly, naming each fix and its cost.
✅ Watch the clock and keep talking the whole way through.

Check Your Knowledge

Test what you learned. Pick an answer for each question, then click Check.

🚀 What’s Next?

You’ve got the process. Now the best way to lock it in is to use it on a real problem, end to end.

Design a URL Shortener walks the full process on a classic, beginner-friendly prompt.
What Happens When You Type a URL gives you the network fundamentals that come up in almost every system design interview.

Run the process a few times on small designs, and “design YouTube” will stop feeling like an ocean and start feeling like a checklist you can work through.

Previous Design a Task Board (Jira / Trello) Next Functional vs Non-Functional Requirements

Share & Connect

Share on LinkedIn