Interview Strategy
A guide for mid-to-senior (L4/L5) ML Engineer roles, focusing on system design, project deep-dives, and technical strategy.
The Interview Landscape (Engineer 2/L4)
At the Engineer 2 / Senior level, interviews shift from "What is X?" to "Why X and not Y?".
The Standard Loop:
Recruiter Screen: 30 min high-level background.
Coding / DSA (1-2 Rounds): LeetCode Medium/Hard. Focus on strings, graphs, and dynamic programming.
ML Coding: Implementing an algorithm from scratch (e.g., K-Means, LR) or data manipulation (Pandas/NumPy); see the K-Means sketch after this list.
ML System Design (The "Closer"): 45-60 min of candidate-led design discussion.
ML Theory Depth: "What-if" scenarios probing fundamentals (e.g., probability, bias-variance, regularization).
Behavioral / Project Deep Dive: Deep dive into 1-2 major projects using the STAR method.
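For the ML coding round, it pays to have something like K-Means memorized cold. A minimal NumPy sketch (the function name and defaults are illustrative, not a canonical answer):

```python
import numpy as np

def kmeans(X, k, n_iters=100, seed=0):
    """From-scratch K-Means in plain NumPy."""
    rng = np.random.default_rng(seed)
    # Initialize centroids by sampling k distinct data points.
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iters):
        # Assignment step: nearest centroid by squared L2 distance.
        dists = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        # Update step: each centroid becomes the mean of its points.
        new_centroids = np.array([
            X[labels == j].mean(axis=0) if (labels == j).any() else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break  # converged
        centroids = new_centroids
    return centroids, labels
```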
ML System Design Framework
This is the most critical round. Use this 5-step framework to lead the discussion.
1. Problem Scoping & Requirements
Goal: What is the business metric? (e.g., Click-Through Rate (CTR), Revenue, Latency).
Constraints: Max latency (e.g., <100ms), throughput (QPS), scale (millions of users).
Type: Is this search, recommendation, ranking, or classification?
2. Data & Feature Engineering
Sources: Logs, user profile, item metadata.
Labeling: Explicit (ratings) vs. Implicit (clicks, watch time). Plan for label delay (e.g., a conversion logged hours after the impression).
Features: Categorical (one-hot vs. embeddings), Numerical (scaling), Text (BERT/Word2Vec), Temporal (recency); see the encoding sketch after this list.
Storage: Feature store considerations.
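To make the feature bullet concrete, a small Pandas/scikit-learn sketch covering the three common cases (column names and the bucket count are made up for illustration):

```python
import hashlib
import pandas as pd
from sklearn.preprocessing import StandardScaler

df = pd.DataFrame({
    "device": ["ios", "android", "web", "ios"],   # low-cardinality categorical
    "user_id": ["u1", "u2", "u3", "u1"],          # high-cardinality categorical
    "watch_minutes": [12.0, 3.5, 40.0, 7.2],      # numerical
})

# Low-cardinality categorical: one-hot encode directly.
one_hot = pd.get_dummies(df["device"], prefix="device")

# High-cardinality categorical: hash to a bounded integer bucket,
# which becomes an index into an embedding table downstream.
def stable_bucket(s: str, n_buckets: int = 100_000) -> int:
    return int(hashlib.md5(s.encode()).hexdigest(), 16) % n_buckets

df["user_bucket"] = df["user_id"].map(stable_bucket)

# Numerical: standardize to zero mean / unit variance.
df["watch_scaled"] = StandardScaler().fit_transform(df[["watch_minutes"]]).ravel()
```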
3. Modeling
Baseline: Start simple (e.g., Logistic Regression or XGBoost); see the baseline sketch after this list.
Advanced: Deep Learning (e.g., DeepFM for ranking, Transformers).
Trade-offs: Model size vs. Accuracy vs. Inference time.
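A minimal baseline sketch using scikit-learn on synthetic data (the dataset is a stand-in; the point is to establish the AUC number every later model must beat):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the real feature matrix and labels.
X, y = make_classification(n_samples=10_000, n_features=20, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)

baseline = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
auc = roc_auc_score(y_te, baseline.predict_proba(X_te)[:, 1])
print(f"Baseline AUC: {auc:.3f}")  # the bar every fancier model must clear
```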
4. Evaluation (Offline & Online)
Offline: AUC, Log-loss, Recall@K, Precision@K, F1 (Recall@K is computed in the sketch after this list).
Online: A/B Testing, Interleaving.
Slicing: Check performance for specific user segments or item categories.
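Recall@K is easy to get subtly wrong under time pressure, so a tiny reference implementation (the item IDs are invented):

```python
def recall_at_k(relevant, ranked, k):
    """Fraction of the user's relevant items that appear in the top-k."""
    top_k = set(ranked[:k])
    return len(top_k & set(relevant)) / max(len(relevant), 1)

# Illustrative: items the user engaged with vs. the model's ranked output.
relevant = ["v3", "v7", "v9"]
ranked = ["v7", "v1", "v3", "v5", "v2", "v9"]
print(recall_at_k(relevant, ranked, k=3))  # 2 of 3 relevant in top-3 -> ~0.667
```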
5. Deployment & Post-Production
Serving: Batch inference vs. Real-time API.
Optimization: Quantization, Pruning, Distillation.
Monitoring: Data Drift (Kolmogorov-Smirnov test; sketched below), Model Drift, Latency, Error rates.
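A minimal drift check using SciPy's two-sample K-S test; the distributions and the 0.01 threshold are illustrative:

```python
import numpy as np
from scipy.stats import ks_2samp

rng = np.random.default_rng(0)
train_feature = rng.normal(0.0, 1.0, size=5_000)  # training-time snapshot
live_feature = rng.normal(0.3, 1.0, size=5_000)   # live traffic, mean-shifted

stat, p_value = ks_2samp(train_feature, live_feature)
if p_value < 0.01:
    print(f"Drift detected (KS={stat:.3f}, p={p_value:.1e}): alert / retrain")
```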
Case Study: Ranking YouTube Videos
Interviewer: "Design a system to rank videos on the YouTube home page."
Clarify:
Multi-stage ranking? Yes, Candidate Generation → Ranking.
Objective? Maximize long-term watch time.
Candidate Generation:
Filter 1B videos to ~500.
Use Two-Tower Networks (User Tower, Video Tower) with dot-product similarity; a minimal sketch follows.
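A minimal PyTorch sketch of the two-tower idea, with each tower collapsed to a single embedding lookup (real towers are MLPs over many features; the sizes here are toy):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TwoTower(nn.Module):
    """User and video embeddings trained so dot product approximates affinity."""
    def __init__(self, n_users, n_videos, dim=64):
        super().__init__()
        self.user_emb = nn.Embedding(n_users, dim)
        self.video_emb = nn.Embedding(n_videos, dim)

    def forward(self, user_ids, video_ids):
        u = F.normalize(self.user_emb(user_ids), dim=-1)    # user tower output
        v = F.normalize(self.video_emb(video_ids), dim=-1)  # video tower output
        return (u * v).sum(dim=-1)  # dot-product similarity score

model = TwoTower(n_users=10_000, n_videos=50_000)  # toy vocabulary sizes
scores = model(torch.tensor([1, 2]), torch.tensor([42, 7]))
```

At serving time only the user tower runs online; video embeddings are pre-computed and loaded into the ANN index described under Online Serving.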
Ranking (The "Heavy" Model):
Goal: Predict the probability that watch time exceeds a threshold $X$.
Features: User history, Video embeddings, Context (device, time).
Label: Continuous watch time, or binary "watched > 30s" (see the toy ranker sketch below).
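A toy PyTorch ranker for the binary formulation; the feature dimension, layer sizes, and placeholder inputs are all assumptions:

```python
import torch
import torch.nn as nn

class Ranker(nn.Module):
    """MLP over concatenated user/video/context features -> P(watch > 30s)."""
    def __init__(self, feature_dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(feature_dim, 256), nn.ReLU(),
            nn.Linear(256, 64), nn.ReLU(),
            nn.Linear(64, 1),
        )

    def forward(self, x):
        return torch.sigmoid(self.net(x)).squeeze(-1)

# Binary label derived from raw logs: 1 if the watch exceeded 30 seconds.
watch_seconds = torch.tensor([5.0, 120.0, 31.0, 2.0])
labels = (watch_seconds > 30).float()

model = Ranker(feature_dim=128)
probs = model(torch.randn(4, 128))  # placeholder feature vectors
loss = nn.functional.binary_cross_entropy(probs, labels)
```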
Online Serving:
Use ANN (Approximate Nearest Neighbor) search, e.g., Faiss, for candidate retrieval; a Faiss sketch follows this sub-list.
Re-rank the top 500 using the heavy DNN.
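A Faiss retrieval sketch (the vectors are random placeholders; note that IndexFlatIP is exact, so at real scale you would switch to an approximate index such as IVF or HNSW):

```python
import numpy as np
import faiss  # pip install faiss-cpu

dim, n_videos = 64, 100_000
video_vecs = np.random.rand(n_videos, dim).astype("float32")
faiss.normalize_L2(video_vecs)  # unit vectors: inner product == cosine

index = faiss.IndexFlatIP(dim)  # exact inner-product index
index.add(video_vecs)

user_vec = np.random.rand(1, dim).astype("float32")
faiss.normalize_L2(user_vec)
scores, ids = index.search(user_vec, 500)  # ~500 candidates for the heavy ranker
```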
Cold Start:
For new videos, use content-based features (title, tags) before they get user interactions.
Pro-Tips for the Deep Dive
Explain the "Why": Why did you choose Adam over SGD in your project? (e.g., faster convergence, handles noisy gradients).
Acknowledge Trade-offs: "We used XGBoost because it was interpretable and handled our tabular data well, even though an MLP might have had slightly higher accuracy."
Focus on Impact: Always mention the % improvement in business metrics, not just ML metrics.
Study Checklist
[ ] LeetCode: ~100-150 Mediums; focus on the "Top 75".
[ ] System Design: Read "Designing Data-Intensive Applications" (DDIA) and "Designing Machine Learning Systems" by Chip Huyen.
[ ] Projects: Be ready to talk for 20 minutes about your best work, including failures and pivots.