
URL Shortener

Difficulty: Easy
Topics: Hashing, Base62 Encoding, Database Sharding
Time: 45-60 minutes
Companies: Google, Amazon, Meta, Microsoft

Problem Statement

Design a URL shortening service like bit.ly or TinyURL that:

  • Converts long URLs to short URLs

  • Redirects short URLs to original long URLs

  • Tracks click analytics (optional)

Example:

Long URL: https://www.example.com/very/long/url/with/many/parameters?id=12345
Short URL: https://short.ly/abc1234

Requirements Gathering

Functional Requirements

Must-Have:

  1. Generate short URL from long URL

  2. Redirect short URL to long URL (301/302)

  3. Short URLs never expire

Nice-to-Have:

  4. Custom aliases (e.g., short.ly/mycompany)

  5. Analytics (clicks, geography, referrers)

  6. Expiration time (TTL)

  7. Rate limiting

Non-Functional Requirements

Performance:

  • Latency: P99 < 100ms for redirects

  • Throughput: 10K writes/sec, 100K reads/sec (10:1 read:write ratio common)

Availability:

  • 99.99% uptime (53 minutes downtime/year)

  • No single point of failure

Scalability:

  • 100M new URLs/month

  • 10B redirects/month

Security:

  • Prevent spam/malicious URLs

  • Rate limiting per user/IP

Reliability:

  • URLs never lost (durable storage)

  • Eventual consistency acceptable


Capacity Estimation

Traffic Estimates

Storage Estimates

Bandwidth Estimates

Cache Requirements (80/20 Rule)
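The estimates above can be filled in with back-of-envelope math from numbers already stated (100M new URLs/month, 10B redirects/month). A sketch, assuming ~500 bytes per stored record (an assumption, not stated above); note the averages come out far below the 10K/100K QPS peak targets in the non-functional requirements, which is expected since those targets cover traffic spikes:

```python
# Back-of-envelope capacity math from the stated monthly figures.
# The 500-byte record size is an illustrative assumption.
SECONDS_PER_MONTH = 30 * 24 * 3600  # ~2.6M seconds

new_urls_per_month = 100_000_000
redirects_per_month = 10_000_000_000
bytes_per_record = 500  # assumed: short code + long URL + metadata

avg_write_qps = new_urls_per_month / SECONDS_PER_MONTH
avg_read_qps = redirects_per_month / SECONDS_PER_MONTH
storage_per_month_gb = new_urls_per_month * bytes_per_record / 1e9

print(f"avg writes/sec: {avg_write_qps:.0f}")            # ~39
print(f"avg reads/sec:  {avg_read_qps:.0f}")             # ~3858
print(f"storage/month:  {storage_per_month_gb:.0f} GB")  # ~50 GB
```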


API Design

REST Endpoints

1. Create Short URL

2. Redirect Short URL

Note for Interviews: Discuss 301 vs 302.

  • 301 (Moved Permanently): Browser caches the redirect. Reduces load on servers but prevents accurate analytics tracking.

  • 302 (Found/Moved Temporarily): Browser contacts our server for every click. Higher load but 100% accurate analytics. Use 302 if analytics is a functional requirement.

3. Get URL Info (Optional)

4. Delete URL
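The four endpoints above can be modeled in a few lines. This is an in-memory sketch of the request/response shapes only; the paths implied by the docstrings, the field names, and the seed counter value are all assumptions, not a spec. It returns 302 on redirect, consistent with the analytics discussion above:

```python
# Illustrative in-memory model of the four REST endpoints.
import itertools
import string

ALPHABET = string.digits + string.ascii_lowercase + string.ascii_uppercase
_counter = itertools.count(1_000_000)  # seeded so codes are several chars long
_store = {}  # short_code -> long_url

def _encode(n):
    """Base62-encode a positive integer."""
    digits = []
    while n:
        n, r = divmod(n, 62)
        digits.append(ALPHABET[r])
    return "".join(reversed(digits)) or "0"

def create_url(long_url):
    """POST /api/v1/urls -> 201 with the new short code."""
    code = _encode(next(_counter))
    _store[code] = long_url
    return {"status": 201, "short_code": code, "long_url": long_url}

def redirect(code):
    """GET /{code} -> 302 to the long URL, or 404."""
    if code in _store:
        return {"status": 302, "location": _store[code]}
    return {"status": 404}

def delete_url(code):
    """DELETE /api/v1/urls/{code} -> 204, or 404."""
    if _store.pop(code, None) is not None:
        return {"status": 204}
    return {"status": 404}
```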


Database Schema

SQL Schema (PostgreSQL)

NoSQL Schema (DynamoDB)


High-Level Design

Architecture Diagram


Data Flow

Write Flow (Create Short URL)

Read Flow (Redirect Short URL)


Deep Dive Topics

1. Short Code Generation

Requirements:

  • Unique (no collisions)

  • Short (7 characters = 62^7 = 3.5 trillion combinations)

  • URL-safe characters: [a-zA-Z0-9] (Base62)

Option A: Hash-based (MD5/SHA + Base62)

Pros: Deterministic (same URL → same short code)
Cons: Collisions possible (need retry logic)


Option B: Auto-Incrementing Counter + Base62 (Recommended)

Pros: Guaranteed unique, no collisions
Cons: Sequential (somewhat predictable)
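The core of Option B is just Base62 encoding of a monotonically increasing ID (which would come from the database or a dedicated ID service). A sketch; the digits-lowercase-uppercase alphabet order is a convention, not mandated:

```python
import string

# 62 URL-safe characters: 0-9, a-z, A-Z
ALPHABET = string.digits + string.ascii_lowercase + string.ascii_uppercase

def base62(n):
    """Base62-encode a non-negative integer."""
    out = []
    while n:
        n, r = divmod(n, 62)
        out.append(ALPHABET[r])
    return "".join(reversed(out)) or "0"

def base62_decode(s):
    """Inverse of base62, useful for looking up the original ID."""
    n = 0
    for ch in s:
        n = n * 62 + ALPHABET.index(ch)
    return n
```

Every ID below 62^7 (~3.5 trillion) fits in at most 7 characters, matching the capacity math above.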


Option C: Snowflake ID (Distributed ID Generator)

Pros: Distributed, no coordination, time-ordered
Cons: Requires a separate ID service
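A sketch of a Snowflake-style generator, using the common 41/10/12 bit layout (timestamp | machine ID | sequence); the custom epoch is an arbitrary assumption, and a production version must also handle sequence exhaustion within one millisecond and the clock moving backwards:

```python
import time

EPOCH_MS = 1_600_000_000_000  # custom epoch (assumed)

class SnowflakeGen:
    """Toy 64-bit time-ordered ID generator: 41-bit ms timestamp,
    10-bit machine ID, 12-bit per-millisecond sequence."""

    def __init__(self, machine_id):
        assert 0 <= machine_id < 1024  # must fit in 10 bits
        self.machine_id = machine_id
        self.sequence = 0
        self.last_ms = -1

    def next_id(self):
        now = int(time.time() * 1000) - EPOCH_MS
        if now == self.last_ms:
            self.sequence = (self.sequence + 1) & 0xFFF  # 12-bit sequence
        else:
            self.sequence = 0
            self.last_ms = now
        return (now << 22) | (self.machine_id << 12) | self.sequence
```

The resulting integer can then be Base62-encoded into a short code exactly as in Option B.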


2. Scaling the Database

Sharding Strategy:

Pros: Even distribution, horizontal scaling
Cons: Cross-shard queries are difficult (e.g., "get all of a user's URLs")

Solution for user queries: use a Global Secondary Index (GSI) in DynamoDB, or replicate the data to a separate analytics DB.
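Routing a short code to a shard is a one-liner. A sketch, using CRC32 rather than Python's built-in `hash()` (which is randomized per process and would break cross-process routing); the shard count is an assumption:

```python
import zlib

NUM_SHARDS = 16  # assumed shard count

def shard_for(short_code):
    """Map a short code to a shard via a stable hash."""
    return zlib.crc32(short_code.encode()) % NUM_SHARDS
```

Plain modulo sharding reshuffles most keys when NUM_SHARDS changes; consistent hashing is the usual fix if resharding must be cheap.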


3. Caching Strategy and Optimizations

Cache-Aside (Lazy Loading):

Cache Eviction: LRU (Least Recently Used)
Cache Size: 32 GB (covers 6M hot URLs)
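The cache-aside read path with LRU eviction can be sketched in a few lines; the `LRUCache` and plain-dict `db` below stand in for Redis and the database:

```python
from collections import OrderedDict

class LRUCache:
    """Toy LRU cache standing in for Redis with an LRU eviction policy."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.data = OrderedDict()  # ordered: oldest entry first

    def get(self, key):
        if key in self.data:
            self.data.move_to_end(key)  # mark as recently used
            return self.data[key]
        return None

    def put(self, key, value):
        self.data[key] = value
        self.data.move_to_end(key)
        if len(self.data) > self.capacity:
            self.data.popitem(last=False)  # evict least recently used

def resolve(code, cache, db):
    """Cache-aside: check cache, fall back to DB, lazy-load on miss."""
    url = cache.get(code)
    if url is None:               # cache miss
        url = db.get(code)        # hit the database
        if url is not None:
            cache.put(code, url)  # populate cache for next time
    return url
```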

SDE-3 Optimization: Bloom Filters

  • Problem: Malicious users might request millions of random/invalid short URLs, causing cache misses and hitting the DB.

  • Solution: Place a Bloom filter in front of the cache. It is a space-efficient probabilistic data structure: false positives are possible (the request falls through to the DB and returns 404), but false negatives are impossible, so a "definitely not present" answer can reject the request without touching cache or DB.
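A minimal Bloom filter sketch: k hash probes into an m-bit array, with double hashing to derive the probes from one SHA-256 digest. The sizes here are toy values; real deployments derive m and k from the expected key count and target false-positive rate:

```python
import hashlib

class BloomFilter:
    """Toy Bloom filter: add() every valid short code at creation time;
    might_contain() == False means the code definitely does not exist."""

    def __init__(self, m_bits=1 << 20, k=5):
        self.m = m_bits
        self.k = k
        self.bits = bytearray(m_bits // 8)

    def _probes(self, key):
        h = hashlib.sha256(key.encode()).digest()
        h1 = int.from_bytes(h[:8], "big")
        h2 = int.from_bytes(h[8:16], "big")
        for i in range(self.k):          # double hashing: h1 + i*h2 mod m
            yield (h1 + i * h2) % self.m

    def add(self, key):
        for p in self._probes(key):
            self.bits[p // 8] |= 1 << (p % 8)

    def might_contain(self, key):
        return all(self.bits[p // 8] & (1 << (p % 8)) for p in self._probes(key))
```

On a redirect request, a `might_contain` miss returns 404 immediately, so floods of random invalid codes never reach the cache or DB.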


4. Rate Limiting

Token Bucket Algorithm (per user/IP):

Limits:

  • 10 URLs/minute per user (free tier)

  • 1000 URLs/minute per user (paid tier)
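A sketch of the token bucket: each user's bucket holds up to `capacity` tokens, refilled continuously at a fixed rate; a request spends one token or is rejected. The free-tier parameters at the bottom map directly to the limits above:

```python
import time

class TokenBucket:
    """Toy per-user token bucket rate limiter."""

    def __init__(self, capacity, rate_per_sec):
        self.capacity = capacity
        self.rate = rate_per_sec
        self.tokens = capacity
        self.last = time.monotonic()

    def allow(self):
        now = time.monotonic()
        # Refill proportionally to elapsed time, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False

# Free tier above: 10 URLs/minute -> burst of 10, refill 10/60 tokens per second.
free_tier = TokenBucket(capacity=10, rate_per_sec=10 / 60)
```

In production the bucket state would live in Redis (keyed by user ID or IP) so all API servers share the same limits.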


Failure Scenarios & Mitigation

Scenario 1: Database Primary Failure

Impact: Cannot create new URLs; writes fail
Mitigation:

  • Promote read replica to primary (automated failover)

  • Reads continue from replicas (cached redirects unaffected)

  • RTO: < 5 minutes, RPO: < 1 minute of data loss

Scenario 2: Cache Failure (Redis Down)

Impact: All requests hit the database (performance degradation)
Mitigation:

  • Database read replicas handle load (12K QPS tolerable)

  • Degrade gracefully (slightly higher latency)

  • Auto-restart Redis, persistent storage (RDB/AOF)

Scenario 3: ID Generation Service Down

Impact: Cannot generate new short codes
Mitigation:

  • Pre-generate IDs and store in buffer (1M pre-generated IDs)

  • Fallback to hash-based generation

  • Multi-region ID service (active-active)
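The pre-generated ID buffer from the mitigation list can be sketched as a local queue that refills in batches before it runs dry; the batch size, low-water mark, and `fetch_batch` callable (standing in for a call to the ID service) are assumptions:

```python
from collections import deque

class IdBuffer:
    """Drain a local queue of pre-fetched IDs; refill in batches from the
    ID service before the buffer runs dry, so brief outages are absorbed."""

    def __init__(self, fetch_batch, low_water=1000):
        self.fetch_batch = fetch_batch  # callable returning a list of fresh IDs
        self.low_water = low_water
        self.ids = deque()

    def next_id(self):
        if len(self.ids) <= self.low_water:
            try:
                self.ids.extend(self.fetch_batch())
            except Exception:
                pass  # ID service down: keep serving from the remaining buffer
        return self.ids.popleft()  # raises only once the buffer is empty too
```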


Monitoring & Alerts

Key Metrics:

Alerts:


Cost Estimation (AWS)


Interview Tips

Common Questions:

  1. "How do you generate short codes?" → Base62 encoding of counter/Snowflake ID

  2. "What if the database goes down?" → Read replicas, automated failover

  3. "How do you prevent spam?" → Rate limiting, CAPTCHA, blacklist malicious domains

  4. "How do you scale to billions of URLs?" → Sharding, caching, read replicas

Good Answer Template:

"I'd use a counter-based approach with Base62 encoding to generate 7-character short codes (3.5 trillion combinations). Store URLs in PostgreSQL, shard by hash(short_code) for horizontal scaling. Use Redis cache for 95% hit rate on redirects. Implement token bucket rate limiting per user. Monitor success rate (99.99% SLA) and P99 latency (<100ms)."

Time Allocation:

  • Requirements: 5 min

  • Estimation: 5 min

  • API + Schema: 5 min

  • Architecture: 10 min

  • Deep dives (short code gen, sharding, caching): 20 min

  • Failure scenarios: 5 min
