Internals

This directory contains deep-dive guides into the internals of popular distributed systems and databases. Understanding these internals is crucial for SDE-3 level system design interviews.

Available Internals

Message Queues & Streaming

Kafka - Distributed streaming platform
- Partitions, ISR, Zero-Copy, Log Compaction
- Exactly-once semantics, Consumer groups

Databases

NoSQL

Cassandra - Wide-column distributed database
- Ring architecture, Consistent hashing, LSM-tree
- Compaction strategies, Tunable consistency

SQL

PostgreSQL - Advanced relational database
- MVCC, WAL, VACUUM, Transaction isolation
- Query planner, Indexes (B-tree, GIN, BRIN)

Caching & In-Memory Stores

Redis - In-memory data structure store
- Data structures, Persistence (RDB, AOF)
- Replication, Cluster mode

Search & Analytics

Elasticsearch - Distributed search engine
- Inverted index, Sharding, Segment merging
- Query execution, Aggregations

Coordination Services

ZooKeeper - Distributed coordination
- Znodes, ZAB protocol, Leader election
- Watches, Session management

How to Use These Guides

For Interview Prep

Read sequentially: Start with overview, then dive into specific sections
Focus on trade-offs: Understand why each design choice was made
Practice explaining: Can you explain MVCC or ZAB to an interviewer?
Review interview questions: Each guide has typical interview questions at the end

During System Design

Reference architecture patterns: Cassandra's ring, Kafka's partitioning
Compare alternatives: When to use PostgreSQL vs Cassandra?
Justify technology choices: "I'd use Kafka because of its log compaction for..."

Deep Dive Topics by Role

Backend Engineer:

PostgreSQL: MVCC, WAL, query optimization
Redis: Persistence strategies, replication
Kafka: Exactly-once semantics, consumer groups

Distributed Systems Engineer:

Cassandra: Consistency levels, anti-entropy repair
ZooKeeper: ZAB protocol, leader election
Elasticsearch: Cluster state, shard allocation

Data Engineer:

Kafka: Log retention, compaction policies
Elasticsearch: Inverted index, aggregations
Cassandra: Data modeling, compaction strategies

Common Interview Patterns

"How does X handle failures?"

Cassandra: Hinted handoff, read repair
PostgreSQL: WAL replay, replication
Kafka: ISR, controlled shutdown
ZooKeeper: Fast leader election, quorum

"Explain the write path in X"

Cassandra: Commit log → MemTable → SSTable
PostgreSQL: WAL → Buffer pool → Disk
Kafka: Append to partition → Replicate to ISR
Elasticsearch: Translog → Refresh → Flush

"How does X achieve consistency?"

Cassandra: Tunable (ONE, QUORUM, ALL)
PostgreSQL: Isolation levels (Read Committed, Repeatable Read)
Kafka: ISR + acks configuration
ZooKeeper: ZAB (total order, sequential consistency)

Comparison Matrix

System

Type

Read

Write

Consistency

Use Case

Cassandra

NoSQL (Wide-column)

Fast

Very Fast

Tunable (eventual→strong)

Time-series, high write

PostgreSQL

RDBMS

Fast

Moderate

Strong (ACID)

Transactional, complex queries

Redis

In-memory

Very Fast

Eventual→Strong

Caching, real-time

Elasticsearch

Search Engine

Very Fast

Moderate

Near real-time

Full-text search, analytics

Kafka

Streaming

N/A

Very Fast

Configurable

Event streaming, logs

ZooKeeper

Coordination

Fast

Moderate

Sequential

Leader election, config

Further Learning

Books

Designing Data-Intensive Applications (DDIA) by Martin Kleppmann
Database Internals by Alex Petrov
Kafka: The Definitive Guide by Neha Narkhede

Official Docs

Practice

Deploy these systems locally (Docker)
Run benchmarks, observe behavior
Intentionally fail nodes, observe recovery

PreviousChaos Engineering NextKafka

Last updated 1 month ago

hashtagAvailable Internals

hashtagMessage Queues & Streaming

hashtagDatabases

hashtagNoSQL

hashtagSQL

hashtagCaching & In-Memory Stores

hashtagSearch & Analytics

hashtagCoordination Services

hashtagHow to Use These Guides

hashtagFor Interview Prep

hashtagDuring System Design

hashtagDeep Dive Topics by Role

hashtagCommon Interview Patterns

hashtag"How does X handle failures?"

hashtag"Explain the write path in X"

hashtag"How does X achieve consistency?"

hashtagComparison Matrix

hashtagFurther Learning

hashtagBooks

hashtagOfficial Docs

hashtagPractice

Available Internals

Message Queues & Streaming

Databases

NoSQL

SQL

Caching & In-Memory Stores

Search & Analytics

Coordination Services

How to Use These Guides

For Interview Prep

During System Design

Deep Dive Topics by Role

Common Interview Patterns

"How does X handle failures?"

"Explain the write path in X"

"How does X achieve consistency?"

Comparison Matrix

Further Learning

Books

Official Docs

Practice