HashMap internal working

HashMap Internals - Simple & Interview Ready

Overview: What is HashMap?

A HashMap stores key-value pairs using hashing for fast access. Think of it as a smart dictionary where words (keys) instantly point to definitions (values).

HashMap Initialization

flowchart TD
    Start[HashMap Initialization]

    Start --> S1

    subgraph S1[Creating a HashMap]
        Code["HashMap map = new HashMap<>()"]
    end

    S1 --> S2

    subgraph S2[Default Parameters]
        direction TB
        P1["• Initial Capacity: 16 buckets"]
        P1 --> P2["• Load Factor: 0.75"]
        P2 --> P3["• Threshold: 12 (16 × 0.75)"]
        P3 --> P4["• Empty bucket array created"]
    end

    S2 --> Visual

    subgraph Visual[Internal Bucket Array]
        direction TB
        B["Bucket Array (size: 16)"]
        B --> B0["Bucket 0: null"]
        B --> B1["Bucket 1: null"]
        B --> B2["Bucket 2: null"]
        B --> B3["Bucket 3: null"]
        B --> BDots["..."]
        B --> B14["Bucket 14: null"]
        B --> B15["Bucket 15: null"]
    end

    style Start fill:#222,stroke:#444,stroke-width:2px,color:#fff
    style S1 fill:#1a1a2e,stroke:#6c5ce7,stroke-width:2px,color:#fff
    style S2 fill:#1a1a2e,stroke:#45b7d1,stroke-width:2px,color:#fff
    style Visual fill:#1a1a2e,stroke:#4ecdc4,stroke-width:2px,color:#fff
    style B fill:#2a2a3e,stroke:#4ecdc4,stroke-width:1px,color:#fff

How does put() work?

flowchart TD
    Start["putkey, value"]
    Start --> Hash["Calculate hash = key.hashCode()"]
    Hash --> Index["index = hash & (capacity - 1)"]
    Index --> Check["Check bucket at index"]
    Check --> Empty["Empty? → Create Node"]
    Check --> Occupied["Occupied? → Handle collision"]
    Occupied --> List["Linked list: add to end"]
    Occupied --> Tree["Tree: insert in tree"]
    List --> SizeCheck["Check size > threshold?"]
    SizeCheck --> Yes["Yes → Resize (double)"]
    SizeCheck --> No["No → Done"]

Interview Answer: “HashMap uses an array of buckets. When we insert a key-value pair, it calculates which bucket to use based on the key’s hash.”

HashMap Time Complexity Summary

Operation	Average Case	Worst Case (Pre-Java 8)	Worst Case (Java 8+)
get()	O(1)	O(n)	O(log n)
put()	O(1)	O(n)	O(log n)
remove()	O(1)	O(n)	O(log n)
containsKey()	O(1)	O(n)	O(log n)

Interview Answer: “HashMap gives O(1) average time for get/put/remove. Worst case is O(n) with many collisions, improved to O(log n) with trees in Java 8. Load factor 0.75 balances time vs space.”

Quick Interview Cheat Sheet

Q: How does HashMap work internally?

A: It uses an array of buckets. When you put a key-value pair:

Calculate hash of key
Find bucket: hash & (array.length - 1)
If bucket empty → add Node
If bucket occupied → handle collision (linked list/tree)
If size exceeds threshold → resize (double capacity)

Q: What is collision?

A: When two keys end up in the same bucket.

Q: What happens during collision?

A: Keys with same bucket form a linked list. In Java 8+, when list exceeds 8 elements, it converts to Red-Black tree for better performance.

Q: When does HashMap resize?

A: When number of entries exceeds capacity × load factor (default: 16 × 0.75 = 12). It doubles capacity and rehashes all entries.

Q: Why is load factor 0.75?

A: It’s a trade-off: higher values save memory but cause more collisions, lower values reduce collisions but waste memory. 0.75 is the sweet spot.

Q: What’s the time complexity?

A: Average case O(1) for all operations. Worst case O(n) with many collisions, improved to O(log n) with trees in Java 8+.

Key Points to Remember:

Buckets array + Linked lists + Trees (Java 8+)
Resize when 75% full (doubles capacity)
Tree conversion at 8+ collisions
O(1) average, O(log n) worst with trees
Not thread-safe (use ConcurrentHashMap for threads)