Java HashMap Class | W3Docs Learn Java

HashMap<K, V> is the default Map implementation in the JDK and the most-used data structure in Java application code. It backs HashSet (which is a HashMap with all values set to a single dummy), it's what Collectors.toMap builds, and it's the structure behind every "lookup table" you write that isn't sorted or concurrent. Operations are expected O(1) — a hash, a bucket index, one or two equals checks — independent of size.

Core operations

These are the methods you reach for daily. Every one is expected O(1).

Method	What it does
`put(k, v)`	Inserts or overwrites; returns the previous value (or `null`).
`get(k)`	Returns the value, or `null` if the key is absent.
`getOrDefault(k, def)`	Like `get`, but returns `def` instead of `null` on a miss.
`putIfAbsent(k, v)`	Sets the value only if the key is absent or mapped to `null`.
`merge(k, v, fn)`	Combines an existing value with `v` via `fn` — the counter idiom.
`computeIfAbsent(k, fn)`	Computes and stores a value on a miss — the cache idiom.
`remove(k)`	Removes the entry; returns the removed value, or `null`.
`containsKey(k)`	The only reliable way to tell "absent" from "mapped to `null`".

Iterate over entries (not keys, when you need the values too) to avoid a second lookup per key:

Map<String, Integer> scores = new HashMap<>();
scores.put("alice", 90);
scores.put("bob", 75);

for (Map.Entry<String, Integer> e : scores.entrySet()) {
    System.out.println(e.getKey() + " -> " + e.getValue());
}

// Or the lambda form:
scores.forEach((name, score) -> System.out.println(name + " -> " + score));

How the table is laid out

A HashMap keeps a power-of-two-sized array of buckets. Inserting an entry does five things:

Compute h = hashCode(key). Mix the upper and lower 16 bits together — h ^ (h >>> 16) — so a hash like 0x12340000 doesn't drop its top bits when masked.
Mask: i = h & (table.length - 1). That's h mod length for power-of-two lengths, and it's faster than the modulo operator.
Walk the chain at table[i]. If a node with an equal key exists, overwrite its value and return the old one.
Otherwise, prepend (or, since Java 8, append) a new node.
If size > capacity * loadFactor, resize: double the table and re-bucket every entry.

Up to Java 7 the bucket chain was a singly-linked list, full stop. From Java 8 on, once a chain reaches eight entries, the bucket is converted to a small balanced tree (a red-black tree) keyed by the hash. Lookup in that bucket becomes O(log n) instead of O(n), which caps the damage of a denial-of-service attack that crafts colliding hashes. If the tree shrinks back to six or fewer entries, it reverts to a list. You won't see this in normal code — it only matters when your hashCode is adversarial or pathologically bad.

Capacity, load factor, and pre-sizing

Same dials as HashSet:

Initial capacity — default 16, rounded up to a power of two.
Load factor — default 0.75. When size > capacity * 0.75, the table doubles.

If you know the size up front, pre-size:

Map<String, User> users = new HashMap<>(expectedSize * 4 / 3); // skip the doublings

Or, since Java 19, the explicit factory:

Map<String, User> users = HashMap.newHashMap(expectedSize);

That's the cleanest expression of intent — it computes the right initial capacity from a target size so the table doesn't need to grow.

Null keys and null values

HashMap allows one null key (it's stored in bucket 0 with hash 0) and any number of null values. That's a convenience over Hashtable (which rejects both) but it muddies the meaning of get(k) == null:

m.put("key", null);
m.get("key");          // returns null
m.containsKey("key"); // returns true

The disambiguation cost is real. Prefer to not store null values; use Optional, a sentinel, or just leave the key out. The Java 9+ factory Map.of(...) enforces this for you.

`hashCode` and `equals` are your contract

Putting your own class into a HashMap only works if hashCode and equals are consistent. The same rules as HashSet:

Equal objects must have equal hash codes.
Unequal objects may collide (it's fine, that's why buckets are chains).
Mutating a key after insertion is undefined behaviour.

Use a record if you can — both methods are generated correctly. Or let the IDE generate them. Never hand-write hashCode if you can help it.

record UserId(String tenant, String localPart) {}
Map<UserId, User> directory = new HashMap<>();
directory.put(new UserId("acme", "alice"), new User(/*...*/));
directory.get(new UserId("acme", "alice")); // hit

Iteration order — explicitly undefined

HashMap makes no guarantee about iteration order. The order depends on the bucket layout, which depends on the hash, the capacity, and the resize history — it can change between runs and between JVM versions. If you rely on the order, your code is broken; if your tests rely on the order, they're flaky.

If iteration order matters, use LinkedHashMap for insertion order or TreeMap for sorted order. Both are drop-in replacements.

Not thread-safe

HashMap will corrupt itself under concurrent mutation — and historically a famously bad failure mode was an infinite loop during a concurrent resize. Don't share a HashMap between threads. The right structure for multi-threaded code is ConcurrentHashMap (covered later in the concurrency part). Collections.synchronizedMap(new HashMap<>()) exists but uses a single lock around every operation, which is slower and rarely the right answer.

A worked example: counter, lookup table, and the modern idioms

The program below uses a HashMap several ways: a word-count counter via merge, a recursive memoization cache, a value-null ambiguity demonstration, the Java-19 newHashMap factory, and a record as a composite key.

java— editable, runs on the server

import java.util.*;
import java.util.function.*;

public class HashMapShowcase {
  public static void main(String[] args) {
    // --- 1. Counter via merge ---
    String[] words = { "java", "map", "java", "set", "java", "map", "list" };
    Map<String, Integer> counts = new HashMap<>();
    for (String w : words) counts.merge(w, 1, Integer::sum);
    System.out.println("counts: " + counts);

// --- 2. Recursive memoization (get + put) ---
    Map<Integer, Long> fibCache = new HashMap<>();
    Function<Integer, Long> fib = new Function<>() {
      public Long apply(Integer n) {
        if (n <= 1) return (long) n;
        Long cached = fibCache.get(n);
        if (cached != null) return cached;
        long value = apply(n - 1) + apply(n - 2);
        fibCache.put(n, value);
        return value;
      }
    };
    System.out.println("\nfib(50) = " + fib.apply(50));
    System.out.println("cache size: " + fibCache.size());

// --- 3. The null-value ambiguity ---
    Map<String, String> m = new HashMap<>();
    m.put("present", null);
    System.out.println("\nm.get('present')         = " + m.get("present"));
    System.out.println("m.get('absent')          = " + m.get("absent"));
    System.out.println("m.containsKey('present') = " + m.containsKey("present"));
    System.out.println("m.containsKey('absent')  = " + m.containsKey("absent"));

// --- 4. The Java-19 newHashMap factory: precise pre-sizing ---
    Map<Integer, String> sized = HashMap.newHashMap(1_000_000);
    long t0 = System.nanoTime();
    for (int i = 0; i < 1_000_000; i++) sized.put(i, "v" + i);
    long t1 = System.nanoTime();
    System.out.println("\n1M inserts into pre-sized HashMap: " + ((t1 - t0) / 1_000_000) + " ms");

// --- 5. Custom keys via record (auto equals/hashCode) ---
    record UserId(String tenant, String localPart) {}
    Map<UserId, String> directory = new HashMap<>();
    directory.put(new UserId("acme", "alice"), "Alice Smith");
    System.out.println("\ndirectory lookup: " + directory.get(new UserId("acme", "alice")));
  }
}

What to take from the run:

merge collapses the three-step "get, default-or-add-one, put" into one call. Use it whenever you're maintaining a per-key counter or sum.
The Fibonacci cache turns an exponential recursion into a linear one: check the map, recurse on a miss, then put the result. Note it uses get + put rather than computeIfAbsent — a recursive computeIfAbsent mutates the map while its own mapping function is still running, and since Java 9 that throws ConcurrentModificationException. Reserve computeIfAbsent for non-recursive "load-or-compute" lookups.
The null ambiguity is real. get returned null for a present key and an absent key the same way. The only way to tell them apart is containsKey — or by deciding you don't store nulls in the first place.
Pre-sizing with HashMap.newHashMap(1_000_000) lets a million inserts finish without any rehashes — the table starts at the right capacity.
The UserId record gives correct equals/hashCode for free. That's the modern way to compose hash-map keys from multiple fields.

What's next

HashMap doesn't promise iteration order. If you need insertion order remembered — say you're serializing the map to JSON and want stable output — the right tool is LinkedHashMap. It's also the basis of a textbook LRU cache, which we cover in the same chapter.

Practice

You see `m.merge(key, 1, Integer::sum)` in code where `m` is a `Map<String, Integer>`. What does it do?

Increments the count for `key`, treating an absent key as starting from 0 — equivalent to `m.put(key, m.getOrDefault(key, 0) + 1)`Inserts `1` only if the key is absent, returning the existing value otherwiseReplaces the value at `key` with `1` and returns the sum of all values in the mapThrows if `key` is absent because `merge` requires a non-null current value