Java 21 Virtual Threads: The End of Reactive Complexity

For the past several years, if you wanted high-throughput Java applications, you had two choices: thread pools with careful tuning, or reactive programming with Project Reactor or RxJava. I've built production systems with both approaches, and I can tell you — reactive code is powerful but painful. With Java 21's virtual threads (Project Loom), we finally have a third option that gives us the throughput of reactive with the simplicity of synchronous code.

What Are Virtual Threads?

Virtual threads are lightweight threads managed by the JVM rather than the operating system. A traditional platform thread maps 1:1 to an OS thread, consuming around 1MB of stack memory and carrying the overhead of OS-level scheduling. Virtual threads, on the other hand, are scheduled by the JVM on a small pool of carrier threads.

The numbers tell the story: you can create millions of virtual threads in a single JVM, whereas platform threads cap out around a few thousand before your system starts choking on memory and context-switching.

// The old way — platform threads with a fixed pool
ExecutorService executor = Executors.newFixedThreadPool(200);

// The new way — virtual threads, one per task
ExecutorService executor = Executors.newVirtualThreadPerTaskExecutor();

That single line change is deceptively powerful. Each task gets its own thread, there is no pool to size, and the JVM handles the rest.

Platform Threads vs Virtual Threads

Here is a direct comparison to make the differences concrete:

Aspect	Platform Threads	Virtual Threads
Memory per thread	~1MB stack	~1KB initially
Max practical count	~2,000-5,000	Millions
Scheduling	OS kernel	JVM (user-space)
Blocking cost	Expensive (holds OS thread)	Cheap (unmounts from carrier)
Context switch	~1-10 microseconds (kernel)	~200 nanoseconds (user-space)
Best for	CPU-bound work	I/O-bound work

The key insight is blocking is no longer expensive. When a virtual thread blocks on I/O — a database call, an HTTP request, file reads — the JVM unmounts it from the carrier thread and mounts another virtual thread. The carrier thread never sits idle.

Creating and Using Virtual Threads

Java 21 provides multiple ways to work with virtual threads:

// 1. Direct creation
Thread.startVirtualThread(() -> {
    System.out.println("Running on: " + Thread.currentThread());
});

// 2. Using the builder API
Thread vThread = Thread.ofVirtual()
    .name("worker-", 0)
    .start(() -> processRequest());

// 3. ExecutorService (most practical for real applications)
try (var executor = Executors.newVirtualThreadPerTaskExecutor()) {
    List<Future<String>> futures = new ArrayList<>();

    for (int i = 0; i < 10_000; i++) {
        futures.add(executor.submit(() -> {
            // Simulate I/O: database query, API call, etc.
            Thread.sleep(Duration.ofMillis(100));
            return fetchFromDatabase();
        }));
    }

    // Collect results
    for (Future<String> future : futures) {
        String result = future.get();
        process(result);
    }
}

Try launching 10,000 platform threads with Executors.newFixedThreadPool(10000) and watch your JVM beg for mercy. With virtual threads, this is a non-event.

Migrating from Reactive to Virtual Threads

This is where it gets exciting. Consider a typical WebFlux service:

// Reactive style — functional but hard to read, debug, and maintain
public Mono<OrderResponse> getOrderDetails(String orderId) {
    return orderRepository.findById(orderId)
        .flatMap(order -> Mono.zip(
            paymentClient.getPayment(order.getPaymentId()),
            shippingClient.getTracking(order.getTrackingId())
        ).map(tuple -> new OrderResponse(order, tuple.getT1(), tuple.getT2())))
        .switchIfEmpty(Mono.error(new OrderNotFoundException(orderId)))
        .onErrorResume(WebClientException.class,
            ex -> Mono.error(new ServiceUnavailableException(ex)));
}

Now the same logic with virtual threads and plain Spring MVC:

// Virtual threads style — straightforward, debuggable, familiar
public OrderResponse getOrderDetails(String orderId) {
    Order order = orderRepository.findById(orderId)
        .orElseThrow(() -> new OrderNotFoundException(orderId));

    try (var executor = Executors.newVirtualThreadPerTaskExecutor()) {
        Future<Payment> paymentFuture = executor.submit(
            () -> paymentClient.getPayment(order.getPaymentId()));
        Future<Tracking> trackingFuture = executor.submit(
            () -> shippingClient.getTracking(order.getTrackingId()));

        return new OrderResponse(order, paymentFuture.get(), trackingFuture.get());
    } catch (ExecutionException ex) {
        throw new ServiceUnavailableException(ex.getCause());
    }
}

Same concurrency, same throughput, zero reactive complexity. Stack traces make sense. Debugging works. Every Java developer on your team can read this code without learning a new programming paradigm.

Spring Boot 3.2+ Virtual Thread Support

Spring Boot 3.2 added first-class virtual thread support. Enabling it is a one-liner:

# application.yml
spring:
  threads:
    virtual:
      enabled: true

With this property set, Spring Boot configures Tomcat to handle each incoming request on a virtual thread. Your existing @RestController classes, @Service beans, and blocking JdbcTemplate calls all benefit immediately — no code changes required.

@RestController
@RequestMapping("/api/users")
public class UserController {

    private final UserService userService;

    @GetMapping("/{id}")
    public UserResponse getUser(@PathVariable Long id) {
        // This blocks — and that's perfectly fine on a virtual thread
        User user = userService.findById(id);
        List<Order> orders = userService.getRecentOrders(id);
        UserStats stats = userService.computeStats(id);

        return new UserResponse(user, orders, stats);
    }
}

Each request gets its own virtual thread. Under load, the JVM handles thousands of concurrent requests without the thread pool becoming a bottleneck.

Performance: Real Numbers

I benchmarked a REST API that makes two downstream HTTP calls and one database query per request, running on a 4-core machine with Spring Boot 3.2:

Configuration	Concurrent Users	Throughput (req/s)	Avg Latency	P99 Latency
Tomcat (200 threads)	500	1,850	270ms	890ms
Tomcat (200 threads)	2,000	1,920	1,040ms	3,200ms
WebFlux (Reactive)	2,000	4,100	485ms	780ms
Virtual Threads	2,000	3,950	500ms	740ms
WebFlux (Reactive)	10,000	4,300	2,300ms	5,100ms
Virtual Threads	10,000	4,150	2,400ms	4,800ms

Virtual threads match reactive throughput within 5% for I/O-bound workloads. The difference is negligible in practice, but the developer experience gap is massive.

When NOT to Use Virtual Threads

Virtual threads are not a silver bullet. They shine for I/O-bound work — waiting on databases, HTTP calls, file systems, message queues. They do not help with CPU-bound tasks.

CPU-intensive computation (image processing, encryption, complex algorithms) — virtual threads add scheduling overhead with no benefit. Stick with platform thread pools sized to your core count.
Synchronized blocks holding locks during I/O — this pins the virtual thread to its carrier, negating the benefits. Replace synchronized with ReentrantLock where possible.
Thread-local heavy code — virtual threads are cheap to create but each carries its own thread-local storage. If you create millions of threads with heavy thread-locals, memory adds up. Consider scoped values (ScopedValue) introduced alongside virtual threads.

// BAD: synchronized pins the virtual thread to its carrier
synchronized (lock) {
    result = httpClient.send(request, BodyHandlers.ofString());
}

// GOOD: ReentrantLock releases the carrier during blocking
lock.lock();
try {
    result = httpClient.send(request, BodyHandlers.ofString());
} finally {
    lock.unlock();
}

Final Thoughts

I spent two years writing reactive code in production. Mono, Flux, flatMap, switchIfEmpty, onErrorResume — I know these operators inside out. And I am happy to leave most of them behind.

Virtual threads represent something rare in the Java ecosystem: a genuinely simpler solution that does not sacrifice performance. You do not need to rewrite your mental model. You do not need to retrain your team. You write blocking code, the JVM makes it concurrent, and your application scales.

The best technology is the kind that disappears. You stop thinking about thread pools, backpressure, and reactive chains. You think about your business logic. Virtual threads get out of your way, and that is exactly what good infrastructure should do.

If you are starting a new project in 2025 or later and your workload is I/O-bound, there is very little reason to reach for WebFlux. Simplicity won.