Go EP19: sync.WaitGroup and The Interesting Alignment Problem

Oct 22, 2024

When you have multiple tasks that can run concurrently, you often use goroutines to handle them. However, a common issue arises when the main goroutine finishes its execution before the other goroutines complete their tasks.

This is where sync.WaitGroup comes into play, it acts as a synchronization mechanism that allows the main goroutine to wait for a collection of goroutines to finish.

By using WaitGroup.Add() to set the number of goroutines and
WaitGroup.Done() to signal the completion of each goroutine, you ensure that the main goroutine only proceeds once all tasks are complete.

This prevents premature termination and ensures orderly execution.

The Alignment Problem in sync.WaitGroup

The internal structure of sync.WaitGroup has evolved over different Go versions due to alignment issues, particularly on 32-bit architectures. The alignment problem arises because 64-bit values, like uint64, need to be stored at memory addresses that are multiples of 8 bytes for efficient access.

On 32-bit systems, this alignment isn't guaranteed, which can lead to crashes when using atomic operations. Over the years, Go has implemented various strategies to ensure proper alignment, such as using arrays to find aligned segments or introducing the atomic.Uint64 type to guarantee alignment.

The noCopy Struct

The noCopy struct is a clever addition to sync.WaitGroup that helps prevent accidental copying of the struct.

Copying a WaitGroup can lead to synchronization issues because the internal state might get out of sync between copies. The noCopy struct doesn't prevent copying at runtime but serves as a marker for tools like go vet to detect and warn about such mistakes during development.

Atomic Operations vs. Mutexes

There the trade-off between using atomic operations and mutexes for managing concurrency.

While mutexes provide a simple way to lock and unlock resources, they introduce overhead, especially in high-frequency operations. sync.WaitGroup uses atomic operations under the hood to modify its internal state without locks.

Evolution of sync.WaitGroup Across Go Versions

The internal structure of sync.WaitGroup has undergone several changes to address alignment issues and improve performance.

Initially, Go used a 12-byte array to ensure alignment, then moved to a more efficient 3-element uint32 array.
In Go 1.18, the structure was further optimized for 64-bit systems,
And by Go 1.20, the introduction of atomic.Uint64 provided a better solution for alignment across architectures.

It’s also importance to use sync.WaitGroup correctly to avoid common pitfalls.

For instance, calling wg.Add() with a positive delta should always precede wg.Wait().
Using defer wg.Done() within goroutines ensures that the counter is decremented even if the goroutine exits unexpectedly.
WaitGroup is very tricky to reuse, especially after a panic.

If you're interested in diving deeper into the content, you can follow the links https://victoriametrics.com/blog/go-sync-waitgroup/ to read the full posts.

Devtrovert

Discussion about this post