TL;DR: After reading
io_uring is not an event system, I think there is another way to consider why
io_uring can adapt to every use case:
io_uring is more than a generic asynchronous syscall facility. It's the state-of-the-art asynchronous interface for communication between subsystems implemented between the kernel and the userspace.
Starting from describing an abstract interface, a typical
io_uring like interface contains these parts:
- Control Plane
- Send control signal to the subsystem.
- Usually synchronous (e.g.
io_uring_enteris synchronous syscall because we wait for the control signal itself finished).
- Data Plane
- Exchange data between subsystems.
- Usually implemented by sharing cache or storage for reducing copying data.
- Can be synchronous, although it must be asynchronous if it's an
- Send events in a reverse direction to the control flow.
- Nice to have: We can poll the events if the interrupt is not available with some busy looping penalties.
These three components can describe not only the design of
io_uring but also lots of other system designs, including Hardware DMA interface, RDMA interface, netmap, Snap. All these system architectures share the same view of the subsystem, that the standalone subsystem will run continuously regardless of the application's state. In contrast, the synchronous view will be that the "remote function call" is part of the application instruction flow.
The growing interest in
io_uring means we are changing the view of syscall as a function call to that kernel is a standalone subsystem. That even makes more sense when comes to using eBPF with
io_uring. Hardware subsystems have their asynchronous nature and kernel is becoming one of them when more complex and customized computation happened in the kernel.
What's the future of
io_uring? One possible future is that if we keep improving the performance of
io_uring, adding fast user-level interrupt, it will become a userspace API mapping to hardware DMA. That means we can build all other syscalls in userspace on top of the DMA mapping.