Wednesday, February 24, 2021

Millennium prize problems but for Linux

There is a longstanding tradition in mathematics of creating lists of hard unsolved problems to drive people to work on solving them. Examples include Hilbert's problems and the Millennium Prize Problems. Wouldn't it be nice if we had the same for Linux? A bunch of hard problems with sexy names that would drive development forward? Sadly there is no easy source for tens of millions of euros in prize money, not to mention that the money would be very hard to distribute, as the work would, by necessity, be spread over a large group of people.

Thus it seems unlikely that this would work in practice, but that does not prevent us from stealing a different trick from the mathematicians' toolbox and pondering how it would work in theory. The list of problems will probably never exist, but let's assume that it does. What would it contain? Here's one example I came up with. It is left as an exercise to the reader to work out what prompted me to write this post.

The Memory Depletion Smoothness Property

When running the following sequence of steps:
  1. Check out the full source code for LLVM + Clang
  2. Configure it to compile Clang and Clang-tools-extra, use the Ninja backend and RelWithDebInfo build type, leave all other settings to their default values
  3. Start watching a video file with VLC or a browser
  4. Start compilation by running nice -19 ninja
The outcome must be that the video playback works without skipping a single frame or audio sample.
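The steps above can be sketched as shell commands. The repository URL and CMake flags follow LLVM's standard build instructions; treat the exact flag spellings as assumptions to verify against the current LLVM documentation.

```shell
# Steps 1-2: get the sources and configure a RelWithDebInfo Ninja build
# of Clang + clang-tools-extra, leaving everything else at its defaults.
git clone https://github.com/llvm/llvm-project.git
cd llvm-project
cmake -S llvm -B build -G Ninja \
      -DCMAKE_BUILD_TYPE=RelWithDebInfo \
      -DLLVM_ENABLE_PROJECTS="clang;clang-tools-extra"

# Step 3: start playing a video in VLC or a browser, then:

# Step 4: build at the lowest CPU priority.
cd build && nice -19 ninja
```

Note that `nice -19` here is the legacy spelling of `nice -n 19`, i.e. the lowest scheduling priority, not a negative niceness.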

What happens currently?

When Clang starts linking, each linker process takes up to 10 gigabytes of RAM. This leads to memory exhaustion, flushing of active memory to swap and eventually crashing of the linker processes. Before that happens, however, every other app freezes completely, and the entire desktop remains frozen until things get swapped back into memory, which can take tens of seconds. Interestingly, all browser tabs are killed before the linker processes start failing. This happens with both Firefox and Chromium.

What should happen instead?

The system should handle the problematic case more gracefully. The linker processes will still die, as there is not enough memory to run them all, but the UI should never noticeably freeze. For extra points the same should hold even if you run Ninja without nice.

The wrong solution

A knee-jerk reaction many people have is something along the lines of "you can solve this by limiting the number of linker processes by doing X". That is not the answer. It treats the symptoms but not the underlying cause, which is that bad input causes the scheduler to do the wrong thing. There are many other ways of triggering the same issue, for example by copying large files around. A proper solution would fix all of them in one go.
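For concreteness, the rejected workaround typically looks like this; `LLVM_PARALLEL_LINK_JOBS` is a real LLVM CMake cache option, and the value of 2 is illustrative.

```shell
# The commonly suggested workaround: cap the number of concurrent link
# jobs so only a couple of linkers ever run at once. This avoids the
# crash for this one build but leaves the underlying scheduling problem
# in place.
cmake -S llvm -B build -G Ninja \
      -DCMAKE_BUILD_TYPE=RelWithDebInfo \
      -DLLVM_ENABLE_PROJECTS="clang;clang-tools-extra" \
      -DLLVM_PARALLEL_LINK_JOBS=2
```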


  1. How much swap do you have? I've always thought swap on an interactive system should be sized by the time it takes to page it back into RAM. With regard to large IO processes, perhaps interactive processes should be ioniced above batch processes.

    1. The point of the article was that this should work out of the box with a default install of the distro. No fiddling. No tweaking. No editing of any config files. Nothing. Just things working out of the box.

    2. IMHO, "large swaps and interactivity" is an unsolvable problem. At some point, if you swap out a lot of stuff, you're in for a world of pain. But the installer (or a daemon) could handle this by choosing adequate sizing. The laptop I'm using now has a default install and it got a 16 GB swap file (I have 32 GB of RAM). I don't think that much swap can cause anything but problems.

      About ionicing: yes, you're right :)
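As a concrete sketch of the ionice idea above (manual tuning, not the out-of-the-box behaviour the post asks for): `ionice` comes from util-linux, and the idle class only takes full effect with IO schedulers that honour it, such as BFQ.

```shell
# Run the batch job in the idle IO class (-c 3) and at the lowest CPU
# priority, so interactive processes win contention for the disk and CPU.
ionice -c 3 nice -19 ninja
```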

  2. The good news is, there has been a lot of progress!

    Thanks to cgroups, we can do most of what is needed. We can prioritize one application over another, and we can prevent a task with a lot of processes from effectively fork-bombing the system.
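    A sketch of what such cgroup-based containment looks like using systemd's resource-control properties; the property names are real systemd settings, but the values are illustrative, and getting this applied automatically rather than by hand is the hard part.

```shell
# Run the build in its own transient cgroup scope: MemoryHigh throttles
# and reclaims the scope's memory before any hard OOM kill, while
# CPUWeight and IOWeight deprioritize it against the default weight of 100.
systemd-run --user --scope \
    -p MemoryHigh=8G -p CPUWeight=20 -p IOWeight=20 \
    nice -19 ninja
```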

    There are still some rough edges. For example, IO scheduling may not work because the setup is not supported (e.g. LUKS is problematic) and because the underlying performance properties of the disk are unknown. Unfortunately, we may need to distribute performance characteristics of disks to solve part of this.

    However, overall you should be seeing improvements in this area on some distributions already. Fedora already uses uresourced to better configure cgroups, and is now also switching to systemd-oomd for improved out-of-memory handling.

    More work will be needed, and we will hopefully start doing more tricks in the desktop itself (e.g. prioritizing the focused application). However, things should have improved a lot already, and work is underway for further improvements in the area.

  3. Or possibly the build systems should learn to monitor their children and back off (including killing some of them) when such things happen.

    1. That is again treating the symptoms, not the cause. There are dozens of ways to get into the same buggy state. Fixing the main issue makes all of them go away, including for programs that will only be written sometime in the future.

  4. Just a question (and hopefully food for thought): how is this currently handled in other major OSes? Do current Windows, macOS and Android (to the extent it can have heavy background, non-interactive load) manage these situations any better, at least when the bottleneck is just CPU, not also RAM?

    1. I don't know about Android, but at least Windows can be brought to a halt just by doing malloc in an eternal loop. I have not done testing on macOS, but it's fairly easy to get it lagging just with regular usage.

    2. Oh well, so it's not like Linux is the class dunce; it's (still) a widespread problem.
      I'm a bit disappointed that even macOS is so easily derailed; I'd expect it to just slow down a bit but still behave in some "constant" way. Actually, I can remember that some years ago I was able to make some Macs almost freeze simply by running "cat /dev/urandom" in the terminal, so it all makes sense.