January 4, 2024

Atop version 2.10.0

May 13, 2023

Atop version 2.9.0

January 7, 2023

Atop version 2.8.1

December 29, 2022

Atop version 2.8.0

January 8, 2022

Atop version 2.7.1

December 18, 2021

Atop version 2.7.0

Training "Linux Performance Analysis and Tuning"

Why are the load averages high, while there is hardly CPU utilization? How can I discover processes with a memory leakage? How can I avoid that they harm the overall system performance? Can poor response times be caused by zombie processes? What is the current size of the dynamically scaling page cache? Can I avoid aggressive behavior of my page cache? What is the real impact when I modify the nice value of a process? How can I avoid that all bandwidth of my network connection is used for bulk transfer at the expense of the interactive transfers? Which process will be killed when my system runs out of memory and swap space? How can I protect my critical processes against out-of-memory killing? How can I avoid that batch processes consume too much disk bandwidth at the expense of my interactive processes? How can counters be misleading, resulting in wrong interpretation and useless improvements? What are CPU scheduling policies? And what are scheduler classes? How can I run batch processes that only consume the leftovers of the processor capacity? Why is the kernel swapping out pages, while there is sufficient memory free? How can I determine what exactly a process (thread) is waiting for when it doesn't show any progress? What are NUMA memory policies?

As a Linux system administrator, lots of performance-related questions may cross your mind. These questions (and many more) will be answered in the training Linux Performance Analysis and Tuning. The training is lectured by Gerlof Langeveld, the creator/maintainer of atop, atopsar and the kernel module netatop. Gerlof is trainer/consultant at AT Computing, a company that provides professional Linux training and consultancy. As a consultant, he has a lot of experience with performance analysis and eliminating bottlenecks.

The training provides an insight into the methods and techniques to measure the performance of a Linux system, to interpret the results, and to implement improvements. It is targeted at Linux system administrators and takes 5 days.
During the training, the hardware components are discussed that might be overloaded and are most likely involved in the loss of performance: CPUs, disks, memory and network connections. In the theoretical lectures, you will learn how the Linux kernel deals with the capacity of these hardware components (CPU scheduling, disk scheduling, caching, memory management, swapping, and many other kernel mechanisms) and how this background knowledge correlates to the counters shown by atop and other relevant measurement tools. Furthermore, the training covers mechanisms like control groups that can be used to reallocate the bandwidth of these hardware components (see detailed technical content of the training).
The documentation consists of slides with accompanying text (sample chapter) and practical lab exercises.

This training is occasionally scheduled in one of the class rooms of AT Computing in Nieuwegein (near Utrecht), The Netherlands for individual registration (English spoken with English documentation).
However, it is also possible to organize the training as an on-site training at your location or as a 'live' virtual class (English spoken with English documentation).