Performance Variability on Ampere Altra Under Different Kernel Versions

volekay · February 21, 2025, 7:16am

Hello

I’ve been running workloads on an Ampere Altra system & noticed significant performance variability when switching between different Linux kernel versions. While some versions provide stable & predictable performance, others exhibit unexpected slowdowns or inconsistencies; particularly in multi-threaded workloads.

This is especially noticeable in CPU-bound applications where task scheduling & NUMA optimizations play a crucial role.

I suspect that certain kernel optimizations or regressions might be affecting how the Ampere Altra handles scheduling, memory management / power efficiency.

I’ve tried adjusting CPU governor settings and tuning kernel parameters, but the differences persist. It would be helpful to know if there are specific kernel versions that are best optimized for Ampere Altra / if any patches have addressed similar issues. Checked General Discussion - Ampere arm64 Developer Community MLOps guide related to this and found it quite informative.

Has anyone else observed similar performance differences? Are there recommended kernel configurations, patches, or specific distributions that provide the best experience for Ampere-based workloads? Any insights into tuning the kernel for better performance on Ampere Altra would be greatly appreciated.

Thank you !!

hrw · February 22, 2025, 11:16am

Have you considered sharing which kernel/distro versions you found fast/slow?

Maybe someone can then tell ‘ah, they do not have XYZ enabled’ or something?

vikingforties · February 24, 2025, 5:33pm

It’s hard to be too specific without data. Generally kernel 6.2 and on with GCC 11 or later.
CCIX between two sockets is not the fastest interconnect so I always recommend that users pin their workloads/containers/VMs to cores.

Home · HewlettPackard/LinuxKI Wiki · GitHub LinuxKI is a great tool for getting root causes if you have the time to delve into its capabilities.

quocbao · March 4, 2025, 10:21am

Performance differences between kernel versions happen with both ARM-based systems and x86 systems, such as the random number generator throughput.

There are also tons of optimizations paired with specific kinds of workloads. I suggest you go deeper into a specific workload, for example, AI model inference,…

joneill · May 6, 2025, 8:35pm

Does your workload have performance variability run-to-run using the identical configuration? Many multi-threaded applications have run-to-run variations, so that’s worth exploring before changing kernel and other system components.

This webpage lists what versions of Linux are supported and may or may not be useful,
Working with Open Source. For Enterprise Linux distros, it lists specific versions which would come with specific kernel version. I’d start by testing a kernel version that’s used in Enterprise Linux that’s as close to the version you want to use, and see if that helps.

I’m a software performance engineer and I personally run applications with perf stat and compare run-to-run performance. I will run perf record/perf report to look more in depth to see what exactly is running in different runs/configurations.

Hope this helps and let us know if it does or doesn’t! thanks,
john

Topic		Replies	Views
Could Someone Give me Advice for Optimizing Performance on Ampere Altra Systems? General Discussion oci , ampere	6	182	August 19, 2024
Weekend Read: The First 10 Questions to Answer While Running on Ampere Altra-Based Instances Content and Articles ampere , devto	2	441	May 30, 2023
Unstable benchmark results General Discussion oci , ampere	13	566	July 26, 2024
XZ compression performance on Ampere Altra General Discussion	10	125	May 9, 2025
Same HW, more fun News and Announcements performance , kernel	5	73	May 19, 2025

Performance Variability on Ampere Altra Under Different Kernel Versions

Related topics