ASRock Rack Ampere bundle

Is OpenBMC on the AsRock board supposed to correctly list the hardware inventory?

I do not see CPU, memory and power supply listed even though they are there.
And the system does not POST. Nothing happens after power on.
And yes, I do use RDIMMs although not QVL (4x Kingston KTD-PE432D8/32G)

I also have access to an X570D4U-2L2T.
That BMC (not OpenBMC) does not list any hardware either, yet it runs fine.

available now

also 128 core bundle $2,499

It lists the hardware inventory, but only after POSTing. From talking to you on Twitter it sounds like the board is unfortunately DOA so will need to be RMA’d. :frowning:

Is anyone else having issues with this Dynatron CPU fan control? We have one of these Dynatron heatsink and fan on our board but there seems to be no fan control at all? I mean, it either runs very slow (~0.5K RPM) or full blast (~13K RPM) when the CPU is under load, and it is super noisy at ~13K RPM.

1 Like

I’ve recently bought exactly that bundle and now I’m happily playing with it. A small and humble self-promotion (but mostly because it might be useful for some one): I wrote a first impression article about it.

But main points that I found interesting were:

  1. It came with BMC 02.05.00 and Firmware 2.04 out of the box (those are not available on ASRock website, as there are only two 1.x versions in stable and 02.06.00/2.05 in Beta section). So I’ve naturally upgraded to latest beta, and it seems it fixed few small problems that stock firmware had. BUt it is way less refined than what you expect of x86 server.

  2. Manual have typos. For example it mentions a debug console on port 2203, however in reality it is 2200. And it actually is pretty awesome as if you SSH to it when system is turned off it provides rather detailed boot log that otherwise you won’t be able to get.

  3. Forwarding services can get it for a small premium to EU or Switzerland (in my case), it is not free (in my case it was about 300$ of shipping costs and customs fees on top of the newegg price)

2 Likes

Before you put this machine into a rack - can you put some graphics card there and try desktop on it?

Without changing kernel of course.

1 Like

@hrw oh, long time no see (you might not remember me, but I used to bother you over Jabber about Zaurus kernel back in times of OpenEmbedded) :slight_smile:

I ill shove it into a rack probably next month or so, I still need a chassis that wouldn’t cost a fortune and I don’t have 1U radiator yet.

I have rx550, I’ve already quickly tried it in the system. As with other Ampere systems, before Linux boots, you gets no output on external graphics. At all.

By default (until you blocklist ast driver) all output goes to Aspeed integrated graphics even if external one plugged in.

(there should be few more screenshots, but as I’ve just registered I’m limited to only one, so they will follow later)

that is Debian stable, stock kernel (linux-image-6.1.0-20-arm64)

And as you can expect from such old kernel - it doesn’t boot with RDNA graphics cards (I have RX 5700).

I will later on try to update there to Debian testing or even Unstable and check the RDNA card.

2 Likes

Random 4k video from youtube:

(however I’m a bit limited by the resolution of my cheap USB-HDMI dongle here…)

2 Likes

And a rbdoom3 1.4.0 from the repos:

I’ll try withouth HDMI capture card on a proper screen after I get my RAM. I currently have only 1 stick as I don’t keep a lot of RDIMMs around the house :slight_smile:

2 Likes

You can enable ā€˜bookworm-backports’ repository. Then `apt install linux-image-arm64/bookworm-backports’ will give you 6.6 kernel.

Yeah, true. In that case I hoped to get overall newer version of packages :slight_smile: But that might not be worth it (I mean testing have 6.7, probably not a huge difference anyway).

And with backports kernel and RX 5700, I got this:

[    1.896399] [drm] Found VCN firmware Version ENC: 1.20 DEC: 5 VEP: 0 Revision: 1
[    1.896408] amdgpu 0000:03:00.0: amdgpu: Will use PSP to load VCN firmware
[    1.951408] [drm] reserve 0x900000 from 0x81fd000000 for PSP TMR
[    1.993248] amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not available
[    1.999081] amdgpu 0000:03:00.0: amdgpu: RAP: optional rap ta ucode is not available
[    1.999082] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
[    1.999138] amdgpu 0000:03:00.0: amdgpu: use vbios provided pptable
[    1.999140] amdgpu 0000:03:00.0: amdgpu: smc_dpm_info table revision(format.content): 4.5
[    2.034704] amdgpu 0000:03:00.0: amdgpu: SMU is initialized successfully!
[    2.034879] [drm] Display Core v3.2.247 initialized on DCN 2.0
[    2.034881] [drm] DP-HDMI FRL PCON supported
[    2.064209] [drm] kiq ring mec 2 pipe 1 q 0
[    2.247100] amdgpu 0000:03:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_0.2.1.0 test failed (-110)
[    2.257810] [drm:amdgpu_gfx_enable_kcq [amdgpu]] *ERROR* KCQ enable failed
[    2.264949] [drm:amdgpu_device_init [amdgpu]] *ERROR* hw_init of IP block <gfx_v10_0> failed -110
[    2.274077] amdgpu 0000:03:00.0: amdgpu: amdgpu_device_ip_init failed
[    2.280505] amdgpu 0000:03:00.0: amdgpu: Fatal error during GPU init
[    2.286980] amdgpu 0000:03:00.0: amdgpu: amdgpu: finishing device.

I’ll have a look if disabling ASPM would solve that.

1 Like

@hrw well, with Debian 6.6 kernel, RDNA gpu doesn’t work. Older Polaris though works just fine (I’ve tried rx550).

Even if I disable ASPM it ends up with the same error during initialization. I guess root cause here is that ERRATA that needs to be patched. And unless debian or ubuntu is willing to integrate that or unless some one want to upstream the patch that Altra did - yeah, you’ll need to patch the kernel :frowning: or rely on some one from the community to do that for you and upload it to the repo.

1 Like

Thanks a lot @Civiloid - you just saved me from spending 3k€ for AArch64 desktop.

Now have to wait another 3 years to let AmpereOne systems reach similar pricing. In meantime will make use of remote Altra systems.

Yeah, I’m still sad to see such small mistakes plaguing half of the ARM boards available and that still if you want things to just work - x86 is the only viable option.

Yeah, if you don’t need modern GPU it seems fine (at least so far). I might have more problems with the platform later though.

And yeah, I hate that so many years has passed and ARM ecosystem is still have such problems and that platform developer doesn’t work towards fixing that upstream, like they should. That basically hinders ARM adoption as there is no way that developer can spend reasonable-ish amount of money (1-2k $) and get a machine that is suitable for everyday use at home.

1 Like

Friend pointed me to 7800XT amdgpu initialisation fails on reboot with kernel 6.7 (#3062) Ā· Issues Ā· drm / amd Ā· GitLab showing similar issue on x86-64 (with RX 7800 card).

Maybe 6.7.12 from Debian ā€˜testing’ will work better for you.

1 Like

I will try to apply workaround patches on top 6.6, if that won’t work it would be probably easier for me to just grab vanilla kernel and build it with whatever patches I want. As I’m not a huge fan of debian kernel package build system…

Going up step by step: With kernel 6.7 from debian-testing, amdgpu fails at exactly same point during initialisation.

I will try latest vanilla kernel next

With 6.9.0@master I managed to get rx5700 initialized once, but then it reverted back to the same ā€œring kiq_0.2.1.0 test failedā€ problem. I even thought it was promising that board would work with RDNA out of the box on newest kernels, but I guess it is just not a 100% chance of getting an error.

With a patch ported to 6.9.0 (one const was renamed and option should depend on 52-bit VA space) it seems to work reliably. I haven’t done any more tests yet.

3 Likes

I also got an Intel ARC 750 to try and what can I say…

Card doesn’t fit into the third slot (and obviously doesn’t fit into bottom one), battery is in the way. Probably that can be solved by using a raiser cable, however it fits into second (PCIe x8 on Altra).

About driver, ARC 750 is not supported by Xe kernel module (officially) so I needed to add:

xe.force_probe='56a1' and i915.force_probe='!56a1'

to the kernel (kernel 6.9.0@master from 2 days ago with PCIe patches)

but it then fails to initialize:

[   57.555741] xe 0004:04:00.0: [drm] Using GuC firmware from i915/dg2_guc_70.bin version 70.20.0
[   57.581832] xe 0004:04:00.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=none
[   57.581848] Unable to handle kernel paging request at virtual address ffffffffc08003cc
[   57.589768] Mem abort info:
[   57.592561]   ESR = 0x0000000096000006
[   57.596305]   EC = 0x25: DABT (current EL), IL = 32 bits
[   57.601615]   SET = 0, FnV = 0
[   57.604663]   EA = 0, S1PTW = 0
[   57.607794]   FSC = 0x06: level 2 translation fault
[   57.612668] Data abort info:
[   57.615538]   ISV = 0, ISS = 0x00000006, ISS2 = 0x00000000
[   57.621026]   CM = 0, WnR = 0, TnD = 0, TagAccess = 0
[   57.626075]   GCS = 0, Overlay = 0, DirtyBit = 0, Xs = 0
[   57.631384] swapper pgtable: 4k pages, 48-bit VAs, pgdp=00000801c6b7d000
[   57.638083] [ffffffffc08003cc] pgd=1800081fffcc6003, p4d=00000801c760a003, pud=00000801c760b003, pmd=0000000000000000
[   57.648696] Internal error: Oops: 0000000096000006 [#1] SMP
[   57.654259] Modules linked in: xe(+) snd_seq_dummy snd_hrtimer snd_seq snd_seq_device qrtr snd_hda_intel snd_intel_dspcfg snd_hda_codec snd_hda_core aes_ce_blk aes_ce_cipher polyval_ce snd_hwdep polyval_generic snd_pcm ghash_ce gf128mul snd_timer acpi_ipmi snd sha2_ce soundcore sha256_arm64 ipmi_ssif sbsa_gwdt sha1_ce arm_spe_pmu ipmi_devintf binfmt_misc arm_cmn ipmi_msghandler xgene_hwmon nls_ascii nls_cp437 vfat fat arm_dsu_pmu cppc_cpufreq joydev acpi_tad evdev dm_mod loop efi_pstore dax configfs nfnetlink efivarfs ip_tables x_tables autofs4 ext4 crc32c_generic crc16 cdc_ether mbcache usbnet jbd2 mii hid_generic usbhid hid drm_gpuvm drm_exec drm_buddy gpu_sched nvme video drm_suballoc_helper drm_ttm_helper ttm cec nvme_core rc_core ixgbe t10_pi drm_display_helper xhci_pci xfrm_algo mdio_devres xhci_hcd drm_kms_helper of_mdio crc64_rocksoft fixed_phy crc64 fwnode_mdio crc_t10dif drm usbcore igb crct10dif_generic libphy crct10dif_ce crct10dif_common mdio usb_common i2c_algo_bit i2c_designware_platform
[   57.654393]  i2c_designware_core [last unloaded: xe]
[   57.748068] CPU: 0 PID: 7 Comm: kworker/0:0 Tainted: G     U             6.9.0-rc6+ #4
[   57.755975] Hardware name:  ALTRAD8UD-1L2T/ALTRAD8UD-1L2T, BIOS 2.05 04/12/2024
[   57.763272] Workqueue: events work_for_cpu_fn
[   57.767623] pstate: 80400009 (Nzcv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--)
[   57.774575] pc : logic_inb+0xa0/0xe0
[   57.778145] lr : intel_vga_reset_io_mem+0x38/0x68 [xe]
[   57.783547] sp : ffff8000802dbaa0
[   57.786850] x29: ffff8000802dbaa0 x28: ffffa9bdd90e9928 x27: 0000000000000000
[   57.793978] x26: 0000000000000001 x25: 0000000000000002 x24: ffff07ff82cfb0c8
[   57.801105] x23: ffffa9bdd90f6c38 x22: ffff07ffa8901000 x21: 0000000000000000
[   57.808232] x20: 0000000000000000 x19: ffff07ff82cfb000 x18: ffffffffffffffff
[   57.815358] x17: 2c6d656d2b6f693d x16: ffffa9be0b0be3c8 x15: 6c6f203a6465676e
[   57.822484] x14: 616863207365646f x13: 656e6f6e3d736e77 x12: ffff081eef6e0000
[   57.829610] x11: 0000000000000001 x10: 0000000000000001 x9 : ffffa9be0aa2702c
[   57.836737] x8 : c0000000ffffbfff x7 : ffffa9be0ca1c120 x6 : 00000000000000ff
[   57.843863] x5 : ffffa9be0b659f10 x4 : 000000000000000a x3 : 0000000000000000
[   57.850989] x2 : 0000000000ffbffe x1 : 00000000000003cc x0 : ffffffffc08003cc
[   57.858116] Call trace:
[   57.860552]  logic_inb+0xa0/0xe0
[   57.863772]  hsw_power_well_enable+0x198/0x288 [xe]
[   57.868900]  intel_power_well_enable+0x74/0x98 [xe]
[   57.874019]  intel_power_well_get+0x2c/0x40 [xe]
[   57.878873]  __intel_display_power_get_domain.part.0+0x7c/0xd0 [xe]
[   57.885375]  intel_display_power_get+0x5c/0x98 [xe]
[   57.890487]  intel_power_domains_init_hw+0x64/0x320 [xe]
[   57.896031]  intel_display_driver_probe_noirq+0xa0/0x1f8 [xe]
[   57.902008]  xe_display_init_noirq+0x58/0x90 [xe]
[   57.906945]  xe_device_probe+0x248/0x4e8 [xe]
[   57.911538]  xe_pci_probe+0x5d8/0x918 [xe]
[   57.915866]  local_pci_probe+0x48/0xb8
[   57.919610]  work_for_cpu_fn+0x24/0x40
[   57.923349]  process_one_work+0x18c/0x400
[   57.927350]  worker_thread+0x204/0x420
[   57.931090]  kthread+0xe8/0xf8
[   57.934135]  ret_from_fork+0x10/0x20
[   57.937705] Code: d65f03c0 929fffe0 f2b81000 8b000020 (39400000)
[   57.943787] ---[ end trace 0000000000000000 ]---
1 Like