Right now, you can't get more performance than the TH1520 (C910 cores) and JH7110 (U74 cores) at ANY price. The only thing you can do is get more of the same cores i.e. SG2042 with 64x C910 cores.
The same goes for emulation. QEMU on my i9-13900HX laptop is about the same speed on each core, but there I have 24 cores (32 threads).
I just compiled gcc 14 natively on both LicheePi 4A (the same as this hosting) and docker/QEMU:
i9-13900HX (8 P + 16 E cores) docker running riscv64/ubuntu image
real 101m44.472s
user 1695m26.395s
sys 24m36.671s
LicheePi 4A (TH1520 SoC, 4x C910 cores)
real 422m41.367s
user 1430m56.638s
sys 70m17.994s
The LicheePi actually used less CPU time, but the i9 won with more cores.
Thanks for the clarification, you're completely right. I didn't realize the SG2042 used the same core. So, Scaleway's offer really does seem to make sense!
Arguably OpenXiangShan should be more powerful, but verilog simulation isn't really that usable in practice, and it has no working vector support yet. (the dev branch currently hangs after a few minutes of simulating)
The same goes for emulation. QEMU on my i9-13900HX laptop is about the same speed on each core, but there I have 24 cores (32 threads).
I just compiled gcc 14 natively on both LicheePi 4A (the same as this hosting) and docker/QEMU:
i9-13900HX (8 P + 16 E cores) docker running riscv64/ubuntu image
LicheePi 4A (TH1520 SoC, 4x C910 cores) The LicheePi actually used less CPU time, but the i9 won with more cores.