Nuvia: don’t hold your breath

Artemis · Nov 28, 2024

Clock speeds are still the most reliable gains to get and I’m a huge fan of Apple increasing them mostly steadily for instance or Arm, with node gains eating them. But like, this is such a 2017 PC gamer argument about clocks? The performance is what we care about. Fatter wider cores with better prediction and broad IPC in integer code but at slightly slower clocks in general are going to be better than slightly less robust ones with more frequency in real code if anything — consumer code is notoriously branchy, sloppy, etc.

I would take an M2 Avalanche Core at 3.5GHz for instance over Lion Cove at 5.1 without hesitation if we wanted to get super technical about who comes out on top in some mass assay of real applications.

dada_dave · Nov 28, 2024

Artemis said:
IMO the Linux issue wasn’t IPC so much as the power floor which riled me up because it jaded the curve more than anything else. I don’t think IPC even changed or aggregate performance really by much

I dunno actually comparing the two charts it does seem like Qualcomm's SpecInt performance improved, not just its power. Previously the 258V was getting higher performance than the Snapdragon and now it isn't. And yeah of course the power curve looks significantly better too. One of the things I will say for MS, WSL is supposedly *excellent*. I've never used it myself, but lots of people who have give it really high marks saying it has exceptionally low overhead, practically bare metal performance. So that's probably closer to its "real" performance, especially with respect to power, than the earlier charts despite those being "native" Linux.

Artemis · Nov 28, 2024

dada_dave said:
I dunno actually comparing the two charts it does seem like Qualcomm's SpecInt performance improved, not just its power. Previously the 258V was getting higher performance than the Snapdragon and now it isn't. And the curve looks significantly better too. One of the things I will say for MS, WSL is supposedly *excellent*. I've never used it myself, but lots of people who have give it really high marks with exceptionally low overhead. So that's probably closer to its "real" performance, especially with respect to power, than the earlier charts despite those being "native".

I guess what I meant was the curve is the much bigger noticeable thing to me below 8W but fair yeah

Artemis · Nov 28, 2024

WSL is great except for IO/networking. Everything else good

The Flame · Nov 29, 2024

Artemis said:
View attachment 32942
I really cannot stand PC DIY morons, holy hell.

Qualcomm will probably deliver 5 GHz next gen, so they can shut up.

KingOfPain · Nov 29, 2024

dada_dave said:
Yeah his analysis was similar to mine but way more brutal and frankly exasperated at the end, especially by the compatibility issues. Apple really did themselves a favor by culling 32bit apps a long time ago on both iOS and macOS to motivate the switch before transitioning to everything to ARM. The 32bit translation performance must've been what Charlie Demarjian was referring to when he was talking about the performance being dog shit 2000s Intel Core-level under certain circumstances (and he was blaming MS not Qualcomm).

I only wish, Apple would have made the Intel transition maybe a year later, then they would have skipped x86-32 completely.
I understand why they dropped the 32-bit API, but unfortuately, there is quite some legacy code that was never compiled for x86-64 and therefore doesn't run on current macOS (there might be other problems like deprecated API anyway).

I have to admit that I didn't do any performance tests with Windows on ARM, but I didn't get the feeling that the x86 to ARM64 translation was that bad.
Until GPTK was integrated into CrossOver, games certainly were more performant under Windows on ARM.

Artemis · Dec 3, 2024

The Flame · Dec 7, 2024

Looks like Qualcomm's roadmap is to follow Apple and creat SoCs with huge integrated GPUs.

Artemis · Dec 7, 2024

The Flame said:
View attachment 33085
Looks like Qualcomm's roadmap is to follow Apple and creat SoCs with huge integrated GPUs.

I think this is a given to some degree, that said I was surprised Qualcomm is heading for a 192-bit bus with LPDDR5 on the next gen already for one product — while the 6+6 core model is said to be 128-bit. I expected them to keep it at 128-bit with high speed and just fix the GPU outright and make it reasonably sized which which would be more than enough for most, but the rumors point to both that and a separate larger SKU with 50% more cores, or 100% more big cores, and then a much bigger GPU and 50% bigger bus.

**(on LPDDR6, that would be more like a 224 or 256-bit because of signal integrity/ECC & other profile metadata capabilities, at least at the same data rate, and I am not sure LPDDR6 with the same datarate as their 192-bit LPDDR5 solution makes sense with that in mind. So I could see LPDDR6 going to more like 256-bit for that product, at least if they don’t use LPCAMM)

FWIW I doubt we will see Mx Max and up size stuff from QC but they will clearly try to compete with the Pro lineup.

The Flame · Dec 10, 2024

The Flame said:
View attachment 33085
Looks like Qualcomm's roadmap is to follow Apple and creat SoCs with huge integrated GPUs.

However... for this to be viable, they need to significantly evolve their GPU architecture like Apple has been doing in the past 4 years.

Adreno 8 is a step in the right direction, but there is still a lot of work of to do.

This explains why they are being conservative and going for a 192 bit memory bus for the top tier part in Snapdragon X2.

The Flame · Dec 12, 2024

WO2023163863A1 - Dynamic wave pairing - Google Patents

This disclosure provides systems, devices, apparatus, and methods, including computer programs encoded on storage media, for dynamic wave pairing. A graphics processor may allocate one or more GPU workloads to one or more wave slots of a plurality of wave slots. The graphics processor may select...

patents.google.com

This is a Qualcomm patent for a GPU technology called 'Dynamic Wave Pairing'. According to a leak, this technology is deployed in the Adreno 830 GPU found in the Snapdragon 8 Elite.

Cmaier · Dec 12, 2024

The Flame said:
WO2023163863A1 - Dynamic wave pairing - Google Patents

This disclosure provides systems, devices, apparatus, and methods, including computer programs encoded on storage media, for dynamic wave pairing. A graphics processor may allocate one or more GPU workloads to one or more wave slots of a plurality of wave slots. The graphics processor may select...

patents.google.com

This is a Qualcomm patent for a GPU technology called 'Dynamic Wave Pairing'. According to a leak, this technology is deployed in the Adreno 830 GPU found in the Snapdragon 8 Elite.

That ain’t a patent, but this is the patent on the same thing:

https://patentimages.storage.googleapis.com/3f/2b/d3/65caf5f8e5bed3/US11954758.pdf

The Flame · Dec 13, 2024

If you don't mind, can you explain how DWP works, how it might improve performance and any comparisons to similar technologies from other GPU vendors?

leman · Dec 13, 2024

The Flame said:
If you don't mind, can you explain how DWP works, how it might improve performance and any comparisons to similar technologies from other GPU vendors?

While I find this particular patent very difficult to read, it seems to me that it is about packing multiple waves (shader execution fronts) into a single hardware SIMD. Qualcomms GPUs are very wide and have different SIMD widths for FP32 and FP16. The ability to combine waves like that could be very beneficial for hardware utilization on this type of GPU. I am not aware of similar functionality on other GPUs, probably because they don’t really need it. I think Nvidia can reshuffle threads to improve hardware utilization, not quite sure.

The Flame · Dec 15, 2024

ARM vs Qualcomm trial begins tommorow.

There is a great post on r/hardware about it;

Yoused · Dec 15, 2024

I am curious how far ahead of ARM the Nuvia design is. On Geekbench, the SC score for a device with a Cortex X925 is somewhere in the neighborhood of a Raptor Lake i5. Just throwing something together with the latest ARM-designed cores seems like it would be reasonably competitive with the Elite line.

Cmaier · Dec 15, 2024

The Flame said:
ARM vs Qualcomm trial begins tommorow.

There is a great post on r/hardware about it;

i had a bench trial with that judge. She’s also the judge that blew up the Hunter Biden plea deal. She cuts through nonsense really well.

jbailey · Dec 16, 2024

The Flame said:
ARM vs Qualcomm trial begins tommorow.

There is a great post on r/hardware about it;

Removed by the moderators for some reason.

The Flame · Dec 16, 2024

jbailey said:
Removed by the moderators for some reason.

The pinned comment by u/hardware-ModTeam says this:

Thank you for your submission! Unfortunately, your submission has been removed for the following reason:

Please do not post Straw poll links. Remove it and resubmit.

Which is honestly a ridiculous reason!

It is true that the OP did include a StrawPoll link in his post (the poll asked as to what do you think will be the outcome of the trial). r/hardware does not allow posting polls, so I agree the OP was in the wrong here, as what he did was circumvention of that rule.

So the Mods could have done one of the following:

1. Send a notice to the OP requesting them to remove the Strawpoll link from the post

or

2. Use the moderation tools to edit out the strawpoll link from the post.

Instead they went for the nuclear option (Delete Post).

It was a text post submitted directly to r/hardware, very long (2000+ words), thoroughly researched (links to dozens of court documents), and excellently formatted.

They asked the OP to resubmit the post without the strawpoll, and it remains to be seen if the OP will do so.

If I was the OP, I wouldn't acquiesce. What the mods did was disrespectful and showed utter disregard for the OP's hard work. As I mentioned above, it was a very high quality post, and if the OP doesn't have a saved copy, it would be a pain to recreate it from scratch.

Rulebreaking should be punished, but leniency should also be shown towards high quality posts/posters. The OP is known for making high quality posts, such as this one (which won an award and ended up ratio-ing an Ian Cutress video).

The Flame · Dec 16, 2024

OP has reposted it:

Nuvia: don’t hold your breath

Site Champ

Elite Member

Site Champ

Site Champ

Power User

Site Champ

Site Champ

Power User

Site Champ

Power User

Power User

Site Master

Power User

Site Champ

Power User

up

Site Master

Power User

Power User

Power User