Processador ARM Reference Cores

muddymind

1st Folding then Sex
Há imensos tópicos espalhados com notícias de cores ARM sem grande arrumação por isso resolvi criar uma thread dedicada aos cores de referência da arm para começar a agregar todas as notícias num único local.

Arm Unveils Next-Gen Flagship Core: Cortex-X3

Arm Introduces The Cortex-A715

Arm Refreshes The Cortex-A510, Squeezes Higher Efficiency

A maior mudança foi a eliminação de suporte para aplicações 32-bit. Honestamente desilude não ver uma evolução maior sendo o pior caso o A510 que consome demasiado para a performance que oferece. A ARM tem mesmo que colocar o pé no acelerador que assim nunca irá competir no topo.
 
👍

Resumo:



Parece-me que o mais interessante mesmo é o 715



cortex-a715-perf.png
 
Arm Unveils 2023 Mobile CPU Core Designs: Cortex-X4, A720, and A520 - the Armv9.2 Family
Timed to align with this year's Computex trade show in Taiwan, Arm is showing off a new set of Cortex-A and Cortex-X series CPU cores – as well as a new generation of GPU designs – which we'll see carrying the torch for Arm starting later this year and into 2024. These include the flagship Cortex-X4 core, as well as Arm's mid-core Cortex-A720. and the new little-core Cortex-A520.
For their 2023 IP, Arm is rolling out a wave of minor microarchitectural improvements through its Cortex line of cores with subtle changes designed to push efficiency and performance throughout, all the while moving entirely to the AArch64 64-bit instruction set.
Arm's focus on phasing out the 32-bit instruction set has been one it has been working towards for several years. For their latest TCS23, they have finally created a fully 64-bit cluster to capitalize on the benefit of a complete 64-bit mobile ecosystem, excising AArch32 (32-bit instruction) support entirely.. So whether it's a big, mid, or little core, for Arm's latest generation of IP there is only AArch64.
Arm%20Client%20Tech%20Days%20CPU%20Presentation_Final-12.png


Arm%20Client%20Tech%20Days%20Total%20Compute%20Solutions%202023%20Presentation_Final-16.png

Arm's latest DynamIQ Shared Unit, DSU-120, offers support for up to 14 CPU cores in a cluster, which opens the door to a significant number of different CPU core combinations. We'll see what SoC vendors have opted for later this year, but one probably configuration is a 1+5+2 (X4+720+520), which is likely a configuration for a high-end smartphone. Compared to a last-generation 1+3+4 cluster (X3+715+510), Arm is claiming an uplift of 27% in compute performance within GeekBench 6 MT and a more considerable uplift of between 33% and 64% in the Speedometer 2.1 benchmark depending on software optimizations implemented.

Arm%20Client%20Tech%20Days%20CPU%20Presentation_Final-10.png


Arm%20Client%20Tech%20Days%20CPU%20Presentation_Final-09.png

Finally, looking at performance, Arm claims that their latest generation CPU and core complex architecture has made solid gains in power efficiency. At iso-performance, Cortex-X4 offers upwards of a 40% reduction in power consumption versus Cortex-X3, while Cortex-A720 and A520 save 20-22% over their respective predecessors. On the DSU-120 hub itself, Arm claims an 18% improvement in power efficiency.

Arm Cortex-X4: Fastest Arm Core Ever Built (Again)​

Arm%20Client%20Tech%20Days%20CPU%20Presentation_Final-17.png

The Cortex-X4 is Arm's highest-performing core to date, featuring an anticipated core clock speed of 3.4 GHz and an increased L2 cache per core, doubling its capacity to 2 MB compared to last year's 1 MB Cortex-X3 . Despite these enhancements, Arm has managed to maintain a minimal increase in the physical size of the core, with the more complex X4 CPU core coming in at under a 10% die size increase (the additional L2 cache excluded).
The TSC23 core cluster now supports up to 14 cores which adds a level of flexibility for SoC vendors to implement into their latest designs. Perhaps one of the biggest changes is support for up to 32 MB of shared L3 cache within the TSC23 core cluster. The levels of L3 cache implemented is of course down to the SoC manufacturer, but the maximum levels that can be offered is 32 MB, which allows increased support for higher-end mobile devices such as tablets and notebooks, where applicable.
Arm%20Client%20Tech%20Days%20CPU%20Presentation_Final-07.png

The maximum number of cores across the entire TSC23 core cluster stretches to 14 in total, with a mixture of big and little cores, with multiple avenues for SoC vendors to explore to capitalize on things like performance gains and efficiency. All of this flexibility is given to the SoC vendors to design their own variations depending on the level of the device. So a flagship mobile device will leverage different combinations of Cortex-X4, Cortex-A720, and Cortex-A520 depending on multiple factors such as cost, power budget, and expected performance levels.


Cortex-A720: Middle Core, Big on Efficiency​

Arm%20Client%20Tech%20Days%20CPU%20Presentation_Final-25.png


Similar to the Cortex-X4 in that the Cortex-A720 is built around the Armv9.2 ISA, Arm has optimized its design to enable the A720 to deliver more performance within the same power budget compared to the Cortex A715. The Arm 700-series family typically covers a much broader range of applications and caters to various markets, including, and not limited to, digital TVs (DTV), smartphones, and laptops.


Cortex-A520: LITTLE Core with Big Improvements​

Arm%20Client%20Tech%20Days%20CPU%20Presentation_Final-32.png

Arm's smallest core for this generation is a new core in effect, but it is still more of a refinement of the Cortex-A510 than a completely new design. It has the lowest power-to-area ratio of all three announced Cortex Armv9.2 cores. The most significant differences come through optimizations on power, with Arm claiming that the Cortex-A520 is 22% more energy efficient than the previous Cortex-A510 core at iso-process and iso-frequency. The little core in Arm's TCS23 catalog is primarily designed for performing low-intensity and background operational tasks, which takes these loads off bigger cores such as the Cortex-A720/Cortex-X4 to allow better power efficiency overall within the cluster.


New DSU-120: More L3 Cache, Doubling Down on Efficiency​

Arm%20Client%20Tech%20Days%20CPU%20Presentation_Final-06.png

Building a more refined DSU instead of another ground-up design, Arm has made plenty of inroads to improving overall scalability, efficiency, and performance with its DSU-120. Some of the most notable improvements include support for up to 14 CPU cores in a single cluster, which allows SoC vendors to pick and choose their core cluster configurations to suit the device going to market. Arm has also improved its Power and Performance Area (PPA) by implementing new power-saving modes, including RAM and Slicing power-downs, which work in stages depending on the type of workload and the intensity to reduce the overall power footprint of the cluster.
Arm%20Client%20Tech%20Days%20CPU%20Presentation_Final-44.png

Perhaps the most significant change to DSU-120 from DSU-110 is that Arm has effectively doubled the total amount of shared L3 cache a cluster can implement.
Arm%20Client%20Tech%20Days%20CPU%20Presentation_Final-40.png

https://www.anandtech.com/show/1887...cture-cortex-x4-a720-and-a520-64bit-exclusive
 
Back
Topo