New Mali-T760 is ARM’s fastest GPU yet, 400% better energy efficiency

October 29, 2013
47 31 21

Mali-T720 and T760 graphARM has unveiled two new members of its Mali family of GPUs. The new units are the Mali-T720 and Mali-T760. The T720 is a mid-range GPU and is seen as the successor to the very popular Mali-400 MP and 450 MP GPUs, while the Mali-T760 is ARM’s new flagship GPU and boasts a 400% increase in energy efficiency when compared to the Mali-T604. At the moment around 50 percent of all Android tablets use SoCs with Mali GPUs which makes this new announcement by ARM significant in terms of the graphic processing power of tablets in the future. However it isn’t only tablets that use Mali GPUs, around 20 percent of Android smartphones have a Mali GPU and the new Samsung Galaxy Note 3 features the Mali-T628 with 6 shader cores.

ARM’s previous mid-range GPU’s were based on its now ageing Utgard Architecture which can be found in the Mali-300, 400 MP and 450. Starting with the Mali-T604 ARM switched to a new GPU architecture known as Midgard for its high-end GPUs. ARM has now brought this new architecture to the mid-range with the Mali-T720.

Mali-T720 block diagram

Mali-T720

The new Mali-T720, which can be used with ARM’s current Cortex-A7 and A12 cores as well as the upcoming 64-bit Cortex-A53 core, can run at a maximum clock speed of 600MHz and has a peak performance of 81.6 GLOPS with a pixel fillrate of 4.8 Gpix/sec. To put those numbers into some context, the Mali-400 MP configuration found in the Exynos 4412 peaks at 19.2 GFLOPS and a has theoretical pixel fillrate of 1.6 Gpix/sec.

Although the Mali-T720 offers better performance, ARM has managed to reduce the energy usage and the die size of the GPU. According to ARM the Mali-T720 has a “150% energy efficiency increase over previous cost optimized GPUs” and has a “30% area reduction from previous Midgard generations.” This is all possible because the Mali-T720 is designed using new small chip manufacturing processes (28nm) which has also allowed ARM to squeeze in up to 8 shader cores. The Mali-T720 has been optimized especially for Android and offers support for OpenGL ES 3.0 and RenderScript.

Mali-T760 block diagram

Mali-T760

At the other end of the scale, ARM has released details of the Mali-T760 which now becomes its de-facto flagship GPU. The new device scales up to 16 shader cores (giving 326 GFLOPs) while boasting a 400% increase in energy efficiency over the ARM Mali-T604.

The simplicity and feature rich design of the Mali-T760 GPU enables Rockchip to quickly bring new SoCs to market with the latest Android advantages in a wide range of mobile devices. -- Feng Chen, chief marketing officer, Rockchip.

One of the tricks in the Mali-T760’s architecture is the use of bandwidth reduction techniques which minimizes the amount of data shifted around and hence reduces the amount of power used by the GPU. Such techniques include ARM Frame Buffer Compression (AFBC), which compresses the data as it is passed from one part of the SoC to another, and Smart Composition which only renders the parts of the frame which have changed. For example when playing Angry Birds the whole frame doesn’t need to be re-rendered, only the bits where the bird has flown and hit those very bad piggies. By not re-rendering the other parts the GPU saves power!

The T760 also supports a very impressive list of software APIs including:

  • Khronos compliant OpenGL® ES 3.0 /2.0 / 1.1,
  • Microsoft Windows compliant Direct3D 11.1
  • Full profile OpenCL® 1.1
  • RenderScript/ FilterScript

At 600MHz with 16 shaders the Mali-T760 has a peak performance of 326.4 GLOPS and a pixel fillrate of 9.6 Gpix/sec. That should make the Mali-T720 the fastest mobile GPU available!

Another important aspect of the Mali-T720 is its full support for OpenCL. At the moment the Mali-T720 is the only mobile GPU that can run all the OpenCL benchmarks and this chip really marks the full arrival of GPU computing on smartphones and tablets. GPUs are better at certain operations than CPUs. Although a CPU can perform the same tasks they will be handled in software and are therefore slower and less efficient. A GPU can better handle certain tasks like those associated with facial recognition, gestures, computer vision and augmented reality. By utilizing the GPU the CPU is freed and on a big.LITTLE system the SoC can switch to the more energy efficient CPU cores while the GPU is working of the GPU compute tasks. This makes the GPU a companion to the CPU which can do more than just render frames for the display.

Who and When?

ARM currently has 84 Mali license holders and the new Mali-T720 and T760 have been licensed by companies like LG Electronics, MediaTek and Rockchip. Missing from the list is obviously Apple and Qualcomm, however Samsung continues to use Mali GPUs in some of its SoCs. The Exynos 5420, found in some variants of the Galaxy Note 3, dropped the Imagination PowerVR SGX544MP3 GPU in favor of the ARM Mali-T628 MP6 GPU.

According to Trina Watt, Vice President of Solutions Marketing at ARM, it is likely that SoCs using the new GPUs will become available by the end of 2014 meaning that some of the next generation of smartphones and tablets launching at the Mobile World Congress in early 2015 will feature these GPUs.

By that time 64-bit mobile computing will be firmly established and the use of 4K video will also be on the rise, technologies which the new GPUs support out of the box.

Comments

  • Piterson Massenat Desir

    Sounds cute, but compare it to adreno 320 and 330. Never owned product with Mali gpu.

    • Joshua Hill

      I think the Mali 628 6 core variant is similar to the adreno 320 on terms of performance.

      • Spencer

        The Mali t-628 outperforms the adreno 320 and is about 10% slower than the adreno 330

        • Joshua Hill

          Is that the 6 core or 8 core variant?

  • Bone

    Sound about right for the BEAST that is the 64-bit Exynos 6 for the SGS4

  • Amine Elouakil

    “the new Samsung Galaxy Note 3 features the Mali-T628 with 6 shader cores” You mean the legendary version of the Note 3 that isn’t available anywhere <.<

  • jusephe

    That looks like sweet numbers but mali isn’t very often used in such high core count, even note 3 has just 6 T628 cores.

    And also It doesn’t scale “up to 16 shading cores”.
    Every T760 core contains 16 shading units AND you can have up to 16 T760 cores (256 shading units) but I can hardly imagine running so many at 600 Mhz without a fan.

    Also i wouldn’t call this the fastest GPU aivable yet.
    There are some competition mainly from Imagination Technologies and their “Rouge” family of cores. They aren’t maybe as impressive in terms of raw GFLOPS, because they are all single core designs (for now) but they impress in real world performance and mainly in compute efficiency.

    • joser116

      Are you sure that each of the 16 shader cores contains 16 shader cores? I only see 16 clearly labeled “shader cores” in the diagram of the T760. How could 16 shader cores contain 16 shader cores each? They are not shading units, they are “shader cores”.

  • MasterMuffin

    “At 600MHz with 16 shaders the Mali-T760 has a peak performance of 326.4 GLOPS and a pixel fillrate of 9.6 Gpix/sec. That should make the Mali-T720 the fastest mobile GPU available!” and then you just start talking about T720. I don’t know if the rest T720s are typos, but that one most definitely is :)

    • BtotheT

      Ok so my comment was a redo. Good, pretty obvious but good.

  • joser116

    I really like this article. It is very thorough. I have to admit that I was a little disappointed at the end when it said that these GPUs will be in smartphones by early 2015.

  • Petraeus

    Dont get me wrong but Imagination Technologies rocks :)

  • Balraj

    2015 huh…wow

  • sourabh sekhar

    I hope this makes its way into the Nexus 5 or Nexus 10

  • Hao Zhang

    Wow, it’s faster than my computer’s GPU.

  • George Av

    so how does it stack up to the GTX TITAN? lol no i really mean the s800 gpu.. the newer one that is the something 400

  • Jancox13

    OMG its close enough to desktop PC VGA performance and even better than mid low level VGA

  • LOL

    Because of this… after they reach the best possible maximum
    GPU for ARM… I am expecting a Dual GPU SoC…. That would rock……. LMAO…

  • LOL

    Because of this… after they reach the best possible maximum
    GPU for ARM… I am expecting a Dual GPU SoC…. That would rock……. LMAO… imagine a MALI GPU with another MALI… HAHAHAHA just like LITTLE.big architecture…

  • BtotheT

    You started calling the 760 a 720 after mentioning it’s pixel fill rate.

  • BtotheT

    Rumored in the Exynos 5433 8core model note 4 by ibtimes in an iphone 6 comparison to the note 4.