News Feed

AWS Tunes Up Compute And Network For HPC

When it comes to hardware, there was not a lot of big news coming out of the Amazon Web Services re:Invent 2022 conference this week.

AWS Tunes Up Compute And Network For HPC was written by Timothy Prickett Morgan at The Next Platform.

ADIA Lab Announces Board Members, Collaborations and Research Award

Abu Dhabi, 2 December 2022 — ADIA Lab, focused on basic and applied research in data and computational sciences, commenced operations today, the 51st National Day of the UAE. As part of its launch, the lab announced the composition of its advisory board, the details of its first academic collaborations and the launch of a US$100,000 research award. ADIA […]

The post ADIA Lab Announces Board Members, Collaborations and Research Award appeared first on High-Performance Computing News Analysis | insideHPC.

SC22: CXL3.0, the Future of HPC Interconnects and Frontier vs. Fugaku

HPC luminary Jack Dongarra’s fascinating comments at SC22 on the low efficiency of leadership-class supercomputers highlighted by the latest High Performance Conjugate Gradients (HPCG) benchmark results will, I believe, influence the next generation of supercomputer architectures to optimize for sparse matrix computations. The upcoming technology that will help address this problem is CXL. Next generation architectures will use CXL3.0 switches to connect processing nodes, pooled memory and I/O resources into very large, coherent fabrics within a rack, and use Ethernet between racks. I call this a “Petalith” architecture (explanation below), and I think CXL will play a significant and growing role in shaping this emerging development in the high performance interconnect space.

The post SC22: CXL3.0, the Future of HPC Interconnects and Frontier vs. Fugaku appeared first on High-Performance Computing News Analysis | insideHPC.

Counting The Cost Of Training Large Language Models

It has been becoming increasingly clear – anecdotally at least – just how expensive it is to train large language models and recommender systems, which are arguably the two most important workloads driving AI into the enterprise.

Counting The Cost Of Training Large Language Models was written by Timothy Prickett Morgan at The Next Platform.

TOP500 News


None
ORNL’s Frontier First to Break the Exaflop Ceiling
May 30, 2022

The 59th edition of the TOP500 revealed the Frontier system to be the first true exascale machine with an HPL score of 1.102 Exaflop/s.

The No. 1 spot is now held by the Frontier system at Oak Ridge National Laboratory (ORNL) in the US. Based on the latest HPE Cray EX235a architecture and equipped with AMD EPYC 64C 2GHz processors, the system has 8,730,112 total cores, a power efficiency rating of 52.23 gigaflops/watt, and relies on gigabit ethernet for data transfer.


Still waiting for Exascale: Japan's Fugaku outperforms all competition once again
Nov. 15, 2021

FRANKFURT, Germany; BERKELEY, Calif.; and KNOXVILLE, Tenn.— The 58th annual edition of the TOP500 saw little change in the Top10. The Microsoft Azure system called Voyager-EUS2 was the only machine to shake up the top spots, claiming No. 10. Based on an AMD EPYC processor with 48 cores and 2.45GHz working together with an NVIDIA A100 GPU and 80 GB of memory, Voyager-EUS2 also utilizes a Mellanox HDR Infiniband for data transfer. 


The List

11/2022 Highlights

  • The Frontier system at the Oak Ridge National Laboratory, Tennessee, USA remains the No. 1 system on the TOP500 and is still the only system reported with an HPL performance exceeding one Exaflop/s. Frontier brought the pole position back to the USA on the June listing with an HPL score of 1.102 Exaflop/s.
  • The LUMI system at EuroHPC/CSC in Finland entered the list last June at No. 3. It is again listed as No. 3 but only thanks to an upgrade of the system, which doubled its size. With its increased HPL score of 309 Pflop/s it remains the largest system in Europe.
  • The only new machine to grace the top of the list was the No. 4 Leonardo system at EuroHPC/CINECA in Bologna, Italy. The machine achieved an HPL score of .174 EFlop/s with 1,463,616 cores.

 

read more »

List Statistics