The Power of Now: Accelerate the Datacenter

Datacenter service providers currently face a confluence of challenges that require them to adapt and modernize to meet the upcoming requirements of their principal customers. These include the growth and proliferation of new compute-intensive and storage-intensive workloads and applications, such as those that leverage AI. They also include applications whose viability and success depend on […]

Lamini Chooses Supermicro GPU Servers for LLM Tuning Offering

Lamini is developing an infrastructure for customers to run Large Language Models (LLMs) on innovative and fast servers. End-user customers can use Lamini’s LLMs or build their own using Python, an open-source programming language. Lamini has developed a software environment for customers that allows them to focus on their business needs and develop innovative AI […]

TierPoint Supports Deployment of High-Density DDC Scalable S-Series Data Center Cabinets Designed for AI Anywhere

DDC’s innovative hybrid liquid-air cooling cabinet technology enables TierPoint to deploy a significant number of GPUs for HPC/AI workloads in a new 16 MW high-density data center at the lowest TCO. Learn how.

NVIDIA InfiniBand Adaptive Routing Technology

In this white paper, we’ll look at the important role adaptive routing from NVIDIA plays in the data center. InfiniBand adaptive routing technology reroutes data to eliminate congestion and thereby increases data center performance. As presented, both HPC and AI applications that use adaptive routing achieve higher performance. Adaptive routing is an important network element that drives your HPC systems toward new levels of utilization and a higher return on investment.

It’s Time to Resolve the Root Cause of Congestion

Today, every high-performance computing (HPC) workload running globally faces the same crippling issue: congestion in the network.

Congestion can delay workload completion times for crucial scientific and enterprise workloads, making HPC systems unpredictable and leaving high-cost cluster resources waiting for delayed data to arrive. Despite various brute-force attempts to resolve the congestion issue, the problem has persisted. Until now.

In this paper, Matthew Williams, CTO at Rockport Networks, explains how recent innovations in networking technologies have led to a new network architecture that targets the root causes of HPC network congestion, specifically:

– Why today’s network architectures are not a sustainable approach to HPC workloads
– How HPC workload congestion and latency issues are directly tied to the network architecture
– Why a direct interconnect network architecture minimizes congestion and tail latency

Six Smarter Scheduling Techniques For Optimizing EDA Productivity

Workload management plays a crucial role in helping design teams share limited resources, boost simulation throughput, and maximize productivity. In this paper, Altair discusses six valuable techniques to help improve design center productivity. By adopting these techniques and products in the Altair® Accelerator™ portfolio, organizations can realize higher levels of efficiency and performance and dramatically improve productivity with smarter workload scheduling.

High Performance Computing for R&D

In this eBook, sponsored by Rescale, Microsoft Azure and AMD, we take a look at HPC deployments in support of R&D efforts. In many ways, the HPC solution in the cloud offered by Rescale on Azure delivers an unprecedented amount of power while solving many crucial and common challenges faced by R&D and design teams across many industries.

Harnessing Data In Motion

This whitepaper from Cloudera, sponsored by Carahsoft, describes how Cloudera DataFlow supports government data movement and processing. A simple, flexible, open-ended solution is required to streamline access to both structured and unstructured data from sources at the edge and across the enterprise, so agencies can use their legacy and modernized systems to take advantage of the new insights available.

New Supercomputer Enables Rugged, Real-Time AI at the Edge

Program managers face hard tradeoffs when bringing artificial intelligence to in-the-field use cases. This whitepaper describes a new AI server from One Stop Systems that shows what capabilities they should look for in portable, rugged AI deployments.

insideHPC Guide to Composable Disaggregated Infrastructure (CDI) Clusters

This technology guide will show how the Silicon Mechanics Miranda CDI Cluster™ reference architecture can be a CDI solution blueprint, ideal for tailoring to specific enterprise or other organizational needs and technical issues.