White Paper: OAI System Liquid Cooling Guidelines

White Paper: OAI System Liquid Cooling Guidelines
White Paper: OAI System Liquid Cooling Guidelines
0:53

This document is not a specification for OAI/OAM products. It is a set of guidelines on the design, validation, and implementation of liquid cooling solutions for AI Training Systems with 8x OAM products or others alike.

Contents of the document would help a user/designer/supplier of OAI/OAM products understand the basics around those topics/questions related to liquid cooling.

For most engineering topics/questions discussed in this document, we (the OAI Cooling workstream members) are contributing what we believed to be best practices as of today. However, for each product, there would be more than one way to design/validate/use it, not to mention potential technology evolvement or changes of dependencies down the road. Please keep open-minded while reading this document, and do not hesitate to contact us directly for feedback and further discussion.

Register to Download the whitepaper!

 

White Paper: Introduction of a new Firmware Update Workflow for PLDM & Redfish

1 min read

White Paper: Introduction of a new Firmware Update Workflow for PLDM & Redfish

Firmware updates are essential for the BMC system. Each device requires a unique update flow and utilizes different transport protocols, such as I2C...

Read More
White Paper: Beyond the Rack - The Elastic Management Framework for AI Data Centers

1 min read

White Paper: Beyond the Rack - The Elastic Management Framework for AI Data Centers

AI clusters using next-generation accelerators (e.g., NVIDIA GB200) push rack power density beyond 130 kW, making air cooling insufficient and...

Read More
White Paper: Power Efficiency Optimization in AI Systems

1 min read

White Paper: Power Efficiency Optimization in AI Systems

This whitepaper examines the growing importance of power efficiency in AI systems, where increasing computational demand translates into significant...

Read More