Executive Roundtable: Cooling, Costs, and Integration in the AI Data Center Era

As AI continues to reshape the physical and operational profile of data centers, the industry faces a pivotal inflection point in how it manages thermal loads, water resources, and infrastructure integration. In our Q3 2025 Executive Roundtable, we ask leaders from CoolIT, Trane, and Ecolab to weigh in on three critical questions that explore the rising demands of AI-driven density, the financial balancing act between CapEx and OpEx, and the accelerating push to integrate cooling, power, and water systems in unified, scalable designs.
Sept. 17, 2025
7 min read

At Data Center Frontier, we rely on industry leaders to help us understand the most urgent challenges reshaping digital infrastructure. And in the third quarter of 2025, few forces are driving deeper change than artificial intelligence and its massive thermal, water, and power demands.

AI data centers are not only bigger: they’re hotter, denser, and arriving faster than anything the industry has previously managed. These shifts are forcing operators to rethink the very fundamentals of cooling design, utility integration, and capital efficiency.

To launch our Executive Roundtable for Q3 2025, we asked our panel to tackle a foundational question: how the industry is addressing thermal risk at AI scale.

As AI workloads increasingly dominate new data center builds, operators face mounting challenges in managing extreme heat densities and water demands. We asked our experts to share which risks they see as most critical; and crucially, how their organizations are evolving solutions to meet these challenges at scale.

Our distinguished panelists this quarter bring expertise across liquid cooling, water chemistry, and system-level HVAC integration. They include:

 

  • Brandon Peterson, Senior Vice President of Business Development, CoolIT Systems
  • Mukul Girotra, Senior Vice President and General Manager, Global High-Tech Division, Ecolab
  • Becky Wacker, Vice President – Data Center Solutions, Trane

Today: Thermal Risk at AI Scale

In today’s kickoff article, Brandon Peterson of CoolIT describes how AI factories demand liquid cooling systems optimized from chip to chiller, with integration into Building Management Systems and support for standards like Redfish. He emphasizes the importance of bridging IT and facility silos through system-level design expertise.

Next, Mukul Girotra of Ecolab highlights how direct-to-chip cooling programs require ongoing coolant health insights and proactive water treatment. Ecolab’s 3D TRASAR™ technology and Water Quality IQ™ platform are helping operators monitor temperature, flow, and chemistry in real time to anticipate risks before they compromise uptime.

For her part, Becky Wacker, VP – Data Center Solutions at Trane Technologies, emphasizes that data centers are a vast ecosystem where rising AI heat densities, load spikes, and water scarcity must be managed holistically. She points to Trane’s focus on liquid and immersion cooling, integrated system design, and smart controls that predict and balance thermal loads in real time — all while advancing water recycling to ensure sustainability at AI scale.

What’s Ahead in the Roundtable

Over the coming week, our panel will also share their perspectives on other pressing Q3 themes:

  • CapEx vs. OpEx in the AI Era: Striking the balance between urgent build timelines and long-term efficiency in thermal and water systems.
  • The Integration Imperative: How the convergence of thermal, water, and power systems is breaking down old silos — and reshaping organizational strategies.

From liquid cooling hardware and intelligent water chemistry to system integration strategies, our Q3 panelists bring deep technical and operational insight.

Read on for our first Executive Roundtable discussion on thermal risk in the age of AI.

Data Center Frontier: AI workloads are now dominating new data center builds. What are the most critical thermal or water-related risks operators must solve at scale, and how are your solutions evolving to meet that challenge?

Brandon Peterson, CoolIT Systems:  As liquid cooling expands alongside AI, minor issues become major challenges for data center operators. AI factories are pushing for maximum token throughput which requires the rest of the system to run efficiently, ensuring energy is focused on computation rather than cooling. Today’s liquid cooling systems are often not fully optimized from chip through to chiller. Improving how the technology cooling system loop, from cold plates to Coolant Distribution Units (CDUs), is designed can deliver significant efficiency gains. Furthermore, integrating CDUs with Building Management Systems (BMS) can also help achieve additional energy and operational savings.

The gap is twofold: (1) IT and infrastructure teams are often siloed or struggle to align on fully optimized system requirements and (2) deep expertise in liquid cooling system design and optimization remains limited.

To address these challenges, we collaborate closely with customers from initial component design through system-level implementation to ensure cross-team alignment and fully optimized solutions.

CoolIT cold plates, manifolds and CDUs are engineered for maximum performance within the smallest footprint.

Our CDUs feature advanced communication and monitoring capabilities, including support for Redfish, an increasingly common standard among major data center operators.

While high-performance hardware is essential, seamless integration into the broader system is a key focus at CoolIT. That is where we also work with operators on system design.

Mukul Girotra, Ecolab:  The shift to AI workloads represents a fundamental inflection point for data center thermal management, creating challenges that traditional air cooling simply cannot address at scale.

The reality is AI chips are generating heat densities of 40-100kW per rack, which is often 3-5x higher than traditional enterprise workloads. This creates localized hot spots that overwhelm conventional facility cooling systems and can lead to catastrophic thermal failures if not properly managed.

Here’s how we’re addressing that cooling challenge at Ecolab.

We’ve developed site-to-chip cooling management solutions that include programs for cooling water, adiabatic and direct-to-chip systems, with coolant health insights delivered via digital control platforms like Ecolab® Water Quality IQ™. 

Specifically, our 3D TRASAR™ Technology for Direct-to-Chip Liquid Cooling provides real-time monitoring of system parameters including temperature, pH, flow rates, and glycol concentration, which act as leading indicators of coolant health. These insights help operators anticipate and address degradation risk before it impacts performance and uptime.

This kind of visibility is key to managing AI workloads at scale.

Ultimately, our integrated cooling programs are designed to optimize PUE (Power Usage Effectiveness) and WUE (Water Usage Effectiveness) - helping data centers run cooler, cleaner, and more sustainably, aspects that continue to be some of the industry’s biggest challenges.  
  

Becky Wacker, Trane:  As AI workloads increasingly dominate new data center builds, operators face significant challenges in managing thermal loads and water resources. These challenges include significantly higher heat density, large, aggregated load spikes, uneven distribution of cooling needs, and substantial water requirements if using traditional evaporative cooling methods. The most critical risks include overheating, inefficient cooling systems, and water scarcity.

These issues can lead to reduced hardware lifespan, hardware throttling, sudden shutdowns, failure to meet PUE targets, higher operational costs, and limitations on where AI data centers can be built due to water constraints.

At Trane, we are evolving our solutions to meet these challenges through advanced cooling technologies such as liquid cooling and immersion cooling, which offer higher efficiency and lower thermal resistance compared to traditional air-cooling methods.

Flexibility and scalability are central to our design philosophy. We believe a total system solution is crucial, integrating components such as CDUs, Fan Walls, CRAHs, and Chillers to anticipate demand and respond effectively.

In addition, we are developing smart monitoring and control systems that leverage AI to predict and manage thermal loads in real-time, ensuring optimal performance and preventing overheating through Building Management Systems and integration with DCIM platforms.

Our water management solutions are also being enhanced to recycle and reuse water, minimizing consumption and addressing scarcity concerns.

 

NEXT:  CapEx vs. OpEx in the AI Era

 
Keep pace with the fast-moving world of data centers and cloud computing by connecting with Data Center Frontier on LinkedIn, following us on X/Twitter and Facebook, as well as on BlueSky, and signing up for our weekly newsletters using the form below.

About the Author

Matt Vincent

A B2B technology journalist and editor with more than two decades of experience, Matt Vincent is Editor in Chief of Data Center Frontier.

Sign up for the Data Center Frontier Newsletter
Get the latest news and updates.
Image courtesy of Trane
Source: Image courtesy of Trane
Sponsored
Dave Isenhart, Sr. Product Manager at Trane, explains how treating thermal management as an infrastructure design cornerstone can drive stronger energy savings, uptime and return...
asharkyu/Shutterstock.com
Source: asharkyu/Shutterstock.com
Sponsored
Emma Harvey of Nexus Media Events previews the 30th European Conference on Optical Communication (ECOC) Exhibition, which will be held in Copenhagen, September 29 - October 2....