|
AI Collab Score: 8 / 2
Data Centers, Power, and the Communities Behind AI Infrastructure
For years, the cloud was described like something weightless.
Applications moved to the cloud. Storage moved to the cloud. Businesses modernized in the cloud. Consumers streamed, searched, posted, gamed, navigated, collaborated, and automated through the cloud. But the cloud was never really in the sky. It was always somewhere. It was on land. Connected to substations. Fed by transmission lines. Cooled by air, water, or liquid systems. Protected by physical security. Operated by engineers, electricians, facility teams, network teams, construction crews, and supply chain partners. And now, with the acceleration of artificial intelligence, that physical infrastructure is becoming one of the most important layers of the digital economy. AI has forced a much bigger conversation about data centers. Not just how many GPUs we can deploy. Not just how fast we can train or serve models. Not just how much compute we can bring online. The bigger conversation is about power, water, environmental impact, land, utilities, community trust, and long-term planning. The issue is not data centers themselves. Data centers are necessary. The issue is unchecked growth without power, water, environmental, and community planning. That is where the conversation needs to mature.
0 Comments
AI Collab Score: 7 / 3 Artificial intelligence is entering a new phase.
For the past several years, most enterprise conversations have focused on model capability. How large is the model? How many parameters does it contain? How many GPUs are required to train and serve it? What benchmark scores does it achieve? These questions remain important, but they are no longer the most pressing concern. A more consequential shift is underway. AI systems are evolving from assistants that generate responses to agents that can take action. That distinction changes everything. Chatbots answer. Agents act. And the moment an AI system can access files, call APIs, execute commands, and orchestrate multi-step workflows, the central challenge of enterprise AI is no longer intelligence alone. It is trust. AI Collab Score: 9 / 2 From model performance to operational economicsThe first wave of enterprise AI was funded like an experiment.
The next wave will be judged like operations. That shift changes everything. Once AI moves from pilots and demos into daily workflows, the question is no longer whether the model can respond. The question is whether the organization can afford to run intelligence repeatedly, securely, and at scale. That is where inference becomes the real enterprise challenge. For the past few years, much of the AI conversation has centered on models. Bigger models. Faster models. More capable models. Better benchmarks. More impressive demonstrations. Those things still matter, but they are no longer the whole story. Enterprise AI is moving from experimentation to operations, and inference is where the real economics show up. Training may create the model, but inference is where the business pays to use it. AI Collab Score: 9 / 2 For a long time, there was a rule everyone in modeling followed—whether you were in finance, statistics, or early machine learning:
Keep the model simple. The reasoning was straightforward. If you added too many parameters, your model would overfit—memorize the past instead of learning something that generalizes. Simpler models were safer. More stable. Easier to trust. That rule shaped decades of thinking in finance in particular. Factor models stayed small. Linear relationships dominated. Parsimony wasn’t just a preference; it was doctrine. But something has changed. Recent work in financial machine learning—and increasingly, real-world practice—has revealed a pattern that directly contradicts that intuition: Models with more parameters than data points can perform better out of sample. This isn’t just theory. At the Future Alpha quant event, in a session on Machine Learning, Market Risk, and the Future of Asset Pricing, the message was clear: leading firms are moving away from small, interpretable models toward highly parameterized ones that better reflect the actual structure of markets. AI Collab Score: 9 / 1 What GTC 2026 Revealed About the Future of AI Infrastructure We’ve Been Optimizing the Wrong LayerFor the past few years, most conversations around AI infrastructure have centered on one thing: building bigger and faster AI factories. More GPUs. Larger clusters. Faster interconnects. And for a while, that made sense. Training was the bottleneck. But sitting in this session at GTC 2026, it became clear that the bottleneck has shifted—and most organizations haven’t caught up yet. The real challenge is no longer how we train AI. That shift—from training to inference—is not subtle. It fundamentally changes how infrastructure needs to be designed, deployed, and operated.
AI Collab Score: 9 / 3
I created a short video overview of Continuing the Journey Toward Responsible AI.
If you’d rather go deeper into the operational and governance framework, continue reading below.
From Ethical Principles to Operational Governance
Artificial intelligence is scaling faster than any general-purpose technology in modern history.
Since 2012, the compute used to train leading AI systems has increased by an estimated factor of 10 billion (10¹⁰). Training cycles that once required months now iterate in weeks. Recent enterprise benchmarks show that more than 70% of executives cite ethical and regulatory risk as a primary barrier to AI deployment. AI is no longer experimental. It is infrastructural. And if AI is infrastructure, then responsible AI is not philosophy. It is risk management.
AI Collab Score: 9 / 1
Why TFLOPs and VRAM Are the Least Interesting Parts of Production AIIntroduction: The GPU Fallacy
When organizations plan large-scale LLM inference, the conversation almost always starts with hardware:
This fixation on raw compute is a textbook example of what I’ve previously called the AI Illusion: the belief that advanced infrastructure automatically produces outcomes. In reality, inference performance is determined far more by the system's behavior than by GPU specs. This article breaks down the hidden bottlenecks that dominate real-world LLM inference and explains why architects who only model TFLOPs and VRAM are consistently surprised in production.
AI Collab Score: 10 / 2
Why accelerating AI output often magnifies problems instead of fixing them.
AI doesn’t automatically improve outcomes; instead, it amplifies existing processes — good or bad.
AI investment has never been higher.
AI capability has never been stronger. Yet across industries, many organizations are quietly frustrated by the results. Projects stall. Adoption plateaus. Confidence erodes. The promised transformation never quite arrives. This isn’t because AI is ineffective or overhyped. It’s because many organizations fall into what we call the AI Illusion. The illusion is the belief that adding AI automatically improves outcomes. The reality is more uncomfortable: AI amplifies whatever already exists—good or bad. If processes are clear, AI helps. If they’re unclear, AI accelerates the problems.
--- ### Watch: The AI Illusion Explained
*In this short video, I break down why AI amplifies existing systems, how organizations fall into the Amplification Trap™, and what leaders can do to design for Decision Gravity™ instead.*
AI Collab Score: 9 / 2
AI success doesn’t begin with hardware or tools — it begins with clarity.
The most effective organizations don’t start with servers or GPUs — they start with outcomes. They focus on why AI matters, not just how it works. And that’s what allows them to align models, infrastructure, and business value from day one.
Watch this quick ~10-minute walkthrough of the blueprint before you dive into the blog details.
Step 1: Inventory Reality — Begin with the Current Environment
Before defining architecture, we first assess what exists today. This determines what can be reused, what must be modernized, and where AI will struggle to scale.
AI Collab Score: 9 / 2 A New Industrial Shift: From Data Centers to AI Factories“The price of intelligence just dropped by 10x.” With that declaration, Jensen Huang signaled a generational pivot: every conventional data center is now obsolete, replaced by the AI Factory — a purpose-built system designed to mass-produce cognitive work. In the same way the industrial revolution mechanized labor, the AI Factory industrializes thought. The keynote at NVIDIA GTC 2025 outlined not a single product, but an entire economic architecture for manufacturing intelligence at scale. Intelligence at the Edge: Arc + Nokia = 6G AI on RANNVIDIA’s partnership with Nokia brings AI directly to the wireless edge through the new NVIDIA Arc platform.
Why it matters to business leaders:
|





RSS Feed