virtualizationvelocity
  • Home
  • Video Hub
  • About
  • VMware
    • vExpert
    • VMware Explore >
      • VMware Explore 2025
      • VMware Explore 2024
      • VMware Explore 2023
      • VMware Explore 2022
    • VMworld >
      • VMworld 2021
      • VMworld 2020
      • VMworld 2019
      • VMworld 2018
      • VMworld 2017
      • VMworld 2016
      • VMworld 2015
      • VMworld 2014
  • The Class Room
  • AI Model Compute Planner
  • Contact
  • AI Collab Score

Your Definitive Source for Actionable Insights on Cloud, Virtualization & Modern Enterprise IT

Beyond GPUs: NVIDIA’s Layered Approach to the Enterprise AI Control Plane

6/8/2026

0 Comments

 
AI Collab Score: 7 / 3
Enterprise AI conversations often begin with GPUs.

That makes sense. GPUs are the visible engine behind modern AI. They are the scarce resource. They are the line item everyone notices. They are also the easiest part of the AI infrastructure conversation to simplify.

But production AI does not succeed because an organization owns accelerators.
It succeeds when the organization can control how data, models, users, workloads, policies, and costs move through the platform.

That is the real challenge.

Enterprises are quickly learning that AI infrastructure is not just about acquiring compute. It is about building an operating model around that compute. Who gets access? Which workloads take priority? How are models deployed? How is inference scaled? How are agents evaluated? How are risks governed? How do teams avoid wasting expensive GPU capacity?

This is where the idea of an AI control plane becomes important.

But there is a common misconception. The AI control plane is not usually one product or one dashboard. It is a layered architecture. Different platforms control different parts of the AI lifecycle.

NVIDIA’s approach reflects that reality.

NVIDIA’s answer is not simply, “Buy GPUs.” It is a broader AI Factory architecture: accelerated infrastructure combined with software layers for workload orchestration, inference deployment, model and agent development, governance, and operations.
In other words, NVIDIA’s enterprise strategy is increasingly about helping organizations move from owning AI infrastructure to operating an AI factory.

Read More

The Cloud Was Never Weightless

6/2/2026

0 Comments

 
AI Collab Score: 8 / 2

Data Centers, Power, and the Communities Behind AI Infrastructure

For years, the cloud was described like something weightless.

Applications moved to the cloud.
Storage moved to the cloud.
Businesses modernized in the cloud.
Consumers streamed, searched, posted, gamed, navigated, collaborated, and automated through the cloud.

But the cloud was never really in the sky.

It was always somewhere.

It was on land.
Connected to substations.
Fed by transmission lines.
Cooled by air, water, or liquid systems.
Protected by physical security.
Operated by engineers, electricians, facility teams, network teams, construction crews, and supply chain partners.

And now, with the acceleration of artificial intelligence, that physical infrastructure is becoming one of the most important layers of the digital economy.

AI has forced a much bigger conversation about data centers. Not just how many GPUs we can deploy. Not just how fast we can train or serve models. Not just how much compute we can bring online.

The bigger conversation is about power, water, environmental impact, land, utilities, community trust, and long-term planning.

The issue is not data centers themselves.

Data centers are necessary.

The issue is unchecked growth without power, water, environmental, and community planning.
That is where the conversation needs to mature.

Read More

NemoClaw: Why Trust Is Becoming Part of the AI Infrastructure Stack

5/13/2026

0 Comments

 
AI Collab Score: 7 / 3
Artificial intelligence is entering a new phase.

For the past several years, most enterprise conversations have focused on model capability. How large is the model? How many parameters does it contain? How many GPUs are required to train and serve it? What benchmark scores does it achieve?

These questions remain important, but they are no longer the most pressing concern.

A more consequential shift is underway.

AI systems are evolving from assistants that generate responses to agents that can take action.

That distinction changes everything.

Chatbots answer.

Agents act.

And the moment an AI system can access files, call APIs, execute commands, and orchestrate multi-step workflows, the central challenge of enterprise AI is no longer intelligence alone.

It is trust.

Read More

The Inference Economy: Why Running AI Is Becoming the Real Enterprise Challenge

4/26/2026

0 Comments

 
AI Collab Score: 9 / 2

From model performance to operational economics

The first wave of enterprise AI was funded like an experiment.

The next wave will be judged like operations.

That shift changes everything.

Once AI moves from pilots and demos into daily workflows, the question is no longer whether the model can respond. The question is whether the organization can afford to run intelligence repeatedly, securely, and at scale.

That is where inference becomes the real enterprise challenge.

For the past few years, much of the AI conversation has centered on models. Bigger models. Faster models. More capable models. Better benchmarks. More impressive demonstrations.
Those things still matter, but they are no longer the whole story.

Enterprise AI is moving from experimentation to operations, and inference is where the real economics show up.
Training may create the model, but inference is where the business pays to use it.
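
To make that concrete, here is a minimal back-of-the-envelope sketch in Python. Every number in it is hypothetical, and the function name and parameters are illustrative assumptions rather than figures from any real deployment. It simply asks how many months of steady usage it takes for recurring inference spend to match a one-time model cost.

```python
def months_to_match_one_time_cost(one_time_cost, requests_per_day,
                                  cost_per_request, days_per_month=30):
    """Months of steady traffic until cumulative inference spend equals a one-time cost."""
    monthly_inference = requests_per_day * cost_per_request * days_per_month
    if monthly_inference <= 0:
        raise ValueError("inference spend must be positive")
    return one_time_cost / monthly_inference


# Hypothetical inputs, chosen only for illustration.
months = months_to_match_one_time_cost(
    one_time_cost=120_000,      # assumed one-time training or fine-tuning bill
    requests_per_day=200_000,   # assumed steady request volume
    cost_per_request=0.002,     # assumed blended cost per request
)
print(f"Inference spend matches the one-time cost after about {months:.1f} months")
```

The point of the sketch is the shape of the math, not the values: the one-time cost is fixed, while the inference term keeps growing with every request and every month the workload stays in production.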

Read More

The Double Descent: Why Bigger Models Demand Smarter Infrastructure

4/4/2026

0 Comments

 
AI Collab Score: 9 / 2
For a long time, there was a rule everyone in modeling followed—whether you were in finance, statistics, or early machine learning:

Keep the model simple.

The reasoning was straightforward. If you added too many parameters, your model would overfit—memorize the past instead of learning something that generalizes. Simpler models were safer. More stable. Easier to trust.

That rule shaped decades of thinking in finance in particular. Factor models stayed small. Linear relationships dominated. Parsimony wasn’t just a preference; it was doctrine.
But something has changed.

Recent work in financial machine learning—and increasingly, real-world practice—has revealed a pattern that directly contradicts that intuition:

Models with more parameters than data points can perform better out of sample.
This isn’t just theory. At the Future Alpha quant event, in a session on Machine Learning, Market Risk, and the Future of Asset Pricing, the message was clear: leading firms are moving away from small, interpretable models toward highly parameterized ones that better reflect the actual structure of markets.
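
If you want to probe that claim yourself, here is a minimal sketch in Python with NumPy. It is not the method presented in the session: the synthetic data, the random ReLU features, and every name in it are my own assumptions for illustration. It fits a minimum-norm least-squares model at several feature counts, including counts well above the number of training points, and reports train and test error for each so you can compare them.

```python
import numpy as np


def random_feature_errors(n_train=40, n_test=500, dim=5,
                          widths=(5, 20, 40, 80, 400), seed=0):
    """Return {width: (train_mse, test_mse)} for random-ReLU-feature regression."""
    rng = np.random.default_rng(seed)
    w_true = rng.normal(size=dim)  # hidden signal direction (synthetic)

    def make(n):
        x = rng.normal(size=(n, dim))
        y = np.tanh(x @ w_true) + 0.1 * rng.normal(size=n)  # noisy nonlinear target
        return x, y

    x_tr, y_tr = make(n_train)
    x_te, y_te = make(n_test)

    results = {}
    for p in widths:
        proj = rng.normal(size=(dim, p)) / np.sqrt(dim)  # fixed random projection
        f_tr = np.maximum(x_tr @ proj, 0.0)              # ReLU features, train
        f_te = np.maximum(x_te @ proj, 0.0)              # ReLU features, test
        # lstsq returns the minimum-norm solution when p > n_train.
        coef, *_ = np.linalg.lstsq(f_tr, y_tr, rcond=None)
        train_mse = float(np.mean((f_tr @ coef - y_tr) ** 2))
        test_mse = float(np.mean((f_te @ coef - y_te) ** 2))
        results[p] = (train_mse, test_mse)
    return results


errors = random_feature_errors()
for width, (train_mse, test_mse) in errors.items():
    print(f"features={width:4d}  train_mse={train_mse:.4f}  test_mse={test_mse:.4f}")
```

Once the feature count exceeds the number of training points, the model fits the training data essentially exactly. What happens to test error on either side of that threshold is the part worth watching, and it will vary with the seed and the noise level.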

Read More

Beyond the AI Factory: How the AI Grid Is Redefining Distributed Intelligence

3/18/2026

0 Comments

 
AI Collab Score: 9 / 1
What GTC 2026 Revealed About the Future of AI Infrastructure

We’ve Been Optimizing the Wrong Layer

For the past few years, most conversations around AI infrastructure have centered on one thing: building bigger and faster AI factories.

More GPUs.
Larger clusters.
Faster interconnects.

And for a while, that made sense. Training was the bottleneck.
But sitting in this session at GTC 2026, it became clear that the bottleneck has shifted—and most organizations haven’t caught up yet.
The real challenge is no longer how we train AI.
The challenge is how we deliver it.
That shift, from training to inference, is not subtle. It fundamentally changes how infrastructure needs to be designed, deployed, and operated.

Read More

Continuing the Journey Toward Responsible AI

2/25/2026

0 Comments

 
AI Collab Score: 9 / 3
I created a short video overview of Continuing the Journey Toward Responsible AI.
If you’d rather go deeper into the operational and governance framework, continue reading below.

From Ethical Principles to Operational Governance

Artificial intelligence is scaling faster than any general-purpose technology in modern history.

Since 2012, the compute used to train leading AI systems has increased by an estimated factor of 10 billion (10¹⁰). Training cycles that once required months now iterate in weeks. Recent enterprise benchmarks show that more than 70% of executives cite ethical and regulatory risk as a primary barrier to AI deployment.
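
To put that growth factor in perspective, here is a small Python sketch that converts it into an implied doubling time. The 14-year window (2012 to early 2026) is my assumption, since the estimate above gives only the starting year, and the function name is illustrative.

```python
import math


def doubling_time_months(growth_factor, years):
    """Months per doubling implied by a total growth factor over a period."""
    doublings = math.log2(growth_factor)
    return years * 12 / doublings


# Assumption: the 10^10 increase is spread over roughly 14 years.
months = doubling_time_months(1e10, 14)
print(f"{math.log2(1e10):.1f} doublings, one every {months:.1f} months")
# prints "33.2 doublings, one every 5.1 months"
```

Under that assumption, training compute has doubled about every five months for more than a decade.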

AI is no longer experimental.

It is infrastructural.

And if AI is infrastructure, then responsible AI is not philosophy.

It is risk management.

Read More

The Hidden Bottlenecks in LLM Inference

1/24/2026

0 Comments

 
AI Collab Score: 9 / 1 

Why TFLOPs and VRAM Are the Least Interesting Parts of Production AI


Introduction: The GPU Fallacy

When organizations plan large-scale LLM inference, the conversation almost always starts with hardware:
  • How many GPUs?
  • How much VRAM?
  • How many TFLOPs?
  • What’s the max tokens per second?
Those numbers matter — but they are not where most production latency or cost comes from.

This fixation on raw compute is a textbook example of what I’ve previously called the AI Illusion: the belief that advanced infrastructure automatically produces outcomes. In reality, inference performance is determined far more by the system's behavior than by GPU specs.
This article breaks down the hidden bottlenecks that dominate real-world LLM inference and explains why architects who only model TFLOPs and VRAM are consistently surprised in production.

Read More

The AI Illusion: Why More AI Often Creates Less Value

1/3/2026

0 Comments

 
AI Collab Score: 10 / 2
Why accelerating AI output often magnifies problems instead of fixing them.
AI doesn’t automatically improve outcomes; instead, it amplifies existing processes — good or bad.
AI investment has never been higher.
AI capability has never been stronger.

Yet across industries, many organizations are quietly frustrated by the results. Projects stall. Adoption plateaus. Confidence erodes. The promised transformation never quite arrives.
This isn’t because AI is ineffective or overhyped. It’s because many organizations fall into what we call the AI Illusion.

The illusion is the belief that adding AI automatically improves outcomes. The reality is more uncomfortable: AI amplifies whatever already exists—good or bad. If processes are clear, AI helps. If they’re unclear, AI accelerates the problems.
Watch: The AI Illusion Explained
In this short video, I break down why AI amplifies existing systems, how organizations fall into the Amplification Trap™, and what leaders can do to design for Decision Gravity™ instead.

Read More

From Discovery to AI Outcomes: A Proven Method for On-Prem AI Success

11/26/2025

0 Comments

 
AI Collab Score: 9 / 2 
AI success doesn’t begin with hardware or tools — it begins with clarity.
The most effective organizations don’t start with servers or GPUs — they start with outcomes.

They focus on why AI matters, not just how it works.

And that’s what allows them to align models, infrastructure, and business value from day one.
Watch this quick ~10-minute walkthrough of the blueprint before you dive into the blog details.

Step 1: Inventory Reality — Begin with the Current Environment

Before defining architecture, we first assess what exists today. This determines what can be reused, what must be modernized, and where AI will struggle to scale.

Read More

Virtualization Velocity

© 2025 Brandon Seymour. All rights reserved.

Privacy Policy | Contact
