AI in production: the lesson of the first AI-powered World Championship

The automated offside system at the 2026 World Cup isn’t just a sporting curiosity. It’s a case study of what separates an AI system that impresses in a demo from one you can trust when a real decision is at stake.

The 2026 World Cup will be remembered for its scale: 48 teams, 104 matches, three host countries. But for those of us who build artificial intelligence systems that have to function outside the lab, the interesting thing isn’t the spectacle, but the engineering.

FIFA hasn’t deployed AI to impress. It’s deployed it to make refereeing decisions in real time, under adverse conditions, where a mistake is visible to half the planet and can decide a match. That requirement—to always be right, to be able to explain the decision, and for a human to be held accountable—is precisely what separates AI as a showcase from AI in production .

We’re not going to repeat FIFA’s statement here. We’re going to use the World Cup to explain the three conditions that any AI system must meet before you trust it with something that matters, whether it’s disallowing a goal or approving a transaction in your company.

The impressive demo isn’t the problem. That’s where the problems begin.

Almost any team can build a model that’s right 80% of the time under controlled conditions. That’s what you see every day on LinkedIn: spectacular videos, brilliant metrics.

The problem is that this 80% is where the real work begins, not where it ends. A system that fails one out of every five times is useless for flagging offsides. And it’s equally useless for detecting fraud, classifying a transaction, or automating a credit decision.

What FIFA has had to resolve boils down to the three properties that distinguish a production-ready AI: that it is observable, governable, and optimizable . These are the three questions we ask ourselves before putting any system in front of a real user.

1. Observable: if you cannot explain the decision, you cannot trust it

The semi-automated offside system doesn’t just issue a verdict. It generates a three-dimensional recreation that shows, joint by joint, why a player was in an illegal position. The referee, the analyst, and the fan at home all see the same thing and understand why that decision was made.

This isn’t about aesthetics. It’s about observability , and it’s the first condition of any AI that you can defend to a client, an auditor, or a regulator.

An AI that gets it right but can’t explain why is a liability, not an asset. The moment something goes wrong—and it does sometimes—you need to be able to audit the decision.

In a company, this is the difference between “the model rejected the application” and “the model rejected it due to these three factors, with this weight, based on this data.” Only the second version survives an audit.

2. Governed: AI proposes, human responds

Here’s the detail that almost all World Cup articles overlook, and the one that interests us most: the referee retains the final decision. The AI detects the potential infraction and generates an alert; a person decides.

This isn’t a technological limitation; it’s a deliberate governance design : in any decision that matters, the machine assists and the person responds. FIFA does it out of operational common sense. For a European company, it’s also not optional.

And this is where the example ceases to be a mere sporting anecdote. The European Artificial Intelligence Regulation (AI Act) transforms this same principle into a legal obligation: it classifies certain uses as “high risk” and, in Article 14 , requires that these systems be designed so that a person can effectively supervise them throughout their entire lifecycle. The regulation goes beyond a simple “human approval”: it expressly requires that this supervisor be aware of automation bias , the tendency to blindly trust what the machine says, and be able to detect anomalies and failures.

A system that decides on loans, diagnoses, or hiring faces the exact same challenge as video arbitration, but with one key difference: if it operates in Europe, that challenge comes with a legal obligation. The lesson for any European company incorporating AI is straightforward: human oversight is not a barrier to automation; it is the requirement that makes it legally deployable. Those who design with human input in the loop from the outset do not have to redesign the system when the AI Act applies. Those who don’t, do.

3. Optimizable: a system that is not monitored, silently degrades

The third property is the least visible and the most decisive in the long run. The offside system we’ll see in 2026 wasn’t created from scratch: it’s the latest iteration of a technology that FIFA has been refining tournament after tournament, making it progressively faster and more accurate. What was once a slow and controversial system is now a process that takes seconds. This improvement didn’t come without a price: it was the result of continuous measurement, correction, and retraining.

And that’s precisely the point. An AI system that’s deployed once and then abandoned begins to age from day one: the data changes, the context changes, and accuracy silently declines without anyone noticing. Again, what is common sense for FIFA is also law for European companies: Article 14 of the AI Act doesn’t settle for one-off monitoring and requires that the system be able to detect anomalies, malfunctions, and unexpected performance throughout its entire lifespan.

That’s optimization : monitoring performance, detecting degradation, and retraining. It’s what transforms a one-off project into a permanent capability, and what distinguishes an organization that “has a model” from one that has AI operating on which it truly depends.

The three questions that determine if your AI is ready

FIFA’s deployment is noteworthy for its scale, but the requirements are exactly the same as those faced by any company that wants to move from “we have a pilot” to “we have AI in production”:

Can you explain each decision? (Observable)
Is there real human oversight and do you comply with the AI Act? (Governed)
Does the system improve over time, or is it degrading without you knowing? (Optimizable)

Most AI projects that fail don’t fail because of a bad model. They fail because no one answered these three questions before putting the system in the hands of real users. What FIFA invested in making its AI reliable, transparent, and supervised is precisely the part that doesn’t appear in the videos. And it’s the only part that matters when the system actually has to work.

In summary

The 2026 World Cup will be the first powered by AI on a large scale, but its real lesson is not technological, it is methodological: artificial intelligence ceases to be an experiment and becomes a reliable tool when it is built to be observable, governed and optimizable.

These three properties are not just a slogan: they are the standard by which ALGO takes AI systems from prototype to production, adhering to the same traceability, governance, and security criteria that govern our ISO 27001 and SOC 2 certifications. What refereeing a goal is for FIFA, for our clients is automating a decision their business depends on. The demands are the same.

Would your AI system pass these three tests?

The difference between an impressive pilot and an AI you can truly trust is rarely in the model itself: it’s in whether it’s been built to be observable, governable, and optimizable, and, in Europe, to comply with the AI Act. If you’re bringing AI into your organization and want it to work beyond the demo stage, ALGO can help you make the leap from prototype to production. Tell us about your situation, and we’ll review it with you.