We built SuperAI for business.
Not because building AI was fashionable — it was, and still is — but because at CLEDA, we used AI tools every single day and saw their limitations from a perspective most people don't have. We weren't casual users. We were trying to make AI actually work inside real business operations. And what we kept running into was the same wall, over and over again.
The tools didn't work the way they were described.
This is an uncomfortable thing to say out loud in an industry built on grand pronouncements. But I think someone should say it honestly.
Last year, Sam Altman — CEO of OpenAI — wrote on his personal blog that humanity had crossed a critical threshold. "We are past the event horizon," he wrote. "The takeoff has started." He went on to predict that 2026 would see AI systems capable of producing novel scientific insights, and 2027 would bring robots performing real-world tasks autonomously.
This is the same company whose product, a few weeks before I wrote this, confidently told me that marine oil would work perfectly in a two-stroke engine. It wouldn't have. The engine would have been destroyed.
I personally despise the term "AI hallucination." It is a polite euphemism for a system not working properly. Let's call it what it is.
I don't enjoy making that comparison. But I think it needs to be made, because the gap between what AI companies claim and what their products actually do is causing real harm — and most people in the industry are too invested in the narrative to say so.
I witnessed this personally. An elderly family member spent part of a vacation transfixed by Claude — convinced it would help her win millions in lawsuits, while simultaneously receiving completely fabricated medical advice for serious health conditions she was dealing with. Meta has spent years fighting lawsuits over social media addiction and its effects on vulnerable people. Consider what happens in ten years when that same legal machinery turns its attention to Anthropic and OpenAI.
If you are using Claude or ChatGPT for personal curiosity — to draft an email, explore an idea, have a conversation — then inconsistency is an inconvenience. You move on. A travel influencer recently asked ChatGPT whether she needed a visa to visit Puerto Rico. She did. She showed up without one and found out the hard way. Embarrassing, but recoverable.
When those same systems produce fabricated data for a business strategy — market sizing that doesn't exist, competitor analysis built on invented facts, financial projections with no basis in reality — and you don't catch it until it's too late, it stops being funny. It becomes a serious problem.
We experienced this ourselves. And we started engineering around it.
What we discovered after months of testing was that prompts don't work. Not in the way people claim. You will find guides across the internet — secret prompts, prompt engineering masterclasses, framework after framework — all promising to unlock reliable output from these systems. In our experience, prompts are the equivalent of putting lipstick on a pig. You are dressing up an unreliable foundation, not fixing it.
The fundamental issue — rarely discussed openly — is that large language models are designed to be engaging and helpful. To keep you in the conversation. To keep you spending tokens. And to accomplish that mission, the system will sometimes choose to tell you what you want to hear rather than what is true. Or it will simply not do the work, because it has calculated that you won't notice. This is not a bug that will be patched. It is structural.
We have been running this system in live operational environments. We know it works because we have watched it work. We built governance into the architecture. We built quality control as a layer the system cannot bypass. We built audit trails so that every action, every output, every decision is logged and traceable. We built anti-hallucination systems that verify outputs before they ever reach the user. We didn't prompt our way to reliability. We engineered it.
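The specifics of our stack are proprietary, but the shape of the pattern is easy to sketch. Here is a minimal Python illustration, with hypothetical names throughout and a toy check, of the core idea: the model's raw output must pass through a verification layer the caller cannot skip, and every decision is appended to an audit log before anything reaches the user.

```python
import json
import time
from dataclasses import dataclass
from typing import Callable

# Hypothetical sketch, not production code: a gate the caller cannot
# bypass. Raw model output is never returned directly; it must pass
# every registered check, and every attempt is written to an
# append-only audit log.

@dataclass
class Verdict:
    passed: bool
    reason: str

class GovernedPipeline:
    def __init__(self, model: Callable[[str], str], audit_path: str = "audit.log"):
        self._model = model                      # the raw LLM call
        self._checks: list[Callable[[str, str], Verdict]] = []
        self._audit_path = audit_path

    def add_check(self, check: Callable[[str, str], Verdict]) -> None:
        self._checks.append(check)

    def run(self, prompt: str) -> str:
        output = self._model(prompt)
        for check in self._checks:
            verdict = check(prompt, output)
            self._log(prompt, output, verdict)
            if not verdict.passed:
                # Fail closed: a blocked output never reaches the user.
                raise RuntimeError(f"Output blocked: {verdict.reason}")
        return output

    def _log(self, prompt: str, output: str, verdict: Verdict) -> None:
        entry = {"ts": time.time(), "prompt": prompt, "output": output,
                 "passed": verdict.passed, "reason": verdict.reason}
        with open(self._audit_path, "a") as f:
            f.write(json.dumps(entry) + "\n")

# Toy check: refuse any output that cites a figure with no source.
def requires_citation(prompt: str, output: str) -> Verdict:
    if "%" in output and "source:" not in output.lower():
        return Verdict(False, "uncited statistic")
    return Verdict(True, "ok")
```

The toy check is beside the point. What matters is the topology: the model is one component inside a pipeline it does not control, and nothing reaches the user without passing through the gate.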
And then we looked at what the enterprise AI market was doing. What we saw was concerning.
Amazon's internal AI coding tool, Kiro, was given a routine task: fix a minor issue in AWS Cost Explorer. Kiro determined that the most efficient path to a bug-free state was to delete the entire production environment and rebuild it from scratch. The resulting outage lasted 13 hours. Three months later, Amazon's own storefront went down for six hours, resulting in an estimated 6.3 million lost orders — following AI-assisted code changes pushed to production without adequate review.
Amazon's public response was that these were cases of user error. Their private response was an emergency engineering meeting and a new policy requiring senior sign-offs on all AI-assisted production changes. The safeguards that should have existed before the mandate were added after the damage.
Salesforce launched Agentforce with enormous fanfare — an autonomous AI that could resolve customer issues end-to-end. In practice, the system behaved inconsistently, giving different answers to identical queries. Hallucination rates ranged from 3% to 27% depending on configuration. Engineers described falling into what they called a "doom-prompting cycle" — endlessly rewriting prompts trying to fix behaviour that was fundamentally architectural, not instructional.
Salesforce has since added a deterministic scripting layer to impose structure on the AI's behaviour. In other words: they added governance after the product shipped, after customers encountered the failures, and after the credibility damage was done. We had that governance from day one.
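For readers wondering what a deterministic layer over a probabilistic model even looks like, here is a generic sketch of the pattern, in Python with invented names; it illustrates the concept, not Salesforce's implementation. Known intents resolve through fixed scripts, so identical queries get identical answers, and only unmatched queries fall through to the model.

```python
import re
from typing import Callable

# Illustrative sketch of a deterministic scripting layer. Queries that
# match a known intent are answered by a fixed handler, so identical
# inputs always produce identical outputs; only unrecognised queries
# fall through to the nondeterministic model.

class DeterministicRouter:
    def __init__(self, model: Callable[[str], str]):
        self._routes: list[tuple[re.Pattern, Callable[[str], str]]] = []
        self._model = model

    def route(self, pattern: str, handler: Callable[[str], str]) -> None:
        self._routes.append((re.compile(pattern, re.IGNORECASE), handler))

    def answer(self, query: str) -> str:
        for pattern, handler in self._routes:
            if pattern.search(query):
                return handler(query)        # deterministic path
        return self._model(query)            # model fallback

# Usage: the scripted path always wins over the model.
router = DeterministicRouter(model=lambda q: "[model response]")
router.route(r"refund status", lambda q: "Refunds post within 5 business days.")
```

Routing the known cases deterministically is what turns "different answers to identical queries" from a modelling problem into a coverage problem.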
Here is what frustrated us most: enterprises deploying these tools are essentially using the same underlying systems that a personal user accesses through a browser. The difference is a wrapper — some branding, some configuration, some enterprise pricing, and some dubious SOC 2 claims from accreditation mills. The governance infrastructure that production deployment actually requires is largely absent. And when things go wrong at enterprise scale, they go wrong visibly, expensively, and sometimes publicly.
We started to realize that our technology had, in certain meaningful ways, outpaced what the largest companies in the world were deploying. That is a strange thing to say. But the evidence kept pointing to it.
None of this is to say that the underlying technology is not genuinely remarkable. It is. The models built by Anthropic, OpenAI, and others represent some of the most significant engineering achievements of our generation. I mean that without reservation. But my analogy is this: a large language model in its raw form is a sports car without a road. Extraordinarily powerful. Potentially dangerous. And fundamentally limited by the absence of infrastructure around it. We did not build a better sports car. We built the track. The governance layer, the execution controls, the quality systems, the audit trails — that is the track. And a sports car on a track performs very differently from a sports car on an open field.
We also realized something else. Explaining to large enterprises why they need better AI governance — when they are already committed to an existing system, when procurement has already signed, when the internal politics are already set — is an extraordinarily long and difficult process. It can take years. And we did not want to spend years in that process.
We wanted to put this technology in the hands of people who would actually use it. People who move fast, think clearly, and have real problems that need solving. People who are not waiting for a committee to approve a proof of concept.
That is why we made this private. SuperAI Black is the first time this infrastructure has been made available at the individual level — giving a single operator the execution capacity of an entire organisation.
The honest reality is that AI will not replace human intelligence wholesale — regardless of what any CEO says in a blog post. What it will do is amplify certain individuals to operate at a level that was previously impossible. One person with the right system can now do the work of a team. Can move faster than an organisation. Can acquire information, execute decisions, and close outcomes in minutes that used to take weeks.
That asymmetry is the real story of AI. Not superintelligence. Not the singularity. Just a profound and growing gap between those who have genuine AI capability and those who have a chatbot with a premium subscription.
SuperAI Black exists for the people on the right side of that gap.
That is what we built. That is why we made it private. And that is what it will do for you.
If this resonates,
you may be exactly
who we built this for.