All news articles

CX/CS

CX/CS

CX/CS

Experts laud AI latest models for speed and affordability, but regulated sectors can’t risk hallucinations

Experts laud AI latest models for speed and affordability, but regulated sectors can’t risk hallucinations

Lorikeet News Desk

Jun 10, 2025

TL;DR

  • AQ22's David Mataciunas discusses the need for precision in AI applications for regulated sectors.

  • GPT-4.1 models are faster and cheaper but still prone to hallucinations, posing risks in fields like finance and medicine.

  • Mataciunas foresees AI reliability reaching a level where insurance can cover the risks of hallucinations.


Finance and medicine are huge applications for AI and chat, and everything must as accurate as possible right from the first pass.

David Mataciunas

CTO and Co-Founder | AQ22

In highly-regulated fields like finance and medicine, false confidence can be fatal. Models are getting faster and cheaper—but they still make things up. Until hallucinations are under control, trust is off the table. Navigating the precarious terrain is David Mataciunas, Co-Founder and CTO of AQ22, a firm building AI agents for the financial sector. AQ22's platform enhances loan assessment efficiency for private equity lenders and commercial banks.

Keep it real: Mataciunas' team has seen both the promise of models like GPT-4.1 and the persistent risks that come with them. For AQ22’s core markets, precision isn’t optional; it’s the baseline. "Finance and medicine are huge applications for AI and chat," he says. "It’s very important that everything is as accurate as possible right from the first pass." That demand for reliability is what keeps many regulated sectors cautious. "The biggest problem right now isn't even the cost," explains Mataciunas. "It's the hallucinations, because you need to build so much on top to stay safe and compliant."

Time for a tune-up: Persistent unreliability is driving demand for better tools, with refinement at the top of the list. "I’m really waiting until we can fine-tune 4.1," says Mataciunas. "If it’s a smaller model and we can use our own data, we can get better results. But fine-tuning right now is kind of costly." The desire for more control is widespread, yet access to fine-tuning on advanced models like GPT-4.1 is still limited. And while price and context windows are a major differentiator from open-source models, dependability remains the sticking point.

The biggest problem right now isn't even the cost. It's the hallucinations, because you need to build so much on top to stay safe and compliant.

David Mataciunas

CTO and Co-Founder | AQ22

Assurance meets insurance: Looking ahead, Mataciunas imagines a future where reliability reaches a point that AI can actually be insured. "After we reduce hallucination as much as possible, then we can implement insurance on AI agents," he says. "If we reduce hallucination to 99%, that 1% could be insured by insurance companies." The idea of covering AI risks is gaining traction as businesses search for ways to manage the uncertainty that still shadows deployment.

Stay on the rails: Until reliability improves, Mataciunas is leaning on safeguards. "We never let LLMs just do any calculations," he explains. "Everything goes to a calculator." Despite using top-tier models, "it’s still hallucinating a lot on different tasks." His hope, and a clear call to action for model providers, is that "reducing hallucination guardrails is the next step. I hope so for OpenAI—because right now, it’s a huge problem. Not the cost.”

Blu background with lorikeet flypaths

Brought to you by Lorikeet

We're building an AI system that’s capable of providing high quality, human assistance because every company should be able to scale exceptional CX.

Learn More

Blu background with lorikeet flypaths

See Lorikeet
in action

We're building an AI system that’s capable of providing high quality, human assistance because every company should be able to scale exceptional CX.

Learn More

Blu background with lorikeet flypaths

Brought to you by Lorikeet

We're building an AI system that’s capable of providing high quality, human assistance because every company should be able to scale exceptional CX.

Learn More