Experts laud the latest AI models for speed and affordability, but regulated sectors can’t risk hallucinations

Lorikeet News Desk

Jun 10, 2025

TL;DR

AQ22's David Mataciunas discusses the need for precision in AI applications for regulated sectors.
GPT-4.1 models are faster and cheaper but still prone to hallucinations, posing risks in fields like finance and medicine.
Mataciunas foresees AI reliability reaching a level where insurance can cover the risks of hallucinations.

Finance and medicine are huge applications for AI and chat, and everything must be as accurate as possible right from the first pass.

David Mataciunas

CTO and Co-Founder | AQ22

In highly regulated fields like finance and medicine, false confidence can be fatal. Models are getting faster and cheaper, but they still make things up. Until hallucinations are under control, trust is off the table. Navigating this precarious terrain is David Mataciunas, Co-Founder and CTO of AQ22, a firm building AI agents for the financial sector. AQ22's platform enhances loan assessment efficiency for private equity lenders and commercial banks.

Keep it real: Mataciunas' team has seen both the promise of models like GPT-4.1 and the persistent risks that come with them. For AQ22’s core markets, precision isn’t optional; it’s the baseline. "Finance and medicine are huge applications for AI and chat," he says. "It’s very important that everything is as accurate as possible right from the first pass." That demand for reliability is what keeps many regulated sectors cautious. "The biggest problem right now isn't even the cost," explains Mataciunas. "It's the hallucinations, because you need to build so much on top to stay safe and compliant."

Time for a tune-up: Persistent unreliability is driving demand for better tools, with refinement at the top of the list. "I’m really waiting until we can fine-tune 4.1," says Mataciunas. "If it’s a smaller model and we can use our own data, we can get better results. But fine-tuning right now is kind of costly." The desire for more control is widespread, yet access to fine-tuning on advanced models like GPT-4.1 is still limited. And while price and context windows are major differentiators from open-source models, dependability remains the sticking point.

Assurance meets insurance: Looking ahead, Mataciunas imagines a future where reliability reaches the point where AI can actually be insured. "After we reduce hallucination as much as possible, then we can implement insurance on AI agents," he says. "If we reduce hallucination to 99%, that 1% could be insured by insurance companies." The idea of covering AI risks is gaining traction as businesses search for ways to manage the uncertainty that still shadows deployment.

Stay on the rails: Until reliability improves, Mataciunas is leaning on safeguards. "We never let LLMs just do any calculations," he explains. "Everything goes to a calculator." Despite using top-tier models, "it’s still hallucinating a lot on different tasks." His hope, and a clear call to action for model providers, is that "reducing hallucination guardrails is the next step. I hope so for OpenAI, because right now, it’s a huge problem. Not the cost."
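To make the "everything goes to a calculator" safeguard concrete, here is a minimal Python sketch of one way such a guardrail could look. It is not AQ22's implementation, which the article does not describe. The function name `calculate`, the set of allowed operators, and the loan figures in the usage note are all assumptions chosen for illustration. The idea is that the model only proposes an arithmetic expression as text, and deterministic code, not the model, produces the number.

```python
import ast
import operator
from decimal import Decimal

# Whitelisted arithmetic operators (an assumption for this sketch);
# anything outside this set is rejected rather than evaluated.
_BINARY_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}
_UNARY_OPS = {ast.UAdd: operator.pos, ast.USub: operator.neg}


def calculate(expression: str) -> Decimal:
    """Hypothetical calculator tool: deterministically evaluate plain arithmetic."""
    tree = ast.parse(expression, mode="eval")  # raises SyntaxError on malformed input
    return _evaluate(tree.body)


def _evaluate(node: ast.AST) -> Decimal:
    # Numeric literals become Decimals so results are not subject to float drift.
    if isinstance(node, ast.Constant):
        if isinstance(node.value, bool) or not isinstance(node.value, (int, float)):
            raise ValueError("only numeric literals are allowed")
        return Decimal(str(node.value))
    # Binary operations are evaluated recursively, whitelisted operators only.
    if isinstance(node, ast.BinOp) and type(node.op) in _BINARY_OPS:
        return _BINARY_OPS[type(node.op)](_evaluate(node.left), _evaluate(node.right))
    # Unary plus and minus.
    if isinstance(node, ast.UnaryOp) and type(node.op) in _UNARY_OPS:
        return _UNARY_OPS[type(node.op)](_evaluate(node.operand))
    # Names, function calls, attribute access and everything else are refused.
    raise ValueError(f"unsupported expression element: {type(node).__name__}")
```

In a setup like this, an agent asked for one month of interest on a hypothetical 1,200,000 loan at 6.5% a year would emit the string `"1200000 * 0.065 / 12"` and pass it to `calculate`, so the figure in the final answer comes from the calculator. Anything that is not plain arithmetic raises an error instead of being guessed at.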

Brought to you by Lorikeet

ABN: 53 669 390 149
