Contextual AI’s new AI model crushes GPT-4o in accuracy — here’s why it matters

by Investor News Today
March 5, 2025
in Technology

Contextual AI unveiled its grounded language model (GLM) today, claiming it delivers the highest factual accuracy in the industry by outperforming leading AI systems from Google, Anthropic and OpenAI on a key benchmark for truthfulness.

The startup, founded by the pioneers of retrieval-augmented generation (RAG) technology, reported that its GLM achieved an 88% factuality score on the FACTS benchmark, compared to 84.6% for Google’s Gemini 2.0 Flash, 79.4% for Anthropic’s Claude 3.5 Sonnet and 78.8% for OpenAI’s GPT-4o.

While large language models have transformed enterprise software, factual inaccuracies, often called hallucinations, remain a critical challenge for business adoption. Contextual AI aims to solve this by creating a model specifically optimized for enterprise RAG applications where accuracy is paramount.

“We knew that part of the solution would be a technique called RAG, retrieval-augmented generation,” said Douwe Kiela, CEO and cofounder of Contextual AI, in an exclusive interview with VentureBeat. “And we knew that because RAG is originally my idea. What this company is about is really about doing RAG the right way, to kind of the next level of doing RAG.”

The company’s focus differs significantly from general-purpose models like ChatGPT or Claude, which are designed to handle everything from creative writing to technical documentation. Contextual AI instead targets high-stakes enterprise environments where factual precision outweighs creative flexibility.

“If you have a RAG problem and you’re in an enterprise setting in a highly regulated industry, you have no tolerance whatsoever for hallucination,” explained Kiela. “The same general-purpose language model that is useful for the marketing department is not what you want in an enterprise setting where you are much more sensitive to mistakes.”

A benchmark comparison showing Contextual AI’s new grounded language model (GLM) outperforming competitors from Google, Anthropic and OpenAI on factual accuracy tests. The company claims its specialized approach reduces AI hallucinations in enterprise settings. (Credit: Contextual AI)

How Contextual AI makes ‘groundedness’ the new gold standard for enterprise language models

The concept of “groundedness” (ensuring AI responses stick strictly to information explicitly provided in the context) has emerged as a critical requirement for enterprise AI systems. In regulated industries like finance, healthcare and telecommunications, companies need AI that either delivers accurate information or explicitly acknowledges when it doesn’t know something.

Kiela offered an example of how this strict groundedness works: “If you give a recipe or a formula to a standard language model, and somewhere in it, you say, ‘but this is only true for most cases,’ most language models are still just going to give you the recipe assuming it’s true. But our language model says, ‘Actually, it only says that this is true for most cases.’ It’s capturing this additional bit of nuance.”

The ability to say “I don’t know” is a crucial one for enterprise settings. “Which is really a very powerful feature, if you think about it in an enterprise setting,” Kiela added.

Contextual AI’s RAG 2.0: A more integrated way to process company information

Contextual AI’s platform is built on what it calls “RAG 2.0,” an approach that moves beyond simply connecting off-the-shelf components.

“A typical RAG system uses a frozen off-the-shelf model for embeddings, a vector database for retrieval, and a black-box language model for generation, stitched together through prompting or an orchestration framework,” according to a company statement. “This leads to a ‘Frankenstein’s monster’ of generative AI: the individual components technically work, but the whole is far from optimal.”
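
To make the “stitched together” pattern concrete, here is a minimal sketch of such a pipeline in Python. It is illustrative only and does not represent Contextual AI’s code or any vendor’s API: `embed`, `VectorStore` and `build_prompt` are hypothetical stand-ins, with bag-of-words counts in place of a real embedding model and a brute-force search in place of a real vector database.

```python
import math
from collections import Counter


def embed(text):
    """Hypothetical stand-in for a frozen embedding model: bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class VectorStore:
    """Hypothetical stand-in for a vector database: brute-force similarity search."""

    def __init__(self, docs):
        self.items = [(doc, embed(doc)) for doc in docs]

    def search(self, query, k=2):
        q = embed(query)
        ranked = sorted(self.items, key=lambda item: cosine(q, item[1]), reverse=True)
        return [doc for doc, _ in ranked[:k]]


def build_prompt(question, passages):
    """The stitching step: retrieved text is pasted into a prompt for a black-box model."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


docs = [
    "refunds are processed within 14 days",
    "the office is closed on public holidays",
    "refunds require a receipt",
]
store = VectorStore(docs)
question = "how long do refunds take"
top = store.search(question, k=2)
prompt = build_prompt(question, top)
print(prompt)
```

Each piece here is built and tuned in isolation: the embedder knows nothing about the generator, and the generator only ever sees whatever text the retriever happened to return. That separation is what the company statement describes.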

Instead, Contextual AI jointly optimizes all components of the system. “We have this mixture-of-retrievers component, which is really a way to do intelligent retrieval,” Kiela explained. “It looks at the question, and then it thinks, essentially, like most of the latest generation of models, it thinks, [and] first it plans a strategy for doing a retrieval.”

This entire system works in coordination with what Kiela calls “the best re-ranker in the world,” which helps prioritize the most relevant information before sending it to the grounded language model.
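
The general retrieve-then-re-rank idea can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not Contextual AI’s re-ranker or its mixture-of-retrievers: `keyword_retriever` and `rerank` are hypothetical helpers, and the term-overlap score stands in for a learned relevance model.

```python
def keyword_retriever(query, docs):
    """Hypothetical first-stage retriever: keep any document sharing a word with the query."""
    terms = set(query.lower().split())
    return [d for d in docs if terms & set(d.lower().split())]


def rerank(query, candidates, top_k=2):
    """Toy re-ranker: score each candidate by the fraction of query terms it covers."""
    terms = set(query.lower().split())

    def score(doc):
        return len(terms & set(doc.lower().split())) / len(terms)

    # sorted() is stable, so ties keep their first-stage order.
    return sorted(candidates, key=score, reverse=True)[:top_k]


docs = [
    "quarterly revenue grew in europe",
    "revenue policy for europe and asia updated this quarter",
    "employee handbook",
    "europe office revenue report for the quarterly review",
]
query = "quarterly revenue europe"

candidates = keyword_retriever(query, docs)    # broad, recall-oriented first pass
context = rerank(query, candidates, top_k=2)   # narrow, precision-oriented second pass
print(context)
```

Only the passages in `context` would be handed to the language model. The first stage casts a wide net and the second decides what is actually worth the model’s attention.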

Beyond plain text: Contextual AI now reads charts and connects to databases

While the newly announced GLM focuses on text generation, Contextual AI’s platform has recently added support for multimodal content including charts, diagrams and structured data from popular platforms like BigQuery, Snowflake, Redshift and Postgres.

“The most challenging problems in enterprises are at the intersection of unstructured and structured data,” Kiela noted. “What I’m mostly excited about is really this intersection of structured and unstructured data. Most of the really exciting problems in large enterprises are smack bang at the intersection of structured and unstructured, where you have some database records, some transactions, maybe some policy documents, maybe a bunch of other things.”

The platform already supports a variety of complex visualizations, including circuit diagrams in the semiconductor industry, according to Kiela.

Contextual AI’s future plans: Creating more reliable tools for everyday business

Contextual AI plans to release its specialized re-ranker component shortly after the GLM launch, followed by expanded document-understanding capabilities. The company also has experimental features for more agentic capabilities in development.

Founded in 2023 by Kiela and Amanpreet Singh, who previously worked at Meta’s Fundamental AI Research (FAIR) team and Hugging Face, Contextual AI has secured customers including HSBC, Qualcomm and the Economist. The company positions itself as helping enterprises finally realize concrete returns on their AI investments.

“This is really an opportunity for companies who are maybe under pressure to start delivering ROI from AI to start more specialized solutions that actually solve their problems,” Kiela said. “And part of that really is having a grounded language model that is maybe a bit more boring than a standard language model, but it’s really good at making sure that it’s grounded in the context and that you can really trust it to do its job.”
