• Latest
  • Trending
  • All
  • Market Updates
  • Cryptocurrency
  • Blockchain
  • Investing
  • Commodities
  • Personal Finance
  • Technology
  • Business
  • Real Estate
  • Finance
Researchers Propose a Better Way to Report Dangerous AI Flaws

Researchers Propose a Better Way to Report Dangerous AI Flaws

March 14, 2025
UK Seizes Crypto ATMs As Global Scrutiny Grows Over Unregulated Kiosks

UK Seizes Crypto ATMs As Global Scrutiny Grows Over Unregulated Kiosks

July 21, 2025
Google just teased its new flagship phone early – Here’s what we’ve gathered

Google just teased its new flagship phone early – Here’s what we’ve gathered

July 21, 2025
Iran said cannot abandon its nuclear enrichment program – 'national pride'

Iran said cannot abandon its nuclear enrichment program – 'national pride'

July 21, 2025
Stocks making the biggest moves midday: XYZ, SEDG, CLF, VZ

Stocks making the biggest moves midday: XYZ, SEDG, CLF, VZ

July 21, 2025
Trader Who Called Bitcoin, Ethereum, Solana Bottom In April Now Warns Local Top Likely In August

Trader Who Called Bitcoin, Ethereum, Solana Bottom In April Now Warns Local Top Likely In August

July 21, 2025
Trump’s Media Company Reports $2B BTC After Crypto Bills Pass US House

Trump’s Media Company Reports $2B BTC After Crypto Bills Pass US House

July 21, 2025
Why AI is moving from chatbots to the browser

Why AI is moving from chatbots to the browser

July 21, 2025
Alphabet highlights the earnings calendar this week

Alphabet highlights the earnings calendar this week

July 21, 2025
Need a new laptop for the office? Save $500 on the Dell 16 Plus and improve your workflow

Need a new laptop for the office? Save $500 on the Dell 16 Plus and improve your workflow

July 21, 2025
Crypto Tax Cuts Could Unleash Bitcoin Buying Spree In Japan

Crypto Tax Cuts Could Unleash Bitcoin Buying Spree In Japan

July 21, 2025
Volatility Master – User Manual (Intraquotes Product) – Trading Strategies – 21 July 2025

Volatility Master – User Manual (Intraquotes Product) – Trading Strategies – 21 July 2025

July 21, 2025
What It Takes to Feel Wealthy Today Is Less Than Before

What It Takes to Feel Wealthy Today Is Less Than Before

July 21, 2025
Monday, July 21, 2025
No Result
View All Result
InvestorNewsToday.com
  • Home
  • Market
  • Business
  • Finance
  • Investing
  • Real Estate
  • Commodities
  • Crypto
  • Blockchain
  • Personal Finance
  • Tech
InvestorNewsToday.com
No Result
View All Result
Home Technology

Researchers Propose a Better Way to Report Dangerous AI Flaws

by Investor News Today
March 14, 2025
in Technology
0
Researchers Propose a Better Way to Report Dangerous AI Flaws
491
SHARES
1.4k
VIEWS
Share on FacebookShare on Twitter


In late 2023, a crew of third-party researchers found a troubling glitch in OpenAI’s broadly used synthetic intelligence mannequin GPT-3.5.

When requested to repeat sure phrases a thousand instances, the mannequin started repeating the phrase time and again, then immediately switched to spitting out incoherent textual content and snippets of non-public data drawn from its coaching information, together with components of names, telephone numbers, and electronic mail addresses. The crew that found the issue labored with OpenAI to make sure the flaw was fastened earlier than revealing it publicly. It is only one of scores of issues present in main AI fashions lately.

In a proposal launched at this time, greater than 30 outstanding AI researchers, together with some who discovered the GPT-3.5 flaw, say that many different vulnerabilities affecting well-liked fashions are reported in problematic methods. They counsel a brand new scheme supported by AI corporations that provides outsiders permission to probe their fashions and a option to disclose flaws publicly.

“Proper now it is somewhat little bit of the Wild West,” says Shayne Longpre, a PhD candidate at MIT and the lead creator of the proposal. Longpre says that some so-called jailbreakers share their strategies of breaking AI safeguards the social media platform X, leaving fashions and customers in danger. Different jailbreaks are shared with just one firm regardless that they may have an effect on many. And a few flaws, he says, are saved secret due to concern of getting banned or dealing with prosecution for breaking phrases of use. “It’s clear that there are chilling results and uncertainty,” he says.

The safety and security of AI fashions is massively necessary given broadly the expertise is now getting used, and the way it might seep into numerous purposes and companies. Highly effective fashions must be stress-tested, or red-teamed, as a result of they will harbor dangerous biases, and since sure inputs may cause them to interrupt freed from guardrails and produce disagreeable or harmful responses. These embody encouraging susceptible customers to interact in dangerous conduct or serving to a nasty actor to develop cyber, chemical, or organic weapons. Some consultants concern that fashions might help cyber criminals or terrorists, and will even activate people as they advance.

The authors counsel three fundamental measures to enhance the third-party disclosure course of: adopting standardized AI flaw studies to streamline the reporting course of; for large AI companies to supply infrastructure to third-party researchers disclosing flaws; and for growing a system that permits flaws to be shared between completely different suppliers.

The strategy is borrowed from the cybersecurity world, the place there are authorized protections and established norms for out of doors researchers to reveal bugs.

“AI researchers don’t all the time know the way to disclose a flaw and might’t be sure that their good religion flaw disclosure gained’t expose them to authorized threat,” says Ilona Cohen, chief authorized and coverage officer at HackerOne, an organization that organizes bug bounties, and a coauthor on the report.

Giant AI corporations at the moment conduct in depth security testing on AI fashions previous to their launch. Some additionally contract with outdoors companies to do additional probing. “Are there sufficient individuals in these [companies] to deal with the entire points with general-purpose AI techniques, utilized by a whole bunch of hundreds of thousands of individuals in purposes we have by no means dreamt?” Longpre asks. Some AI corporations have began organizing AI bug bounties. Nonetheless, Longpre says that unbiased researchers threat breaking the phrases of use in the event that they take it upon themselves to probe highly effective AI fashions.



Source link

Tags: dangerousflawsProposeReportresearchers
Share196Tweet123
Previous Post

TDR looks at selling David Lloyd gym chain to itself after exit struggle

Next Post

FTX liquidated $1.5B in 3AC assets 2 weeks before hedge fund’s collapse

Investor News Today

Investor News Today

Next Post
FTX liquidated $1.5B in 3AC assets 2 weeks before hedge fund’s collapse

FTX liquidated $1.5B in 3AC assets 2 weeks before hedge fund’s collapse

  • Trending
  • Comments
  • Latest
Equinor scales back renewables push 7 years after ditching ‘oil’ from its name

Equinor scales back renewables push 7 years after ditching ‘oil’ from its name

February 5, 2025
Niels Troost has a staggering story to tell about how he got sanctioned

Niels Troost has a staggering story to tell about how he got sanctioned

December 14, 2024
Best High-Yield Savings Accounts & Rates for January 2025

Best High-Yield Savings Accounts & Rates for January 2025

January 3, 2025
Suleiman Levels limited V 3.00 Update and Offer – Analytics & Forecasts – 5 January 2025

Suleiman Levels limited V 3.00 Update and Offer – Analytics & Forecasts – 5 January 2025

January 5, 2025
Why America’s economy is soaring ahead of its rivals

Why America’s economy is soaring ahead of its rivals

0
Dollar climbs after Donald Trump’s Brics tariff threat and French political woes

Dollar climbs after Donald Trump’s Brics tariff threat and French political woes

0
Nato chief Mark Rutte’s warning to Trump

Nato chief Mark Rutte’s warning to Trump

0
Top Federal Reserve official warns progress on taming US inflation ‘may be stalling’

Top Federal Reserve official warns progress on taming US inflation ‘may be stalling’

0
UK Seizes Crypto ATMs As Global Scrutiny Grows Over Unregulated Kiosks

UK Seizes Crypto ATMs As Global Scrutiny Grows Over Unregulated Kiosks

July 21, 2025
Google just teased its new flagship phone early – Here’s what we’ve gathered

Google just teased its new flagship phone early – Here’s what we’ve gathered

July 21, 2025
Iran said cannot abandon its nuclear enrichment program – 'national pride'

Iran said cannot abandon its nuclear enrichment program – 'national pride'

July 21, 2025
Stocks making the biggest moves midday: XYZ, SEDG, CLF, VZ

Stocks making the biggest moves midday: XYZ, SEDG, CLF, VZ

July 21, 2025

Live Prices

© 2024 Investor News Today

No Result
View All Result
  • Home
  • Market
  • Business
  • Finance
  • Investing
  • Real Estate
  • Commodities
  • Crypto
  • Blockchain
  • Personal Finance
  • Tech

© 2024 Investor News Today