• Latest
  • Trending
  • All
  • Market Updates
  • Cryptocurrency
  • Blockchain
  • Investing
  • Commodities
  • Personal Finance
  • Technology
  • Business
  • Real Estate
  • Finance
I tested GPT-5’s coding skills, and it was so bad that I’m sticking with GPT-4o (for now)

I tested GPT-5’s coding skills, and it was so bad that I’m sticking with GPT-4o (for now)

August 11, 2025
AUD/USD extends rally as Fed rate cut bets rise, RBA decision looms

AUD/USD extends rally as Fed rate cut bets rise, RBA decision looms

August 11, 2025
LayerZero Plans $110M Acquisition of Stargate Bridge

LayerZero Plans $110M Acquisition of Stargate Bridge

August 11, 2025
Samson Mow Says Ethereum A ‘Vehicle’ For People To Buy More Bitcoin: ‘No One Wants ETH In The Long Run’

Samson Mow Says Ethereum A ‘Vehicle’ For People To Buy More Bitcoin: ‘No One Wants ETH In The Long Run’

August 11, 2025
Inside Dylan Field’s Big IPO—and His Even Bigger Plans for Figma

Inside Dylan Field’s Big IPO—and His Even Bigger Plans for Figma

August 11, 2025
Soft Manager – Trading Ideas – 5 August 2025

Can EA-ThinkBot IQ7 Predator be proven?! – Other – 11 August 2025

August 11, 2025
investingLive Asia-Pacific FX news wrap: Nvidia, AMD 15% China chip sale revenues to US

investingLive Asia-Pacific FX news wrap: Nvidia, AMD 15% China chip sale revenues to US

August 11, 2025
Trump says hopes China will quickly x4 it orders for soybeans (quadruple)

Trump says hopes China will quickly x4 it orders for soybeans (quadruple)

August 11, 2025
Bitcoin or Ethereum? – Why Peter Schiff’s pick may surprise you

Bitcoin or Ethereum? – Why Peter Schiff’s pick may surprise you

August 11, 2025
Crypto Exec Warns About Rising Tide of Wrench Attacks

Crypto Exec Warns About Rising Tide of Wrench Attacks

August 11, 2025
Join Our Next Livestream: What GPT-5 Means for ChatGPT Users

Join Our Next Livestream: What GPT-5 Means for ChatGPT Users

August 11, 2025
ES and NQ (and others!) are open on Globex, both are up

ES and NQ (and others!) are open on Globex, both are up

August 10, 2025
‘INSIDE INFORMATION,’ ‘Is Anybody Looking Into This?’

‘INSIDE INFORMATION,’ ‘Is Anybody Looking Into This?’

August 10, 2025
Monday, August 11, 2025
No Result
View All Result
InvestorNewsToday.com
  • Home
  • Market
  • Business
  • Finance
  • Investing
  • Real Estate
  • Commodities
  • Crypto
  • Blockchain
  • Personal Finance
  • Tech
InvestorNewsToday.com
No Result
View All Result
Home Blockchain

I tested GPT-5’s coding skills, and it was so bad that I’m sticking with GPT-4o (for now)

by Investor News Today
August 11, 2025
in Blockchain
0
I tested GPT-5’s coding skills, and it was so bad that I’m sticking with GPT-4o (for now)
491
SHARES
1.4k
VIEWS
Share on FacebookShare on Twitter


code

Vaselena/Getty Pictures

ZDNET’s key takeaways

  • OpenAI’s new GPT-5 flagship failed half of my programming assessments.
  • Earlier OpenAI releases have had nearly good outcomes.
  • Now that OpenAI has enabled fallbacks to different LLMs, there are alternatives.

So GPT-5 occurred. It is out. It is launched. It is the speak of the digital city. And it is acquired some issues. I am not gonna bury the lede. GPT-5 has failed half of my programming assessments. That is the worst that OpenAI’s flagship LLM has ever finished on my fastidiously designed assessments.

Additionally: The most effective AI for coding in 2025 (and what to not use)

Earlier than I get into the main points, let’s take a second to debate one different little function that is additionally a bit wonky. Try the brand new Edit button on the highest of the code dumps it generates.

edit-button
Screenshot by David Gewirtz/ZDNET

Clicking the Edit button takes you into a pleasant little code editor. Right here, I changed the Writer area, proper in ChatGPT’s outcomes.

editor
Screenshot by David Gewirtz/ZDNET

That appeared good, however it finally proved futile. After I closed the editor, it requested me if I needed to save lots of. I did. Then this unhelpful message confirmed up.

wonky-save
Screenshot by David Gewirtz/ZDNET

I by no means did get again to my unique session. I needed to submit my unique immediate once more, and let GPT-5 do its work a second time.

However wait. There’s extra. Let’s dig into my take a look at outcomes…

1. Writing a WordPress plugin

This was my very first take a look at of coding prowess for any AI. It is what gave me that first “the world is about to alter” feeling, and it was finished utilizing GPT-3.5.

Subsequent assessments, utilizing the identical immediate however with completely different AI fashions, generated blended outcomes. Some AIs did nice, some did not. Some AIs, like these from Microsoft and Google, improved over time.

Additionally: How I take a look at an AI chatbot’s coding potential – and you’ll, too

ChatGPT’s mannequin has been the gold normal for this take a look at for the reason that very starting. That makes the outcomes of GPT-5 all that rather more curious.

So, look, the precise coding with GPT-5 was partially profitable. GPT-5 generated a single block of code, which I pasted right into a file and was in a position to run. It offered the requisite UI.

After I pasted within the take a look at names, it dynamically up to date the road depend, though it described it as “Line to randomize” as a substitute of “Strains to randomize.”

plugin
Screenshot by David Gewirtz/ZDNET

However then, once I clicked Randomize, it did not. As a substitute, it redirected me to instruments.php. What?? ChatGPT has by no means had an issue with this take a look at, whether or not GPT-3.5, GPT-4, or GPT-4o. You imply to inform me that OpenAI’s much-anticipated GPT-5 is failing proper out of the gate? Ouch.

I then gave GPT-5 this immediate.

After I click on randomize, I am taken to http://testsite.native/wp-admin/instruments.php. I don’t get an inventory of randomized outcomes. Are you able to repair?

The outcome was a line to patch. I am not thrilled with that strategy as a result of it requires the person to dig by code and to make no errors changing a line.

patch
Screenshot by David Gewirtz/ZDNET

So, I requested GPT-5 for a full plugin. It gave me the total textual content of the plugin to repeat and paste. This time, it labored.

plugin2
Screenshot by David Gewirtz/ZDNET

This time, it did randomize the strains. When it encountered duplicates, it separated them from one another, because it was instructed. Lastly.

Additionally: I discovered 5 AI content material detectors that may accurately determine AI textual content 100% of the time

I am sorry, OpenAI. I’ve to fail you on this take a look at. You’ll have handed if the one error was not utilizing the plural of “line” when applicable. However the truth that it gave me again a non-working plugin on the primary strive is fail territory, even when the AI did finally make it work on the second strive.

Regardless of the way you spin it, it is a step again.

2. Rewriting a string perform

This second take a look at is designed to rewrite a string perform to higher verify for {dollars} and cents. The unique code that GPT-5 was requested to rewrite didn’t enable for cents (it solely checked for integers).

test2
Screenshot by David Gewirtz/ZDNET

GPT-5 did effective with this take a look at. It did return a minimal outcome as a result of it did not do any error checking. It did not verify for non-string enter, additional whitespace, 1000’s separators, or foreign money symbols.

However that is not what I requested for. I informed it to rewrite a perform, which itself didn’t have any error checking. GPT-5 did precisely what I requested with no embellishment. I am form of glad of that as a result of it would not know whether or not or not code previous to this routine already did that work.

GPT-5 handed this take a look at.

3. Discovering an annoying bug

This take a look at took place as a result of I used to be scuffling with a less-than-obvious bug in my code. With out going into the weeds about how the WordPress framework works, the plain reply is just not the proper reply.

You want some pretty arcane information about how WordPress filters cross their info. This take a look at has been a stumbling block for various AI LLMs.

Additionally: Gen AI disillusionment looms, based on Gartner’s 2025 Hype Cycle report

GPT-5, nonetheless, like GPT-4 and GPT-4o earlier than it, did perceive the issue. It articulated a transparent resolution.

GPT-5 handed this take a look at.

4. Writing a script

This take a look at asks the AI to include a reasonably obscure Mac scripting device known as Keyboard Maestro, in addition to Apple’s scripting language AppleScript, and Chrome scripting habits.

It is actually a take a look at of the attain of the AI when it comes to information, its understanding of how net pages are constructed, and the power to put in writing code throughout three interlinked environments.

Fairly just a few AIs have failed this take a look at, however the failure level is often a lack of understanding about Keyboard Maestro. GPT-3.5 did not learn about Keyboard Maestro. However ChatGPT has been passing this take a look at since GPT-4. Till now.

The place ought to we begin? Nicely, the excellent news is that GPT-5 dealt with the Keyboard Maestro a part of the issue simply effective. However it acquired the coding so flawed that it even doubled down on its lack of knowledge of how case works in AppleScript.

gpt5-applescript
Screenshot by David Gewirtz/ZDNET

It really invented a property. That is a type of instances the place an AI confidently presents a solution that’s utterly flawed.

Additionally: ChatGPT comes with character presets now – and different upgrades you might need missed

AppleScript is natively case-insensitive. If you’d like AppleScript to concentrate to case, it’s essential use a “contemplating case” block. So, this occurred.

lowercase
Screenshot by David Gewirtz/ZDNET

The rationale the error message referred to the title of one among my articles is as a result of that was the entrance window in Chrome. This perform checks the entrance window and does stuff primarily based on the title.

search-term
Screenshot by David Gewirtz/ZDNET

However misunderstanding how case works wasn’t the one AppleScript error GPT-5 generated. It additionally referenced a variable named searchTerm with out defining it. That is just about an error-creating observe in any programming language.

Fail, fail, fail, McFaildypants.

The web hath spoken

OpenAI appeared to endure from the identical hubris that its AIs do. It confidently moved everybody to GPT-5 and burned the bridges again to GPT-4o. I am paying $200 a month for a ChatGPT Professional account. On Friday, I could not transfer again to GPT-4o for coding work. Neither may anybody else.

There was, nonetheless, only a tiny little bit of person pushback on the entire bridges burning factor. And by tiny, I imply all the frickin’ web. So, by Saturday, ChatGPT had a brand new choice.

revert
Screenshot by David Gewirtz/ZDNET

To get to this, go to your ChatGPT settings and activate “Present legacy fashions.” Then, because it has all the time been, simply drop down the mannequin menu and select the one you need. Notice: this feature is barely obtainable to these on paid tiers. If you happen to’re utilizing ChatGPT at no cost, you will take what you are given, and you will find it irresistible.

Ever for the reason that complete generative AI factor kicked off at first of 2023, ChatGPT has been the gold normal of programming instruments, at the least based on my LLM testing.

Additionally: Microsoft rolls out GPT-5 throughout its Copilot suite – here is the place you will discover it

Now? I am actually unsure. That is solely a day or so after GPT-5 has been launched, so its outcomes will most likely get higher over time. However for now, I am sticking with GPT-4o for coding, though I do just like the deep reasoning capabilities in GPT-5.

What about you? Have you ever tried GPT-5 for programming duties but? Did it carry out higher or worse than earlier variations like GPT-4o or GPT-3.5? Had been you in a position to get working code on the primary strive, or GPT-4o did it’s a must to information it by fixes? Are you going to make use of GPT-5 for coding or follow older fashions? Tell us within the feedback under.


You possibly can comply with my day-to-day challenge updates on social media. Make sure to subscribe to my weekly replace e-newsletter, and comply with me on Twitter/X at @DavidGewirtz, on Fb at Fb.com/DavidGewirtz, on Instagram at Instagram.com/DavidGewirtz, on Bluesky at @DavidGewirtz.com, and on YouTube at YouTube.com/DavidGewirtzTV.





Source link

Tags: badcodingGPT4oGPT5sSkillsstickingtested
Share196Tweet123
Previous Post

Can EA-ThinkBot IQ7 Predator be proven?! – Other – 11 August 2025

Next Post

Inside Dylan Field’s Big IPO—and His Even Bigger Plans for Figma

Investor News Today

Investor News Today

Next Post
Inside Dylan Field’s Big IPO—and His Even Bigger Plans for Figma

Inside Dylan Field’s Big IPO—and His Even Bigger Plans for Figma

  • Trending
  • Comments
  • Latest
Equinor scales back renewables push 7 years after ditching ‘oil’ from its name

Equinor scales back renewables push 7 years after ditching ‘oil’ from its name

February 5, 2025
Niels Troost has a staggering story to tell about how he got sanctioned

Niels Troost has a staggering story to tell about how he got sanctioned

December 14, 2024
Housing to remain weakest part of economy in the 2nd half, Goldman says

Housing to remain weakest part of economy in the 2nd half, Goldman says

August 4, 2025
Best High-Yield Savings Accounts & Rates for January 2025

Best High-Yield Savings Accounts & Rates for January 2025

January 3, 2025
Why America’s economy is soaring ahead of its rivals

Why America’s economy is soaring ahead of its rivals

0
Dollar climbs after Donald Trump’s Brics tariff threat and French political woes

Dollar climbs after Donald Trump’s Brics tariff threat and French political woes

0
Nato chief Mark Rutte’s warning to Trump

Nato chief Mark Rutte’s warning to Trump

0
Top Federal Reserve official warns progress on taming US inflation ‘may be stalling’

Top Federal Reserve official warns progress on taming US inflation ‘may be stalling’

0
AUD/USD extends rally as Fed rate cut bets rise, RBA decision looms

AUD/USD extends rally as Fed rate cut bets rise, RBA decision looms

August 11, 2025
LayerZero Plans $110M Acquisition of Stargate Bridge

LayerZero Plans $110M Acquisition of Stargate Bridge

August 11, 2025
Samson Mow Says Ethereum A ‘Vehicle’ For People To Buy More Bitcoin: ‘No One Wants ETH In The Long Run’

Samson Mow Says Ethereum A ‘Vehicle’ For People To Buy More Bitcoin: ‘No One Wants ETH In The Long Run’

August 11, 2025
Inside Dylan Field’s Big IPO—and His Even Bigger Plans for Figma

Inside Dylan Field’s Big IPO—and His Even Bigger Plans for Figma

August 11, 2025

Live Prices

© 2024 Investor News Today

No Result
View All Result
  • Home
  • Market
  • Business
  • Finance
  • Investing
  • Real Estate
  • Commodities
  • Crypto
  • Blockchain
  • Personal Finance
  • Tech

© 2024 Investor News Today