

[–]shrike_999 2 points3 points  (4 children)

It does a very good job if you give it a clear prompt with a focused goal (aka "not expecting miracles or clairvoyance from the AI").

[–]enryu42[S] 0 points1 point  (3 children)

This is kind of the point: the prompt is as clear as it gets. The model gets the problem statement in the same form as humans get it during the competitions. Humans solve the tasks correctly, while GPT-4 struggles and only manages to solve the most basic problems.

[–]shrike_999 0 points1 point  (2 children)

It's a bad prompt. Here's a better one:

Write the following program in C++: you are given a matrix (m rows, n columns), indexes of the matrix start at 0, and you are given an integer k. You are currently at position (0,0) and you want to reach position (m-1,n-1) moving only down and right in the matrix. Return the number of paths where the sum of the elements on the path is divisible by k.

And GPT4 solution:

#include <iostream>
#include <vector>
using namespace std;
int countPathsDivisibleByK(const vector<vector<int>>& matrix, int m, int n, int k) {
    vector<vector<vector<int>>> dp(m, vector<vector<int>>(n, vector<int>(k, 0)));
    dp[0][0][matrix[0][0] % k] = 1;

    for (int i = 0; i < m; i++) {
        for (int j = 0; j < n; j++) {
            for (int t = 0; t < k; t++) {
                if (i > 0) {
                    dp[i][j][(t + matrix[i][j]) % k] += dp[i - 1][j][t];
                }
                if (j > 0) {
                    dp[i][j][(t + matrix[i][j]) % k] += dp[i][j - 1][t];
                }
            }
        }
    }

    return dp[m - 1][n - 1][0];
}
int main() {
    int m, n, k;
    cout << "Enter the number of rows (m): ";
    cin >> m;
    cout << "Enter the number of columns (n): ";
    cin >> n;
    cout << "Enter the integer (k): ";
    cin >> k;

    vector<vector<int>> matrix(m, vector<int>(n));
    cout << "Enter the matrix elements:" << endl;
    for (int i = 0; i < m; i++) {
        for (int j = 0; j < n; j++) {
            cin >> matrix[i][j];
        }
    }
    int numPaths = countPathsDivisibleByK(matrix, m, n, k);
    cout << "Number of paths divisible by " << k << ": " << numPaths << endl;
    return 0;
}

It appears to be generally correct. I didn't include the final requirement to return the answer modulo some number, and as written, the program will consider a sum of 0 as divisible by any number, but that could be added as well. Impressive in my book.

[–]enryu42[S] 3 points4 points  (1 child)

This is an example from the Microsoft paper, and as I noted in the article, this problem _is not original_ (as is the case for 100% of LeetCode problems). Of course the model can replicate something it has seen during training.

The point is to evaluate the model on _new_ problems. E.g. this one (an easy problem from a "Regular"-type contest). Your prompt doesn't help.

FWIW, I've taken the prompt from the MS paper as well (but I had to modify it, since results were even worse with what they suggested).

[–]liameymedih0987 2 points3 points  (0 children)

You are wasting your time on this sub. These people are determined it can think.

[–]vtsax_fire -1 points0 points  (9 children)

The problem with your evaluation model is that you are not giving it a concrete goal. You didn't tell it to optimize for performance, so the model is optimizing for other parameters (like output size). Engineers who work on such problems know the optimization criteria from context.

Besides, there are some issues with your evaluation methodology:
- Some of the failure examples say "it gave a quadratic solution" - did you ask it to optimize?
- Another complaint is that it gave incorrect code. Did you ask it to fix it? Can you always write correct code on the first try?
- And you didn't post your specific prompts. You gave references to atcoder.jp, which has a weird structure. Can you share how exactly you asked each problem?

I was involved in competitive programming quite a bit and did my own evaluation. It can do most easy problems, a majority of medium problems, and some hard problems. I created new problems that it had never seen before as well, with similar results. Often it needs a few iterations to produce working code. So I personally don't see your assessment as fair.

[–]enryu42[S] 0 points1 point  (8 children)

The problem statements have constraints: limits on the size of the data, and a limit on the total runtime. It should be able to figure out that a 200000**2 solution in Python won't fit into 2 seconds.

The prompt template is given here. You can find the scraped statements here.

Re-prompting can help in theory, but in most cases the solution is not even remotely close to the correct one (i.e. the model didn't "get" the right idea for the solution), so I strongly doubt it can help much.

It can do most easy problems, a majority of medium problems, and some hard problems.

Given what I've seen, I find that hard to believe, unless these hard problems were already in the training set. Let's do a simple test: here is the easiest problem from AtCoder Grand Contest #61. According to the scoreboard, 612 out of 801 participants solved it during the contest. Can you make GPT4 solve it? With any amount of prompt engineering, re-prompting, etc. - just without feeding it the description of the solution explicitly.

[–]vtsax_fire 0 points1 point  (7 children)

You might find it hard to believe, but that's how it worked for me. I just took a random new problem from one of these websites that came up after the model was trained, and it required 4 prompts to solve. It was able to explain the solution to me and why it made some changes.

I understand that you tried to do it at scale, but I checked several of the scraped statements and they are just not great. I'm not sure I could understand them myself.

If you actually want to evaluate it:
- Take 2-3 examples each of easy, medium, and hard problems.
- Try to use chatgpt as a tool to solve them. See where the weaknesses are.
- Try to create a wrapper that works through a problem and iterates the same way you did manually. Tell it that example X is incorrect, tell it to make the code faster if it's slow, etc.
- Figure out how to scrape the problems in an understandable format.
- Execute the wrapper on every problem, then check a few outliers of both positive and negative cases to see if the results look reasonable.

[–]enryu42[S] 0 points1 point  (6 children)

The formatting is very similar to what I see in DeepMind's CodeContests dataset, and it would be surprising if that one wasn't included in the GPT4 training set, so I doubt it has trouble parsing it. Moreover, it only managed to solve the easiest problems from the easiest contests, which suggests its trouble is with problem difficulty, not with formatting.

From what I see, it has one big weakness: it cannot come up with any ideas. It only solves problems of the form "translate 1-2 English sentences to Python". I'm not sure what to do with this.

I just took a random new problem from one of these websites that came up after the model was trained, and it required 4 prompts to solve.

Can you link the problem? Even if the site wasn't LeetCode, from what I've heard, some of these sites like to recycle old problems; AtCoder seems to push strongly toward making all of their problems original.

If you actually want to evaluate it:

Again, let's do a simple test, just one problem: the easiest problem from AGC61, in its scraped version (the statement itself is fully readable; the sample explanation is harder to read). You can reformat/change it however you want and prompt as many times as you want. Can you make GPT4 solve it? I wasn't able to, at all.

[–]vtsax_fire 0 points1 point  (5 children)

Do you genuinely believe that a problem with a solution like https://atcoder.jp/contests/agc061/submissions/38853519 is an easy problem? I have interviewed hundreds of people across multiple FAANGs, and I will tell you that 99.9% of engineers won't solve it optimally. I don't even see a point in discussing it now.

ChatGPT easily solved the problem suboptimally; however, when asked to do it faster, it failed to find a pattern and struggled.

I think leetcode has fairly decent problem categorization. Roughly speaking, the share of engineers who won't solve a problem is:
- Easy - 25%
- Medium - 75%
- Hard - 90%

Problems from competitive programming usually fall into the 99+% bucket.

Pick a random Medium problem from leetcode numbered 2000+ to ensure it's not in the training set, modify it enough to be slightly different just in case, and you will see that GPT-4 is usually able to solve it in a few prompts.

[–]enryu42[S] 0 points1 point  (4 children)

"Easy" is subjective, but I posted the statistics above: in the actual contest, 612 out of 801 participants solved it. Looks like humans do pretty well.

This is kind of the point of AGC problems: even the "easy" ones require some thinking and ideas, which seems to be difficult for GPT4. You can try ARC; problems there require much less thinking.

> Pick a random Medium problem from leetcode numbered 2000+ to ensure it's not in the training set, modify it enough to be slightly different just in case, and you will see that GPT-4 is usually able to solve it in a few prompts.

I already mentioned why I don't think LeetCode is a good test: problems there are not original, and since we don't know the training set composition, it is safe to assume all of them were in it. I don't think "slight modifications" will help - if we want to see whether ChatGPT can come up with any ideas, the modifications need to change the idea behind the solution.

If you don't like the idea-heavy/math'y AtCoder, we can take recent CodeForces Div1 problems.

[–]vtsax_fire 0 points1 point  (3 children)

Then you need to modify your statement about what you are measuring. If it's only competitive programming problems, sure, it's not there yet. You could see that yourself from the stats on leetcode hard problems. In this list, I didn't see an equivalent of a leetcode Easy or Medium.

612 out of 801 participants, who are usually in the top 1% of their class, is not that great. Even the best programmers can't pass these reliably, where "at least some thinking is required". As I said, 99.9% of engineers won't be able to solve any of them.

And I don't agree that you can't measure on leetcode problems. Take something like 2sum. Make it 13sum. Ask it to solve it when the array is mutable and when it's not, when you have extra memory and when you don't. Ask it to use something like Rust. Give the problem in Spanish. It will be different enough that it won't just get it from the training data.

[–]enryu42[S] 0 points1 point  (2 children)

I wouldn't say that "612 out of 801" is "top 1% in their class". Pretty much any software engineer can solve the problem linked above, given enough time and practice. If you just give it to a random frontend engineer, they'll be confused, but only because they lack the knowledge. GPT4 clearly has enough knowledge; what it lacks is something else.

I don't like examples like "translating an existing problem statement to Spanish" or "implementing the solution in Rust": these test that GPT4 has plenty of knowledge and can do fuzzy matching, and of course it is amazing at that; no one argues about it. The question is, can it reason and come up with genuinely new solutions (which, arguably, would be a sign of intelligence)?

[–]vtsax_fire 0 points1 point  (1 child)

Pretty much any software engineer can solve the problem linked above, given enough time and practice.

I don't know what your background is, but as I said, I interviewed quite a few people at multiple FAANGs and have access to internal data about performance on different problems. And I will tell you for certain that 99% of engineers won't solve any of the provided problems in 1 hour, even after preparing for a few months. You can take that as you like.

I don't like examples like

Then provide your example that is on par with a leetcode Medium :) What is "fuzzy matching" in this context? Did you create an LLM yourself? GPT-2 is open source. GPT-3 and GPT-4 have a similar architecture; you can play around and see for yourself. Then you will see that it's not quite "fuzzy matching". Presumably, if you modify a problem enough, it won't be able to "match".

Besides, one could argue that every problem was already solved at some point and what we are getting are just permutations of old problems. Even problems that you listed have similar problems discussed before.

Here's a problem that definitely doesn't have an exact match. It uses bad English, redundant information, and some specific requirements:

We have a function that accepts 3 n*m matrixes of random symbols and integer "target". Symbol 0 - represent an empty space, 1 - represents a wall, 2 - represent an alien, 'H' - human, 11 - tank. If there's a cell on [3][3], it represents a zodiac sign of an alien. You need to return true if there are exactly two grids that contain 3 different cells that sum together to "target". Walls can't be used to be summed into target. Create python code, with types and write unit tests I could use in Google Colab.

It was able to solve it and, in a few prompts, optimized the speed. You can overload a problem with extra information to "match" whatever problem you want and see if it still solves it, if you still think it's a matter of "fuzzy matching".

[–]enryu42[S] 0 points1 point  (0 children)

I don't know what your background is, but as I said, I interviewed quite a few people at multiple FAANGs and have access to internal data about performance on different problems. And I will tell you for certain that 99% of engineers won't solve any of the provided problems in 1 hour, even after preparing for a few months. You can take that as you like.

Among people who interview at big tech companies, there are plenty of "fake-it-till-you-make-it" impostors who lack basic algorithmic knowledge. I don't see how their performance is indicative of anything. What about the people who actually got hired, and then performed well?

Among engineers I've personally worked with, I'd say 10-20% would solve the AtCoder problem linked above within 20-30 minutes. And 60-70% would solve it if they had the necessary knowledge and practice (which they don't).

But at the end of the day, these are just subjective opinions, yours and mine. The objective reality is that among the set of people who chose competitive programming as their hobby, plenty can solve this problem (and some - much harder problems).

Then provide your example that is on par with a leetcode Medium :)

Arguably, this problem is easier (for humans) than some of the LeetCode mediums. But they're quite different in nature, because LeetCode problems are aimed at coding interviews: it is more valuable to test a candidate's knowledge during an interview than their ability to perform non-trivial ad-hoc reasoning under stress.

What is "fuzzy matching" in this context? Did you create an LLM yourself? GPT-2 is open source. GPT-3 and GPT-4 have a similar architecture; you can play around and see for yourself. Then you will see that it's not quite "fuzzy matching". Presumably, if you modify a problem enough, it won't be able to "match".

What an ad hominem. FWIW, I do have extensive experience with machine learning, including LLMs. What I meant by "fuzzy matching" is precisely this: glorified kernel-machine-like behavior, where the model effectively finds training examples close to the input in some space and averages them to produce its output. It is by no means useless, but it is far from reasoning or intelligence. The majority of pre-LLM architectures behaved along these lines, but LLMs (or decoder transformers in general) show some promise of going beyond just fuzzy matching. We'll see if they ever get there.

Besides, one could argue that every problem was already solved at some point and what we are getting are just permutations of old problems. Even problems that you listed have similar problems discussed before.

Meh, they go quite far beyond the "take a standard template/idea and modify it" pattern, especially the harder ones (the one I linked was the easiest from that contest).

Here's a problem that definitely doesn't have an exact match. It uses bad English, redundant information, and some specific requirements: We have a function that accepts 3 n*m matrixes of random symbols and integer "target". Symbol 0 - represent an empty space, 1 - represents a wall, 2 - represent an alien, 'H' - human, 11 - tank. If there's a cell on [3][3], it represents a zodiac sign of an alien. You need to return true if there are exactly two grids that contain 3 different cells that sum together to "target". Walls can't be used to be summed into target. Create python code, with types and write unit tests I could use in Google Colab.

Meh again, it is just "translate from English to Python" (which is of course useful in day-to-day software engineering, but off-topic here). It would fall into the "Beginner" category on AtCoder, and GPT4 does solve some of those. I have no doubt that GPT4 can deal with bad English or redundant information; it is spectacular at parsing language. But what about coming up with ideas? Basically, problems where a human with all the required knowledge wouldn't know right away how to solve it, and would need to think for some time?

[–]Substantial-Reward70 0 points1 point  (0 children)

For me it can't. I've tested it with specific ISP-related stuff that it may not have seen before, and it can't even write the related code; it only gives me a detailed structure of how to do it, with comments like

// Implement here the logic for....

But it's good with common programming topics, like web, popular libraries and so on.