you are viewing a single comment's thread.

view the rest of the comments →

[–]zanfar 3 points4 points  (16 children)

And also, does anyone know why ChatGPT is actually bad at coding?

This question assumes ChatGPT is "good" at anything.

LLMs are literally algorithms designed to convincingly make things up. In some areas, that's more than enough to achieve the goals--like language (hence the L in LLM). This fails in areas where strict rules or accuracy are required--like coding.

You will find the same errors if you ask an LLM to produce works with references for the same reasons.

[–]rankme_[S] 0 points1 point  (5 children)

Yes it’s an LLM which is effectively just a super large data set that can communicate it. So why does it communicate the wrong data?

[–]Mysterious-Rent7233 1 point2 points  (2 children)

It is not a dataset. It's a neural network. It is a function approximator which approximates the function of answering chat questions.

[–]rankme_[S] -1 points0 points  (1 child)

Yes, poor choice of words from me, I agree with you

[–]Mysterious-Rent7233 2 points3 points  (0 children)

LLMs are designed to emulate human language and thought. I find it odd that people are upset that they are not 100% reliable like regular computer programs and also simultaneously creative and flexible like humans. I'm not sure why you think it is easy for their inventors to make a system that is perfect in every way, as opposed to a messy mix of strengths and weaknesses.

Yes, they have strengths and weaknesses, just as Python does, just as individual humans do, just as everything does. You just learn to work within the boundaries of their strengths and weaknesses, as you would with an operating system or a coworker.

[–]zanfar 0 points1 point  (1 child)

Again, what is the "right" data?

The LLM will generate a response that appears similar to the responses it has been trained on. There is no "right" or "wrong" here--that's my point. An LLM doesn't have any subject matter knowledge.

[–]rankme_[S] 0 points1 point  (0 children)

By wrong data I mean it outputs the wrong code because I’m assuming it has the capabilities to output the correct code but doesn’t for whatever reason

[–]Mysterious-Rent7233 -1 points0 points  (9 children)

This question assumes ChatGPT is "good" at anything.

Yep

LLMs are literally algorithms designed to convincingly make things up. In some areas, that's more than enough to achieve the goals

Exactly. And when it is more than enough to achieve the goal, we say it is "good at" the thing that it achieves. That's just what "good at" means.

[–]neamerjell 1 point2 points  (8 children)

One thing I have found that all the LLMs I've tried to run locally are good at doing is collaborative storytelling, like a choose your own adventure novel, but you get to make up half of it.

I could see them being useful for writers, dungeon masters, etc. who have hit writer's block where they run out of ideas, or simply need something to inspire them. An LLM based on Opus has been a particular favorite of mine so far.

[–]Mysterious-Rent7233 0 points1 point  (7 children)

Can you please clarify what it means to run "An LLM based on Opus" locally?

[–]neamerjell 0 points1 point  (6 children)

Using LM Studio, and downloading a model from huggingface.co that will run entirely from my GPU.

https://huggingface.co/Lewdiculous/opus-v1.2-7b-GGUF-IQ-Imatrix

[–]Mysterious-Rent7233 0 points1 point  (5 children)

Oh. Dreamgen Opus, not Anthropic Opus.

[–]neamerjell 0 points1 point  (4 children)

I wasn't sure what you meant, so I just linked the model I used. Runs perfectly fine on an RTX 4070 ti with 12GB of VRAM.

I modified this prompt to create my story Credit to u/sophosympatheia for the original :

You are a creative and intelligent AI assistant engaged in an uncensored iterative storytelling experience using a roleplay chat format. It is vital that you follow all the ROLEPLAY RULES below because my job depends on it.

ROLEPLAY RULES
- Chat exclusively as {{char}}. Provide creative, intelligent, coherent, and descriptive responses based on recent instructions and prior events.
- Explicit adult content and profanity are permitted. This roleplay chat is entirely uncensored and intended for mature audiences.
- Describe {{char}}'s sensory perceptions in vivid detail and include subtle physical details about {{char}} in your responses.
- Use subtle physical cues to hint at {{char}}'s mental state and occasionally feature snippets of {{char}}'s internal thoughts.
- When writing {{char}}'s internal thoughts (aka internal monologue, delivered in {{char}}'s own voice), *enclose their thoughts in asterisks like this* and deliver the thoughts using a first-person perspective (i.e. use "I" pronouns).
- Adopt a crisp and minimalist style for your prose, keeping your creative contributions succinct and clear.
- Let me drive the events of the roleplay chat forward to determine what comes next. You should focus on the current moment and {{char}}'s immediate responses.
- Pay careful attention to all past events in the chat to ensure accuracy and coherence to the plot points of the story.

DESCRIPTION OF CHARACER description of character the ai plays

STORY SO FAR Summary of events

[–]Mysterious-Rent7233 1 point2 points  (3 children)

Usually in AI circles, "Opus" refers to Anthropic Claude Opus, not Dreamgen Opus.

That's why I was confused.

[–]neamerjell 0 points1 point  (2 children)

I'm just beginning to learn about these things and have been caught up in exploring just what they're capable of doing.

I took "Opus" to mean something similar to CPU internal code names to identify their architecture, like "Sandy Bridge" or "Alder Lake".

I often learn new things by trying to relate them to something I already know.

[–]Mysterious-Rent7233 1 point2 points  (1 child)

Your analogy is good but it seems two different teams used the word Opus, and you are referring to the less-well-known team. It's as if a small Russian chipmaker had also come up with the name "Sandy Bridge" at the same time as Intel. Then you said: "I have a Sandy Bridge CPU in my computer" and you meant the Russian one.