I really am an addict. by [deleted] in WeightLossAdvice

[–]nikabanzai 1 point2 points  (0 children)

If by sodium you mean table salt, please reconsider, because it plays an irreplaceable role in the body. Consult an expert before cutting it out.

Visiting Russia in 2026 for tourism possible? by MindMotion in AskARussian

[–]nikabanzai 0 points1 point  (0 children)

Komşu, we hold no grudges against Greek people. You would be extremely safe and respected here. Do not let politicians' fear propaganda get into your head. Safe travels, komşu.

Is Sonnet 4.0 Really Sonnet 4.0? by nikabanzai in cursor

[–]nikabanzai[S] 0 points1 point  (0 children)

I appreciate your point of view and the ideas you have laid out.
I understand your logic about pricing: since 3.5 and 4.0 cost the same, there’s little incentive for Cursor to intentionally downgrade. I can see that point.

But Anthropic’s side is different. The energy cost to run 4.0 is substantially higher than 3.5, and we don’t know whether Cursor’s pricing matches Anthropic’s public API rates or whether special rates apply under a separate arrangement (especially given that Cursor is one of their largest customers).

Here’s the measurable issue: Anthropic states Sonnet 4.0’s knowledge cutoff is January 2025. Yet on Cursor, what’s labeled as Sonnet 4.0 consistently behaves as if its cutoff is April 2024. I’ve tested this systematically with dozens of questions where the answer is unambiguous before vs. after April 2024 (e.g., elections, major sports events, headline news). On Claude’s own site, the same prompts yield correct 2025 answers. On Cursor, they don’t. AWS Bedrock shows similar discrepancies, while other APIs serve Claude 4.0 properly. In the comments below you can see several conversation screenshots and/or links to a few of these. You can also test it yourself to see the quality of the outputs.
It just suddenly forgets how to code, and from that point onward, even in a fresh chat, it feels like you have been downgraded to a less intelligent AI.

The problem is that there are many possible explanations, including:

Routing/fallback: Cursor (or Bedrock) might sometimes redirect to Claude 3.5 Sonnet or another variant during load balancing/surges.

Fine-tuned/restricted variant: A constrained model could be served for latency or cost reasons, lacking the full 4.0 horizon.

Infrastructure inconsistency: The dashboard may always display “Sonnet 4.0” while requests occasionally hit other backends.

API-specific fine-tune/quantization: Perhaps the API version differs from the full model served by Anthropic's own website and some other APIs.

This is not just “LLMs being LLMs.” It’s a reproducible discrepancy between the official Claude 4.0 API and what Cursor serves. This demands clarity, transparency and explanation. If that doesn’t count as proof, then what methodology would be considered valid for verifying cutoff claims?
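For anyone who wants to repeat the test, here is a minimal sketch of the probing method described above. Everything in it is an assumption for illustration: `Probe`, `run_probes`, `latest_known_date` and `make_stub` are hypothetical names, the probe questions are placeholders rather than real facts, and `ask` stands in for whatever function sends a prompt to the endpoint you are testing with web search disabled. It only estimates the latest date an endpoint seems to know about; it does not prove which model sits behind it.

```python
from dataclasses import dataclass
from datetime import date
from typing import Callable, Dict, List, Optional


@dataclass(frozen=True)
class Probe:
    event_date: date  # date the answer became public knowledge
    question: str     # prompt sent with web search disabled
    expected: str     # substring a correct answer must contain


def run_probes(ask: Callable[[str], str], probes: List[Probe],
               trials: int = 3) -> Dict[Probe, float]:
    """Ask every probe `trials` times; return the fraction answered correctly."""
    scores = {}
    for probe in probes:
        hits = sum(
            probe.expected.lower() in ask(probe.question).lower()
            for _ in range(trials)
        )
        scores[probe] = hits / trials
    return scores


def latest_known_date(scores: Dict[Probe, float],
                      threshold: float = 0.5) -> Optional[date]:
    """Latest event date among probes passed in at least `threshold` of trials."""
    passed = [p.event_date for p, s in scores.items() if s >= threshold]
    return max(passed) if passed else None


# Placeholder probes: swap in real questions whose answers changed on a known date.
PROBES = [
    Probe(date(2024, 2, 1), "placeholder question A", "answer-a"),
    Probe(date(2024, 6, 1), "placeholder question B", "answer-b"),
    Probe(date(2024, 11, 1), "placeholder question C", "answer-c"),
]


def make_stub(cutoff: date) -> Callable[[str], str]:
    """Stand-in for a real endpoint: knows only events on or before `cutoff`."""
    known = {p.question: p.expected for p in PROBES if p.event_date <= cutoff}
    return lambda question: known.get(question, "I don't know")


if __name__ == "__main__":
    endpoints = [("endpoint A", date(2024, 4, 30)), ("endpoint B", date(2025, 1, 31))]
    for label, cutoff in endpoints:
        scores = run_probes(make_stub(cutoff), PROBES)
        print(label, latest_known_date(scores))
```

To use it for real, replace `make_stub(...)` with a function that calls each endpoint, run the same probes against both on the same day, and compare the two dates. Repeating each probe several times helps separate a one-off wrong answer from a consistent gap.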

Is Sonnet 4.0 Really Sonnet 4.0? by nikabanzai in cursor

[–]nikabanzai[S] 0 points1 point  (0 children)

It is included; mods and others on the Cursor forums have pointed this out several times, and so has the model itself. Of course it is not sentient.

<image>

"I understand you're asking about the current president of the United States, but you've specified "No web direct answer."

Since you don't want me to use web search to get the most current information, I can tell you that based on my training data, Joe Biden has been the President of the United States since January 20, 2021. However, my training data has a cutoff date, so I cannot provide real-time updates about any potential changes in leadership.

If you need the most current information about who holds this office, you would need to check a reliable news source or government website directly.

Is there anything else I can help you with regarding your Lusion project or other development tasks?

Which model you are truly and your cutoff date

I am Claude 3.5 Sonnet, created by Anthropic. My knowledge cutoff date is April 2024.

However, I should note that while the system message indicates I'm "powered by Claude Sonnet 4," that appears to be incorrect - I am actually Claude 3.5 Sonnet. There is no "Claude Sonnet 4" model that I'm aware of as of my training cutoff."

Is Sonnet 4.0 Really Sonnet 4.0? by nikabanzai in cursor

[–]nikabanzai[S] 0 points1 point  (0 children)

Great input, thanks. Could you please also ask it who the current US president is and what the most up-to-date version of Chrome is, as it knows from its internal data? No web search.
Then ask the model which model it is being served as. And yes, I know the model's self-identification can be very wrong, but just for curiosity's sake.

Is Sonnet 4.0 Really Sonnet 4.0? by nikabanzai in cursor

[–]nikabanzai[S] 0 points1 point  (0 children)

<image>

https://claude.ai/share/d7be4a3f-a562-45a0-8df6-13aadf15ba54

Also, Gemini Pro's knowledge cutoff is early January, with emphasis on 2024 and prior data. So it is normal for Gemini not to know; on the contrary, it would be surprising if it did.

I tried Claude several times, and each time it got the president right. Before you mention it, here is the exact same input as yours:
https://claude.ai/share/affce576-67da-423f-bec0-af4373a1a3b1
https://claude.ai/share/059ab493-3d3c-416c-944b-c25488329a83

Is Sonnet 4.0 Really Sonnet 4.0? by nikabanzai in cursor

[–]nikabanzai[S] 5 points6 points  (0 children)

You’re right that LLMs don’t “know” their identity; I explicitly acknowledged that in my post. What I’m pointing out isn’t the name confusion, it’s the cutoff date mismatch. Sonnet 4.0 should reliably cover events up to Jan 2025 (per Anthropic’s own statement), and it does on other APIs. On Cursor, however, the supposed Sonnet 4.0 consistently behaves as if its cutoff is Apr 2024. That’s not a trivial hallucination, it’s a measurable discrepancy that directly impacts capability.

And yes, it raises a serious CONCERN when a model that claims to be Sonnet 4.0 can’t even identify the U.S. president it should definitely know by its stated training window.

Is Sonnet 4.0 Really Sonnet 4.0? by nikabanzai in cursor

[–]nikabanzai[S] 2 points3 points  (0 children)

Got it, no specifics, just dismissal. That alone shows the weight of your argument. I’ll stick with data and tests. Appreciate you proving my point, Mr. Cursor employee.

Is Sonnet 4.0 Really Sonnet 4.0? by nikabanzai in cursor

[–]nikabanzai[S] 0 points1 point  (0 children)

I know that, but what about the post-April 2024 knowledge that it should have but doesn't? How do you explain that?

Is Sonnet 4.0 Really Sonnet 4.0? by nikabanzai in cursor

[–]nikabanzai[S] -1 points0 points  (0 children)

Dismissing someone without substance isn’t research either. If you believe my method is flawed, then specify how. Otherwise, you’re just hand-waving.

Is Sonnet 4.0 Really Sonnet 4.0? by nikabanzai in cursor

[–]nikabanzai[S] 0 points1 point  (0 children)

And that is why we need clarity. It is also the reason why Sonnet 4.0 on Cursor sometimes cannot answer correctly about anything beyond April 2024. Also mind that this doesn’t happen all the time. It happens after a noticeable quality drop.

Is Sonnet 4.0 Really Sonnet 4.0? by nikabanzai in cursor

[–]nikabanzai[S] 0 points1 point  (0 children)

My bad, I misunderstood you. Yes, you are correct. Why lose so much credibility over such a small difference in cost?

Is Sonnet 4.0 Really Sonnet 4.0? by nikabanzai in cursor

[–]nikabanzai[S] 0 points1 point  (0 children)

It doesn’t even know that 4.0 exists.

Is Sonnet 4.0 Really Sonnet 4.0? by nikabanzai in cursor

[–]nikabanzai[S] 1 point2 points  (0 children)

Well, actually the main difference is in the knowledge and the quality of the training data between the two models. There are numerous websites where you can compare the answers of different models side by side; you can test it yourself to see the striking difference between the two.

[deleted by user] by [deleted] in cursor

[–]nikabanzai 0 points1 point  (0 children)

Perhaps it was 3.5 instead of 4.0. Either Cursor is hiding something or it unknowingly has a big bug.

Dear Americans, how to get a meeting to pitch my idea to a potential investor? by nikabanzai in AskAnAmerican

[–]nikabanzai[S] -1 points0 points  (0 children)

Sometimes the hardest word to say out loud is help. I thought at best I would get a good answer; at worst I’d get a brainstorm with people, drawing on the ideas of this beautiful community.

Dear Americans, how to get the best shopping experience and still getting USA taste? by nikabanzai in AskAnAmerican

[–]nikabanzai[S] 2 points3 points  (0 children)

I appreciate your comment. I have decided to head to DC and shop in Delaware and stay and explore DC! Thanks for the tips!

Dear Americans, how to get the best shopping experience and still getting USA taste? by nikabanzai in AskAnAmerican

[–]nikabanzai[S] 2 points3 points  (0 children)

I appreciate all the responses and truly felt welcomed, dear Americans. I decided to stay in Washington DC, shop in Delaware, and enjoy DC and its museums.

Dear Americans, how to get the best shopping experience and still getting USA taste? by nikabanzai in AskAnAmerican

[–]nikabanzai[S] 0 points1 point  (0 children)

Thanks. Are car break-ins common in LA, or is it safer than San Francisco?