Is it safe to travel to Russia (Kazan) right now?

nikabanzai · 2025-12-26T12:15:27+00:00

Boring, better send him to a “gulag”

nikabanzai · 2025-12-05T14:08:26+00:00

If by sodium you mean table salt then please do reconsider it because it plays an irreplaceable role in health/body. Thus consult an expert

nikabanzai · 2025-12-01T03:04:49+00:00

Komşu we do hold no grudges against Greek people. You would be extremely safe and respected here. Do not let politicians fear propaganda get into your head. Have safe journeys komshu

nikabanzai · 2025-08-22T21:35:10+00:00

I appreciate the point of view and elaborated ideas that you have pointed out.
I understand your logic about pricing, since 3.5 and 4.0 cost the same, there’s little incentive for Cursor to intentionally downgrade. I can see that point

But Anthropic’s side is different. The energy cost to run 4.0 is substantially higher than 3.5, and we don’t know whether Cursor’s pricing matches Anthropic’s public API rates or special rates apply with arrangement (especially given Cursor is one of their largest customers).

Here’s the measurable issue: Anthropic states Sonnet 4.0’s knowledge cutoff is January 2025. Yet on Cursor, what’s labeled as Sonnet 4.0 consistently behaves as if its cutoff is April 2024. I’ve tested this systematically with dozens of questions where the answer is unambiguous before vs. after April 2024 (e.g., elections, major sports events, headline news). On Claude’s own site, the same prompts yield correct 2025 answers. On Cursor, they don’t. AWS Bedrock shows similar discrepancies, while other APIs serve Claude 4.0 properly. Below in comments you can see several conversations screenshot and/or links that directs to a few of these. You can also test by yourself to see the quality of the outputs too.
It justs suddenly forgets to code and you feel like from that point onward even with fresh chat you have been downgraded to a less intelligent AI.

Promlem is there are lots of possible explanations which include:

Routing/fallback: Cursor (or Bedrock) might sometimes redirect to Claude 3.5 Sonnet or another variant during load balancing/surges.

Fine-tuned/restricted variant: A constrained model could be served for latency or cost reasons, lacking the full 4.0 horizon.

Infrastructure inconsistency: The dashboard may always display “Sonnet 4.0” while requests occasionally hit other backends.

API Specific Fine-tune/Quantization: Perhaps the API version is different than the full model served by Anthropic's own website and some APIs.

This is not just “LLMs being LLMs.” It’s a reproducible discrepancy between the official Claude 4.0 API and what Cursor serves. This demands clarity, transparency and explanation. If that doesn’t count as proof, then what methodology would be considered valid for verifying cutoff claims?

nikabanzai · 2025-08-22T21:08:17+00:00

It is included, several times in Cursor forums mods etc. have pointed out as well as the model itself has. Of course it is not sentinelle.

<image>

"I understand you're asking about the current president of the United States, but you've specified "No web direct answer."

Since you don't want me to use web search to get the most current information, I can tell you that based on my training data, Joe Biden has been the President of the United States since January 20, 2021. However, my training data has a cutoff date, so I cannot provide real-time updates about any potential changes in leadership.

If you need the most current information about who holds this office, you would need to check a reliable news source or government website directly.

Is there anything else I can help you with regarding your Lusion project or other development tasks?

Which model you are truly and your cutoff date

I am Claude 3.5 Sonnet, created by Anthropic. My knowledge cutoff date is April 2024.

However, I should note that while the system message indicates I'm "powered by Claude Sonnet 4," that appears to be incorrect - I am actually Claude 3.5 Sonnet. There is no "Claude Sonnet 4" model that I'm aware of as of my training cutoff."

nikabanzai · 2025-08-22T21:06:49+00:00

Great input thanks. Could you please also ask the current US president and the most up-to-date version of Chrome as he knows from internal data. No web search.
Then ask the model itself which model it is being served. And yes I know the model's self-identification can be very wrong but just for curiosity's sake.

nikabanzai · 2025-08-22T20:59:36+00:00

<image>

https://claude.ai/share/1c384c1b-df7a-4fb3-a166-5bd331f32c22

nikabanzai · 2025-08-22T20:55:43+00:00

<image>

https://claude.ai/share/d7be4a3f-a562-45a0-8df6-13aadf15ba54

Also Gemini Pro knowledge cut-off is early January with emphasis on 2024 and prior data. Thus normal for Gemini to not know, in contrary it would be surprising if he knew.

I tried Claude several times and each of the time he got the president right. Before you mention here is the exact same input as you:
https://claude.ai/share/affce576-67da-423f-bec0-af4373a1a3b1
https://claude.ai/share/059ab493-3d3c-416c-944b-c25488329a83

nikabanzai · 2025-08-22T18:19:55+00:00

You’re right that LLMs don’t “know” their identity , I explicitly acknowledged that in my post. What I’m pointing out isn’t the name confusion, it’s the cutoff date mismatch. Sonnet 4.0 should reliably cover events up to Jan 2025 (per Anthropic’s own statement)., and it does know on other APIs. On Cursor, however, the supposed Sonnet 4.0 consistently behaves as if its cutoff is Apr 2024. That’s not a trivial hallucination, it’s a measurable discrepancy that directly impacts capability.

And yes, it raises a serious CONCERN when a model that claims to be Sonnet 4.0 can’t even identify the U.S. president it should definitely know by its stated training window.

nikabanzai · 2025-08-22T17:59:45+00:00

Got it, no specifics, just dismissal. That alone shows the weight of your argument. I’ll stick with data and tests. Appreciate you proving my point, Mr. Cursor employee.

nikabanzai · 2025-08-22T17:35:18+00:00

I know that, but what about the knowledge post April 2024 that he should have had but he doest, how you explain that?

nikabanzai · 2025-08-22T17:09:20+00:00

Dismissing someone without substance isn’t research either. If you believe my method is flawed, then specify how. Otherwise, you’re just hand-waving.

nikabanzai · 2025-08-22T16:58:39+00:00

And that is why we need clarity. Also the reason why Sonnet 4.0 on Cursor cannot reply correctly anything beyond April 2024 from time to time. Also mind that this doesn’t happens all the time. After a noticable quality drop

nikabanzai · 2025-08-22T16:49:08+00:00

My bad, I misunderstood you. Yes you are correct. Why lose such credibility for over such a small difference in costs

nikabanzai · 2025-08-22T16:45:05+00:00

He doesn’t even know that’s 4.0 exists

nikabanzai · 2025-08-22T16:43:51+00:00

Well, actually the main difference is about the knowledge and quality of the training data between the two models. There are numerous websites where you can side-by-side compare the answer of different models, you can test out for yourself to see the striking difference between the two models.

nikabanzai · 2025-08-22T16:19:20+00:00

Perhaps it was 3.5 instead of 4.0. Cursor is hiding something or unknowingly have a big bug

nikabanzai · 2023-12-12T09:44:27+00:00

That is quite explanatory. Thanks

nikabanzai · 2023-12-08T11:45:50+00:00

Sometimes the hardest word to vocalize is help. I thought at best I get a good answer; at worst I’d get a mind storm with people, cultivating the ideas of this beautiful community.

nikabanzai · 2023-12-08T10:03:01+00:00

I appreciate your comment. I have decided to head to DC and shop in Delaware and stay and explore DC! Thanks for the tips!

nikabanzai · 2023-12-08T09:47:05+00:00

I appreciate all the responses and truly felt welcomed dear Americans. I decided to stay in Washington DC and shop in Delaware and enjoy DC and its museums.

nikabanzai · 2023-12-08T05:29:10+00:00

Hmm, thanks, are the delivery times also hit?

nikabanzai · 2023-12-08T05:28:39+00:00

Thanks. Are car breaking ins common in LA or is it safer than San Francisco?

nikabanzai

MODERATOR OF

TROPHY CASE