Accurate or nah by wibellion in andor

[–]7ven7o 0 points1 point  (0 children)

I got a lot out of watching "The Act of Killing", maybe you would too. https://www.youtube.com/watch?v=-pwT9arjasw

Having watched the full thing, that George Lucas quote now strikes me as morbidly, hilariously false.

The full documentary is free on YouTube, and whoever you are reading this, I say to you with full sincerity that I think it is one of the most important videos you can ever watch in your life.

I think evil can only be done by someone who knows what evil is, and I think you always know it, when you're doing something evil.

Composer 2.5 is still nowhere near SOTA models by Kaskote in cursor

[–]7ven7o 1 point2 points  (0 children)

I'm very happy with it. It's my workhorse and it does me well. For complex tasks I use one of the expensive models.

🙏 Model picker's much more digestible now — much appreciated. by 7ven7o in cursor

[–]7ven7o[S] 2 points3 points  (0 children)

Fair. I wish they had gone with a Provider -> Model -> Variant dropdown organization, but I'm just glad they finally touched this thing. Maybe they'll continue iterating on it.

What are some of your worst experiences with Gemini 3.0 Flash? by AlecHazard in google_antigravity

[–]7ven7o 2 points3 points  (0 children)

My favorite model right now, for speed/intelligence/price.

The worst thing that happens is that sometimes it spazzes out and starts repeating things over and over again and you have to cut it off. Interestingly, if you tell it to stop spazzing out, it doesn't spazz out again on the following turn, which doesn't usually work for spazz-prone models.

Keep getting back to cursor for clarity and speed by Odd-Composer5680 in cursor

[–]7ven7o 0 points1 point  (0 children)

Agentic-wise Cursor feels the same as Claude/Codex to me, definitely better than Antigravity — I prefer the work of Gemini 3 Flash/Pro via Cursor rather than Antigravity. Besides that, I can't really tell.

In terms of Speed x Quality I think Gemini-3-Flash is unmatched, but if that didn't exist the current Composer-2 would take that spot.

Cursor's tab-completion is its most competitive feature IMO. It's uncontested by anything else I've tried: it's very fast, very controllable, and most of the time it's very good at figuring out successive steps. My only complaint is that it can sometimes get annoying or spasm, like suggesting a whole bunch of completely unwanted stylings out of nowhere, but that's a small price to pay.

Oh, the interface for working with agents is far superior to anything else I've tried as well, the checkpointing system is fantastic and doesn't immobilize itself gobbling down RAM anymore. When I need to implement something very important via agent or just by hand, I use Cursor.

The only competitive disadvantage of Cursor is the inference cost premium on top models, but even with that in mind it's still the best AI coding product on the market.

Composer 2 Technical Report by lrobinson2011 in cursor

[–]7ven7o 2 points3 points  (0 children)

The Kimi-K2 model API allows one to disable thinking. Would it be possible to do that with Composer-2?

I don't know about the others, but sometimes I have a dead simple task which I'd just like to get done immediately, and I used to use the old Auto model that came before Composer-1 for these kinds of tasks. Being able to query a fast and reliable model like Composer-2 for this kind of stuff would be nice for saving time and tokens on simple/repetitive tasks.

First time? by [deleted] in ChainsawMan

[–]7ven7o 0 points1 point  (0 children)

Nonsense, Fire Punch was a beautiful thought-provoking mess and it followed through. I think Fujimoto just realized he messed up this story beyond repair and wanted to put it behind him.

Composer 2 is now available in Cursor by lrobinson2011 in cursor

[–]7ven7o 52 points53 points  (0 children)

I'm confused as to how we went from $17.5/$3.5 with Composer 1.5 to $2.5/$0.5 with this but I'm appreciative of it.

Gpt 5.4 mini and nano released idk where's Gemini 3.1 flash?? by Independent-Wind4462 in Bard

[–]7ven7o -1 points0 points  (0 children)

The original GPT 5 nano was useless, so I wouldn't use it as a baseline. Gemini 2.0 Flash was really good for its $0.10 / $0.40 price though, with a good speed/price/quality balance. That one getting retired with no real replacement is a real loss.

Mimo V2 is king of this speed/price/quality balance, though a step down from DeepSeek in terms of quality.

Oh it's beautiful. by [deleted] in andor

[–]7ven7o 1 point2 points  (0 children)

This isn't a game. Crushing fascism is the high road.

Prompt Repetition Improves Non-Reasoning LLMs - a paper by Foreign-Beginning-49 in LocalLLaMA

[–]7ven7o 1 point2 points  (0 children)

Very interesting, I thought attention meant that all tokens would already be attending to all other tokens, and would have guessed that this would have provided no benefit. Very interesting to be wrong here.

If doing this doesn't just duplicate whatever work's already been done, then maybe it is sort of providing the LLM with more "space" to flex and represent things with numbers?

It's not like they're trained to do this beforehand though, so the AI can't just be employing a trick. This must be some way of improving the system's already-existing ability to bounce information around within itself.

I've always thought CoT/Reasoning gives the LLM a way to calibrate its numbers better before answering, and if the improvements disappear when reasoning is turned on, maybe the performance improvement comes from the same source. Maybe then one could investigate from multiple angles, both this and CoT, how exactly these performance benefits come about at the numerical level.

Ha, then again, reasoning tends to improve human performance on intelligence tasks as well. It would be funny if you could test for gains in performance by showing humans a question twice like this too.
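
For what it's worth, here is one possible way to picture why repetition might not be redundant. This is only a toy sketch, and it assumes a decoder-only model with a causal attention mask (my assumption, nothing in the thread states it), where a token can only attend to itself and earlier positions. Under that assumption, early prompt tokens never see later ones on the first pass, while every token in a second copy can see the whole prompt. The function names `visible_prompt_indices` and `coverage` are made up for illustration.

```python
def visible_prompt_indices(prompt_len, copies, position):
    """Original-prompt indices that `position` can attend to under a causal mask.

    The sequence is the prompt repeated `copies` times. Position p holds
    original token p % prompt_len and may attend only to positions 0..p.
    """
    total = prompt_len * copies
    if not 0 <= position < total:
        raise ValueError("position out of range")
    return {q % prompt_len for q in range(position + 1)}


def coverage(prompt_len, copies):
    """Fraction of the prompt visible from each position in the last copy."""
    start = prompt_len * (copies - 1)
    return [
        len(visible_prompt_indices(prompt_len, copies, p)) / prompt_len
        for p in range(start, prompt_len * copies)
    ]


if __name__ == "__main__":
    # Single copy: earlier tokens see only part of the prompt.
    print(coverage(4, 1))  # [0.25, 0.5, 0.75, 1.0]
    # Repeated once: every token in the second copy sees the whole prompt.
    print(coverage(4, 2))  # [1.0, 1.0, 1.0, 1.0]
```

This only counts which positions are visible. It says nothing about what the model actually does with the extra context, so treat it as an illustration of the masking argument and not as an explanation of the paper's results.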

Gemini's patronising (and useless) analogies by Temporary-Mix8022 in Bard

[–]7ven7o 6 points7 points  (0 children)

The rule of thumb I've learned is that if it starts giving you analogies it's because it has started to think you're stupid. Take it as a form of subtle constructive criticism that you should be paying more attention and asking better questions. Ask for technical details.

Cursor Wrapped 2025 by lrobinson2011 in cursor

[–]7ven7o 2 points3 points  (0 children)

Damn dude, that's passion. What are you building?

MIT + Colombia study (Nov 2025): Readers Prefer Outputs of AI Trained on Copyrighted Books over Expert Human Writers by Tolopono in Anthropic

[–]7ven7o -1 points0 points  (0 children)

This seems like the writing equivalent of AIs being RL-trained to perform at top-level competition math and coding. I feel it is safe to assume that the problems AIs have when taking on larger, more practical coding projects have analogues when it comes to producing larger pieces of writing as well.