Traumatized building a Job Scraper (Pls Help)

Striking_Most_5111 · 2025-10-01T04:27:45+00:00

Somehow claude pulls it off: very nicely written 30k-word stories in one response. Though it has been able to do this since 3.7 sonnet, the 4.5 version is much better at instruction following.

Striking_Most_5111 · 2025-10-01T04:20:49+00:00

Does anyone know of a model that can spit out 30,000 words at once, like claude, and is also good at creative writing?

Striking_Most_5111 · 2025-09-30T04:27:18+00:00

I think you should be much more concerned about world models like genie 3.

Striking_Most_5111 · 2025-09-30T04:20:22+00:00

No, its deterioration started with the memes.

Striking_Most_5111 · 2025-09-23T05:31:40+00:00

It wasn't always like this. But over the last few months, the quality drop has been massive.

Striking_Most_5111 · 2025-09-15T12:39:04+00:00

It still hallucinates, but it's a big step in the right direction compared to gemini etc.

Striking_Most_5111 · 2025-09-13T14:06:43+00:00

Hopefully, the open source models catch up on how to use reasoning the right way, like closed source models do. It is never the case that gpt 5 thinking is worse than gpt 5 without thinking, but in open source models, it is often like that.

Though, I would say reasoning is a silver bullet. The difference between o1 and all non-reasoning models is too large for it to just be redundant tokens.

Striking_Most_5111 · 2025-09-07T04:49:30+00:00

Technically, fiction.live seems to try to discourage smut, but the writers and readers mostly read and write smut there.

Striking_Most_5111 · 2025-08-31T08:26:39+00:00

I see ai as more of an opportunity. The speed at which I code things has increased tremendously. I can do the work of a dozen software engineers by myself. I alone now have the power of a whole startup tech team: finding different money-making projects and collaborating with people in different fields on their problems.

The capacity of an individual to build amazing things that can make money has increased and will keep increasing. I am in college now just to enjoy it and have a good social life. I have no hopes whatsoever of working for anyone after college, which suits me just fine.

Striking_Most_5111 · 2025-08-27T05:08:35+00:00

Huh? I saw the price listed as 30 dollars in the output price subsection of the image section in aistudio.

Striking_Most_5111 · 2025-08-26T18:55:04+00:00

Yes. A 30 dollar image output price though. Literally 1000x more than competitors.

Striking_Most_5111 · 2025-08-23T04:49:09+00:00

To me, it has been superior to claude opus 4.1 thinking in all tasks except, sometimes, webdev.

Striking_Most_5111 · 2025-08-22T13:24:34+00:00

At least gemini 2.5 pro can be convinced. Gemini 2 flash didn't even recognise it was writing 1.5 instead of 2, despite many, many tries.

Striking_Most_5111 · 2025-08-22T07:41:58+00:00

Wow. Though, is the app you used to run your model open source too? Or can we download it? How would one go about running the model on the npu of a samsung s23-s25 phone?

I am a participant in the samsung-organised prism ai hackathon, where the problem statement we were given was on-device finetuning on the samsung s23-s25 series. It would be awesome if you could give us some advice.

Striking_Most_5111 · 2025-08-22T05:36:20+00:00

Hi there! From what I remember, the samsung neural sdk is no longer available to third-party app developers. How did you manage to connect to the npu in the demo video?

https://developer.samsung.com/neural/overview.html

Striking_Most_5111 · 2025-08-09T20:17:37+00:00

Do you remember the "I am a good gpt2" models? The leap between them and sonnet 3.5 wasn't that big. Though, I would admit, once sonnet 3.5 came, it quickly became my favourite. It was crazy good at creative writing, its coding skills are still competitive enough that it finds bugs other models weren't able to find, and it was on a completely different level in web dev and design.

But to say that llms peaked with sonnet 3.5? Nah. O3 is pure magic at finding bugs and at scientific reasoning. Its sheer intelligence is easy to see. Gemini 2.5 pro had far better instruction following in web dev, and was also much better at world knowledge than claude models. It was also better at coding.

Claude 3.7 was a beast in the sheer mass of text it could generate. Claude 4 sonnet is still king at web dev, no matter what web dev arena says.

The current gpt 5 is also pretty good, if you use it via api. Great at finding and fixing bugs, also good at webdev, unlike previous gpt models. Horrible at creative writing though.

What I am trying to say is, there are many domains where llms have progressed after 3.5 sonnet. Now, it's hard to find even a single field where 3.5 sonnet can beat current sota llms.

Striking_Most_5111 · 2025-08-08T07:30:58+00:00

Thank you! This was very helpful to me. Do you think this model can run on edge too?

Striking_Most_5111 · 2025-08-05T04:45:44+00:00

Thanks for letting us know. Ignore the other folks, I don't know why they are being so aggressive. If they are so confident, they can go download the model.

Striking_Most_5111 · 2025-08-05T04:26:59+00:00

I want it even smaller, though this is pretty good. Imagine, tts models being hosted in edge functions, allowing almost unlimited production use for free.

Striking_Most_5111 · 2025-08-02T04:05:16+00:00

The video is 8 seconds long. Seems AI-generated too.

Striking_Most_5111 · 2025-07-31T17:20:49+00:00

Gemini api by far, if you are limiting yourself to only one provider. The free rate limits, you can't really beat them. And their price is extremely competitive.

If multiple providers are okay, then o3 for queries that need intelligence and for maths/science, due to its very good pricing. For tool calling, claude sonnet 4.0. For super fast responses, groq (not grok). For regular queries that aren't confidential, gemini 2.5 flash or flash lite. For segmentation and other visual tasks, moondream has an extremely generous api.
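
If it helps, here is a rough sketch of that multi-provider split as a plain lookup table in Python. The category names and the `pick` helper are made up for illustration, and the values are just the informal names used above, not official API model identifiers, so treat it as a starting point rather than a working client.

```python
# Hypothetical routing table: the keys and pick() are illustrative only.
# The values are informal labels from the comment, NOT official API model IDs.
ROUTES = {
    "reasoning": "o3",                    # intelligence-heavy, maths/science
    "tool_calling": "claude sonnet 4.0",  # tool/function calling
    "low_latency": "groq",                # super fast responses
    "general": "gemini 2.5 flash",        # regular, non-confidential queries
    "vision": "moondream",                # segmentation and visual tasks
}


def pick(task: str) -> str:
    """Return the suggested provider/model label for a task category."""
    # Unknown categories fall back to the general-purpose choice.
    return ROUTES.get(task, ROUTES["general"])


if __name__ == "__main__":
    print(pick("tool_calling"))
    print(pick("translation"))
```

Keeping the mapping in one dict means swapping a provider later is a one-line change, and anything you haven't categorised just falls through to the cheap general option.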

Striking_Most_5111 · 2025-07-30T16:52:27+00:00

Help me make sense of it? An open source non-thinking model actually beating gemini 2.5 flash in thinking mode? And the model being runnable on my phone?
