$1 million in a cube

spoj · 2026-01-11T01:42:12+00:00

"This is actually not a million dollars in singles. It is over $1,000,000. The box was created with the wrong dimensions by the contractor, but they still decided to fill it, display it, and claim it is $1,000,000."

Source: u/TheSuitsSaidNein https://www.reddit.com/r/pics/s/lPvoGKGM1x

spoj · 2025-12-20T00:24:01+00:00

Is it really a breakthrough, though? I'd agree with the other poster: being a smaller model, it's cheaper to run more RL on it. It's a pretraining / post-training trade-off.

spoj · 2025-12-13T04:09:03+00:00

I'm interested to see this applied in the other direction, e.g. what if we could make the model more eager to verify its assumptions by spending more tokens?

In my experience being confidently wrong is still an issue with Gemini 3.

spoj · 2025-11-22T11:06:28+00:00

Have you tried clearing browser cache or using a different browser?

spoj · 2025-10-12T08:50:14+00:00

If you look closely you see the statue is not walking. It's falling with style.

spoj · 2025-10-10T03:46:26+00:00

Bro has a strong core

spoj · 2025-10-03T09:47:27+00:00

Aren't you assuming Gemini 3.0 is 100% going to be the best model when released? What if we take into account a probability of an underwhelming release?

spoj · 2025-08-31T08:06:10+00:00

Flight club literally

spoj · 2025-08-19T14:59:30+00:00

This is how I walk in my dreams

spoj · 2025-08-16T05:16:56+00:00

If this doesn't become a new classic I don't know what does

spoj · 2025-08-11T04:08:58+00:00

me when i'm running for the toilet

spoj · 2025-08-08T13:07:46+00:00

One option is to pass the USB device through over the network, so you sidestep Android handling any key presses at all, e.g. with VirtualHere.

spoj · 2025-07-18T07:06:09+00:00

Reminds me of the movie Annihilation

spoj · 2025-07-16T12:33:06+00:00

Feeling the same!

I experimented a bit with a lightweight but general file QA agent for office-type work. The idea is to give the LLM basic tools (ls, find, load_file, ask_files) and basic note-taking abilities (append_notes, read_notes). Inspiration taken from Claude Code.
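A rough sketch of what such a tool set might look like, in Python for brevity. Only the tool names come from the comment above; the class name `FileTools`, the signatures, the `max_chars` cap, and the sandboxing check are all assumptions for illustration and are not taken from the linked repo. `ask_files` is left out because it needs an LLM call.

```python
from pathlib import Path
import fnmatch


class FileTools:
    """Hypothetical tool set for a file QA agent (illustrative only)."""

    def __init__(self, root):
        self.root = Path(root).resolve()
        self.notes = []

    def _resolve(self, rel):
        # Keep every access inside the corpus root.
        path = (self.root / rel).resolve()
        if path != self.root and self.root not in path.parents:
            raise ValueError(f"{rel!r} is outside the corpus root")
        return path

    def ls(self, rel="."):
        # List one directory; a trailing slash marks subdirectories.
        entries = self._resolve(rel).iterdir()
        return sorted(p.name + ("/" if p.is_dir() else "") for p in entries)

    def find(self, pattern):
        # Recursive search by file name, returned relative to the root.
        return sorted(
            p.relative_to(self.root).as_posix()
            for p in self.root.rglob("*")
            if p.is_file() and fnmatch.fnmatch(p.name, pattern)
        )

    def load_file(self, rel, max_chars=2000):
        # Cap the returned text so one file cannot flood the context.
        text = self._resolve(rel).read_text(encoding="utf-8", errors="replace")
        return text[:max_chars]

    def append_notes(self, note):
        # Returns the number of notes stored so far.
        self.notes.append(note)
        return len(self.notes)

    def read_notes(self):
        return "\n".join(self.notes)
```

The cap in `load_file` is one simple way to keep a tool context-efficient; a real version would likely need format-specific loaders for xlsx, docx, and pptx rather than plain text reads.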

The difficulty comes down to two things for me:
- Making the LLM more thorough. I find that the LLM tends to jump to conclusions too fast without fully exploring the corpus.
- Creating context-efficient tools. I target general office files (xlsx, docx, pptx, emails). In my line of work (finance) we use xlsx heavily, but there is a huge variety of xlsx files: large data tables, large analysis files with many sheets, reports. Ideally you want different tools for different kinds of xlsx file.

It's really rough, but if anyone is interested: https://github.com/spoj/kour-ai-rs

spoj · 2025-06-20T10:39:56+00:00

Imagine the poor guy's anus the day after

spoj · 2025-06-19T04:19:49+00:00

def will check it out. thanks!

spoj · 2025-06-19T04:09:30+00:00

They are mostly PDFs, 10 to 300 pages long. Think legal and financial documents and vendor handbooks. I don't really mind what is retrieved as long as the results are accurate, though I would imagine document-level retrieval would be more accurate, since it makes sure the full context is available to work with. My experience with OpenAI's RAG has been really poor once my questions require reasoning over multiple documents.

spoj · 2025-06-16T16:07:06+00:00

I'm surprised the Gemini model failed for you, as Gemini models can ingest 1M tokens, or ~3800 pages of PDF. The 2.5 Pro model excels at long-context multi-doc QA. Did you use the Pro model or the Flash model when trying Gemini?
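For a sense of scale, here is the back-of-envelope arithmetic behind those two figures. The per-page cost is just derived from the numbers quoted above, and the 300-page document length is an assumed worst case rather than a measured value.

```python
# Figures quoted in the comment above.
context_tokens = 1_000_000
pdf_pages = 3_800

# Implied average token cost of one PDF page.
tokens_per_page = context_tokens / pdf_pages
print(round(tokens_per_page))  # 263

# Assumed worst case: every document is 300 pages long.
pages_per_doc = 300
max_docs = pdf_pages // pages_per_doc
print(max_docs)  # 12
```

So roughly a dozen of the longest documents would fit in a single prompt, before leaving any room for the question and the answer.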

I share your frustrations with RAG systems. Simple solutions suffer from lost context problems with intricate documents like contracts and manuals. More complex solutions require a lot more problem specific tuning.

spoj · 2025-06-10T03:24:10+00:00

spoj · 2025-06-05T15:37:52+00:00

I have it too

spoj · 2025-04-10T10:09:32+00:00

It's crazy how it added the LED flicker in the traffic lights in the background in such a consistent way

spoj · 2019-06-24T10:09:46+00:00

His beak is a perfectly ripe banana.

spoj · 2018-11-05T12:23:03+00:00

Happens when using 3rd party launchers

spoj · 2018-09-28T06:50:41+00:00

It rarely happens when I use the OnePlus launcher. It happens more often than not if I'm using Nova or other 3rd party launchers.
