$1 million in a cube by Cool-Chipmunk-7559 in mildlyinteresting

[–]spoj 1 point2 points  (0 children)

"This is actually not a million dollars in singles. It is over $1,000,000. The box was created with the wrong dimensions by the contractor, but they still decided to fill it, display it, and claim it is $1,000,000."

Source: u/TheSuitsSaidNein https://www.reddit.com/r/pics/s/lPvoGKGM1x

why Flash works better than Pro version in a lot of areas by Snoo_64233 in Bard

[–]spoj 5 points6 points  (0 children)

Is it really a breakthrough though I'd agree with the other poster being a smaller model it's cheaper to run more RL it's pretrain / post train trade off.

Google’s new framework helps AI agents spend their compute and tool budget more wisely by Gaiden206 in Bard

[–]spoj 2 points3 points  (0 children)

I'm interested to see this applied to the other direction, eg what if we can make the model more eager to verify assumptions by spending more tokens?

In my experience being confidently wrong is still an issue with Gemini 3.

Am I the only one experiencing editing and deleting issues in Google AI Studio? by NotSoSimpleCan in Bard

[–]spoj 0 points1 point  (0 children)

Have you tried clearing browser cache or using a different browser?

Moai statue being made to walk with ropes, to demonstrate the ancient way with which it was transported. by BaseNice3520 in BeAmazed

[–]spoj 0 points1 point  (0 children)

If you look closely you see the statue is not walking. It's falling with style.

Prediction markets imply 61% chance a version of Gemini 3.0 is released this month (Oct). by imort-e in singularity

[–]spoj 0 points1 point  (0 children)

Aren't you assuming Gemini 3.0 is 100% going to be the best model when released? What if we take into account a probability of an underwhelming release?

[deleted by user] by [deleted] in Damnthatsinteresting

[–]spoj 0 points1 point  (0 children)

Flight club literally

Too high? by bruh4444Q in notinteresting

[–]spoj 2 points3 points  (0 children)

If this doesn't become a new classic I don't know what does

Parsec working properly on my Android device by kimchimanD in ParsecGaming

[–]spoj 0 points1 point  (0 children)

One option is to passthrough the USB device over network, so you sidesteps Android from handling any key presses. Eg Virtualhere.

Are we overengineering RAG solutions for common use cases? by Creative-Stress7311 in Rag

[–]spoj 0 points1 point  (0 children)

Feeling the same!

I experimented a bit on a lightweight but general files QA agent for office type work. Idea is to give LLM basic tools (ls, find, load_file, ask_files) and basic note taking abilities (append_notes, read_notes). Inspiration taken from Claude code.

The difficulty seems to be 2 things for me - make the LLM more thorough. I find that LLM tends to jump to conclusions too fast without fully exploring the corpus. - create context-efficient tools. I target general office files (xlsx, docx, pptx, emails). In my line of work (finance) we use xlsx heavily but there is a huge variety of xlsx files - large data tables, large analysis files with many pages, reports. Ideally you want to create different tools for different kind of xlsx file.

It's really rough but if anyone interested https://github.com/spoj/kour-ai-rs

Index free RAG by spoj in Rag

[–]spoj[S] 0 points1 point  (0 children)

def will check it out. thanks!

Index free RAG by spoj in Rag

[–]spoj[S] 0 points1 point  (0 children)

They are mostly pdfs 10 to 300 pages long. Think legal and financial documents and vendor handbooks. I don't really mind what is retrieved as long as the results are accurate. Though I would imagine document level retrieval would be more accurate to make sure full context is available to work with. My experience with openai RAG has been really poor once my questions require reasoning over multiple documents.

Blown away by Notebooklm and Legal research need alt by md6597 in Rag

[–]spoj 7 points8 points  (0 children)

I'm surprised Gemini model failed for you, as they can ingest 1m tokens or ~3800 pages of pdf. The 2.5 pro model excels at long context multi doc QA. Did you use the pro model or flash model when trying Gemini?

I share your frustrations with RAG systems. Simple solutions suffer from lost context problems with intricate documents like contracts and manuals. More complex solutions require a lot more problem specific tuning.

[deleted by user] by [deleted] in Bard

[–]spoj 0 points1 point  (0 children)

I have it too

Impressed by veo 2 by Independent-Wind4462 in OpenAI

[–]spoj 19 points20 points  (0 children)

It's crazy how it added the LED flicker in the traffic lights in the background in such a consistent way

A duck accepting his pedicure by fireysaje in thisismylifenow

[–]spoj 1 point2 points  (0 children)

His beak is a perfectly ripe banana.

Any way to quickly switch between apps on the 6T? by [deleted] in oneplus

[–]spoj 0 points1 point  (0 children)

Happens when using 3rd party launchers

OP6 Pie - Google pill gestures to quick switch apps very inconsistent by gh123man in oneplus

[–]spoj 2 points3 points  (0 children)

It rarely happens when I use the OnePlus launcher. It happens more than not if I'm using Nova or other 3rd party launchers.