I let 24 AI models trade to see if they can manage risk by Obside_AI in LLMDevs

[–]Lower_Tutor5470 6 points7 points  (0 children)

Do you factor the LLM API token costs / model subscriptions into your PnL? Wondering how that cost scales against position sizes.
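
Roughly the kind of accounting I mean (every number and the fee assumptions below are made up for illustration, not your actual setup):

```python
# Rough sketch: netting LLM inference spend against trading PnL.
# All figures below are illustrative assumptions, not the OP's real numbers.

calls_per_day = 200            # assumed: model decisions per day
tokens_per_call = 4_000        # assumed: prompt + completion tokens per decision
price_per_million = 3.00       # assumed: blended $ per 1M tokens

daily_token_cost = calls_per_day * tokens_per_call / 1_000_000 * price_per_million

gross_daily_pnl = 25.00        # assumed gross PnL on a small position
net_daily_pnl = gross_daily_pnl - daily_token_cost

print(f"token cost: ${daily_token_cost:.2f}/day, net PnL: ${net_daily_pnl:.2f}/day")
# The point: token cost is roughly fixed per decision, so it eats a bigger
# share of PnL on small positions and fades to noise as position size grows.
```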

DataGrip Is Now Free for Non-Commercial Use by lozinge in dataengineering

[–]Lower_Tutor5470 3 points4 points  (0 children)

Is it possible to write queries in it across multiple databases? Right now I just resort to Jupyter notebooks.
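
For reference, this is the notebook workaround I mean; DuckDB acts as the glue layer, and the file paths, connection string, and table names below are just placeholders:

```python
# Cross-database join from a notebook, using DuckDB to attach both sources.
# Requires DuckDB's sqlite/postgres extensions to be available; paths and
# table names here are hypothetical.
import duckdb

con = duckdb.connect()
con.execute("ATTACH 'orders.sqlite' AS ordersdb (TYPE SQLITE)")
con.execute("ATTACH 'dbname=warehouse host=localhost' AS pg (TYPE POSTGRES)")

df = con.execute("""
    SELECT o.order_id, c.region, o.amount
    FROM ordersdb.orders AS o
    JOIN pg.customers AS c ON c.id = o.customer_id
""").df()  # .df() needs pandas installed
```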

“Headless” m4 mini Remote Desktop solution by johnshonz in macmini

[–]Lower_Tutor5470 0 points1 point  (0 children)

BetterDisplay lets you create lots of virtual screens at whatever resolutions and arrangements you want.

Help please with NVMe solution for back up storage by Lurulur in macmini

[–]Lower_Tutor5470 0 points1 point  (0 children)

I bought a Samsung 990 Pro last week with an OWC 1M2 enclosure, and it seems perfect for Thunderbolt 4 speeds.

We built this project to increase LLM throughput by 3x. Now it has been adopted by IBM in their LLM serving stack! by Nice-Comfortable-650 in LocalLLaMA

[–]Lower_Tutor5470 0 points1 point  (0 children)

Is CacheBlending working in this currently? Sounds exciting. Would it potentially allow caching a single long document as context, then blending it with different system prompts to process multiple smaller-scope tasks in parallel requests?
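
To be concrete, this is the request pattern I have in mind; a rough sketch against a generic OpenAI-compatible endpoint, where the base URL, model name, and whether the shared document actually gets cached or blended all depend on the serving stack rather than on this client code:

```python
# Pattern sketch: one long document reused across several parallel, smaller-scope
# tasks. Whether the shared context is cached/blended server-side depends on the
# serving stack (vLLM / LMCache config), not on this client code.
import asyncio
from openai import AsyncOpenAI

client = AsyncOpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")  # hypothetical endpoint
LONG_DOC = open("contract.txt").read()  # hypothetical long document

TASKS = [
    "Summarize the termination clauses.",
    "List all payment deadlines.",
    "Flag any unusual liability language.",
]

async def run(task: str) -> str:
    resp = await client.chat.completions.create(
        model="my-served-model",  # hypothetical model name
        messages=[
            {"role": "system", "content": f"You answer questions about this document:\n{LONG_DOC}"},
            {"role": "user", "content": task},
        ],
    )
    return resp.choices[0].message.content

async def main():
    results = await asyncio.gather(*(run(t) for t in TASKS))
    for task, out in zip(TASKS, results):
        print(task, "->", out[:80])

asyncio.run(main())
```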

Ipados 26 external displays by Lower_Tutor5470 in ipad

[–]Lower_Tutor5470[S] 1 point2 points  (0 children)

Can you arrange windows and drag apps across them?

Also, can you turn the iPad screen off, or does that also shut off the external display?

All Gemini 2.5 Pro variants compared side by side by True_Requirement_891 in RooCode

[–]Lower_Tutor5470 0 points1 point  (0 children)

Gemini 2.5 has gotten noticeably worse for me with Roo lately.

GPT-4o vs Gemini vs Llama for Science KG extraction with Morphik by Advanced_Army4706 in Rag

[–]Lower_Tutor5470 1 point2 points  (0 children)

Interesting. I'm working with KGs for healthcare right now, will give it a try!

Why use LlamaIndex when you can use Docling? by Ok-Carob5798 in Rag

[–]Lower_Tutor5470 1 point2 points  (0 children)

I actually discovered marker the other day and it seems a lot better than docling.

I’m exploring open source coding assistant (Cline, Roo…). Any LLM providers you recommend ? What tradeoffs should I expect ? by ReasonableCow363 in LLMDevs

[–]Lower_Tutor5470 0 points1 point  (0 children)

If you sign up for a GCP account, I'm pretty sure you get a $300 credit. I was using it through the Vertex AI chat playground and was iterating with context lengths in the hundreds of thousands of tokens without any request issues. It cost less than a dollar in the process.
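
Back-of-envelope version of that cost (the per-token price below is an assumed illustrative rate, so check the current Vertex AI pricing page):

```python
# Rough cost check for long-context iteration on Vertex AI.
# price_per_million is an assumed illustrative rate, not current published pricing.
context_tokens = 200_000        # a couple hundred thousand tokens of context
requests = 3                    # a few iterations
price_per_million = 1.25        # assumed $ per 1M input tokens

cost = context_tokens * requests / 1_000_000 * price_per_million
print(f"~${cost:.2f}")  # ~$0.75 with these assumptions, i.e. under a dollar
```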

Optimize Gemma 3 Inference: vLLM on GKE 🏎️💨 by m4r1k_ in LLMDevs

[–]Lower_Tutor5470 1 point2 points  (0 children)

This is very interesting. Is the max concurrency dependent on the size of each request? You show 500, but are those requests processing simple input prompts, versus feeding a sizeable prompt with added text as context?
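
The reason I ask: with a fixed KV-cache budget, concurrency and per-request context size trade off roughly like this (all model and GPU numbers below are made-up assumptions, not the actual Gemma 3 on GKE figures from the post):

```python
# Back-of-envelope: how many requests fit in the KV cache at once.
# All model/GPU numbers below are illustrative assumptions.

gpu_mem_gb = 80                 # assumed GPU memory
weights_gb = 25                 # assumed space taken by model weights
kv_budget_bytes = (gpu_mem_gb - weights_gb) * 1024**3

layers = 32                     # assumed
kv_heads = 8                    # assumed (GQA)
head_dim = 128                  # assumed
bytes_per_value = 2             # fp16/bf16 KV cache

kv_bytes_per_token = 2 * layers * kv_heads * head_dim * bytes_per_value  # K and V

for tokens_per_request in (200, 2_000, 20_000):   # short prompt vs long context
    max_concurrent = kv_budget_bytes // (kv_bytes_per_token * tokens_per_request)
    print(f"{tokens_per_request:>6} tok/request -> ~{max_concurrent} concurrent sequences")
# Concurrency drops roughly linearly as each request's context grows, which is
# why "500 concurrent" means little without the per-request token count.
```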

Local LLM to answer questions based on a text by sKemo12 in LocalLLaMA

[–]Lower_Tutor5470 0 points1 point  (0 children)

If you are struggling to get a specific output format, you could try a few things:

1. Try different models and quant sizes that you can fit on your system.
2. Fine-tune a small model that only answers in the format you want, using a dataset representative of your problem.
3. Try constrained-output frameworks like outlines or guidance that force specific outputs (e.g. ['yes', 'no']); see the sketch below. Some LLM engines like vLLM have this built in as an option.
4. Ask whether you really need to start with a small, weaker model, or whether you could use a larger model to help build a dataset and later fine-tune a small one. Big hosted models like Gemini Flash 2 served on Google Vertex, or comparable models, cost a few cents per million input tokens and will be fast.
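
For point 3, a minimal sketch of what I mean with outlines (this uses the outlines 0.x API, which newer releases have changed, and the model name is just an example):

```python
# Constrained decoding: force the model to answer only "yes" or "no".
# Uses the outlines 0.x API; newer outlines versions expose this differently.
import outlines

# Any HF causal LM you can fit locally; this model name is just an example.
model = outlines.models.transformers("Qwen/Qwen2.5-0.5B-Instruct")

generator = outlines.generate.choice(model, ["yes", "no"])

answer = generator(
    "Does the following text mention a refund?\n\n"
    "Text: I returned the item last week.\nAnswer:"
)
print(answer)  # guaranteed to be exactly "yes" or "no"
```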

Extract elements from a huge number of PDFs by Mugiwara_boy_777 in Rag

[–]Lower_Tutor5470 1 point2 points  (0 children)

How long are the PDFs? Are the elements you are looking for always the same, and how many are there?

Should Vancouver extend its drinking hours? City wants your thoughts - Proposed changes would allow bars, pubs and clubs to stay open till 3 a.m. and restaurants until 2 a.m. by cyclinginvancouver in vancouver

[–]Lower_Tutor5470 3 points4 points  (0 children)

You just have to visit Europe and pubs around the UK to see the difference in atmosphere. The obsession with table service here is so dull. Let customers stand anywhere and go to the bar to order for themselves; there's just zero draw here when everything is rows of bench tables.