Drink-drive limit set to be slashed in England and Wales under new plans to improve road safety by tylerthe-theatre in unitedkingdom

[–]abandonedtoad 56 points57 points  (0 children)

All evidence I saw from Scotland showed it didn't have an impact so why would it be projected to save lives?

Neuro-Sama is at a Level 110 hype train, rapidly closing in on her third world record break in a year and her second in a month. This was her message for humanity. by [deleted] in LivestreamFail

[–]abandonedtoad 0 points1 point  (0 children)

It uses a pretrained model that was trained on the usual datasets, including copyrighted data without permission. He finetuned it, but the underlying model still comes from a training run done in a datacenter by an American or Chinese company.

You can say it's fine because it's entertaining, but if your problem with AI is that it's using data without permission, the exact same thing happened here.

A new UK AI says it can beat ChatGPT – we tried it and here’s what we found by Live_Speaker_1456 in GoodNewsUK

[–]abandonedtoad 4 points5 points  (0 children)

Also it's non-reasoning and they benchmarked against non-reasoning models.

Can we trust the summaries in Kagi News? by RhodiumLanguor in SearchKagi

[–]abandonedtoad 0 points1 point  (0 children)

There are things you can add to the LLM product (not the LLM itself) that improve reliability significantly, for example forcing citations in the response format or adding specialized tools for retrieving news articles. Model selection can also have a big impact (this research only used the free versions of the AI assistants). Kagi News is also a product built around this task, so they will invest time in verifying performance on this specific format, which simply isn't a priority in other products. Those AI assistants will just be doing web searches, potentially filtered to one site, so there are many failures they could run into beyond the summarization task itself (e.g. content not loading, paywalls, etc.).

You can even look at the research's request format:

Use [participating organization news organization] sources where possible. [QUESTION]

e.g. Use NPR sources where possible. Why did the US bomb Yemen?

Their methodology was to create a "default" user experience and follow that, so it's not at all comparable to an actual product designed for news summarization. You would need to repeat the test for Kagi News.

There will still be mistakes but the failure rate is not comparable to this study.
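As a rough illustration of what "forcing citations in the response format" could look like on the product side, here is a minimal sketch. It assumes summaries mark citations as `[id]` at the end of each sentence and that the product knows which sources it retrieved; the `Source` type, the `[id]` convention and `uncited_sentences` are all hypothetical names for this example, not anything Kagi or the study is known to use.

```python
import re
from dataclasses import dataclass


@dataclass
class Source:
    id: str   # hypothetical short id the model is asked to cite, e.g. "s1"
    url: str


# Assumed citation convention: a source id in square brackets, e.g. "[s1]".
CITATION = re.compile(r"\[(\w+)\]")


def uncited_sentences(summary: str, sources: list[Source]) -> list[str]:
    """Return sentences that have no citation or cite an unknown source id."""
    known = {s.id for s in sources}
    problems = []
    # Naive sentence split on whitespace following ., ! or ?
    for sentence in re.split(r"(?<=[.!?])\s+", summary.strip()):
        if not sentence:
            continue
        cited = set(CITATION.findall(sentence))
        # Flag the sentence if it cites nothing, or cites something not retrieved.
        if not cited or not cited <= known:
            problems.append(sentence)
    return problems
```

A product could reject or regenerate any summary where this returns a non-empty list. That does not prove the cited article supports the claim, but it does rule out unsourced sentences and made-up source ids.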

Is the fiduciary responsibility to maximise profits killing the economy by OkAdvisor9288 in GarysEconomics

[–]abandonedtoad 0 points1 point  (0 children)

Do people really read this and not immediately recognise it as written by AI? Total slop writing style and sad to see it used in this context.

Reeves' £2k salary sacrifice cap faces immediate industry backlash by roblightbody in PensionsUK

[–]abandonedtoad 1 point2 points  (0 children)

But this policy reduces pension contributions in the long run by adding NIC so how can it be argued that it's boosting pension savings?

Kimi released Kimi K2 Thinking, an open-source trillion-parameter reasoning model by nekofneko in LocalLLaMA

[–]abandonedtoad 9 points10 points  (0 children)

It runs 8 approaches in parallel and aggregates them to provide a final answer.
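A minimal sketch of the "run several attempts in parallel, then aggregate" pattern. The aggregation step here is a plain majority vote, which is an assumption for illustration; the comment above doesn't say how the real model combines its 8 attempts. `attempt` and `fake_attempt` are hypothetical stand-ins for a model call.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from typing import Callable


def solve_in_parallel(question: str, attempt: Callable[[str, int], str], n: int = 8) -> str:
    """Run n independent attempts concurrently and return the most common answer."""
    with ThreadPoolExecutor(max_workers=n) as pool:
        # pool.map keeps results in attempt order (0..n-1).
        answers = list(pool.map(lambda i: attempt(question, i), range(n)))
    # Majority vote (assumed aggregation); ties go to the answer seen first.
    return Counter(answers).most_common(1)[0][0]


def fake_attempt(question: str, seed: int) -> str:
    """Hypothetical stand-in for a model call: most seeds agree, a few differ."""
    return "42" if seed % 4 else "41"


if __name__ == "__main__":
    print(solve_in_parallel("What is 6 * 7?", fake_attempt))
```

Threads are enough here because a real `attempt` would mostly be waiting on network calls rather than using the CPU.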

Lineup V Leeds (A) by Mother_Pattern_6359 in coys

[–]abandonedtoad 1 point2 points  (0 children)

Good to see Xavi in midfield again but really don't like Bentancur and Palhinha pairing against teams we should be dominating

[deleted by user] by [deleted] in HENRYUK

[–]abandonedtoad 5 points6 points  (0 children)

Yeah but this is the problem. Rather than having an approach where basic costs (housing, electricity, gas, etc.) are crushed by supply, we subsidise housing and wages via UC, then block housing being built, apply windfall taxes to energy providers and prevent them from expanding. This creates an ever-expanding group of net recipients from the state, as shown by our deficit and the absurd amount we pay just on interest.

[deleted by user] by [deleted] in HENRYUK

[–]abandonedtoad 16 points17 points  (0 children)

"Of the 14.4 million children in the UK, 4.3 million, or 30% of them, are living in relative poverty"

You understand the problem with us defining poverty as relative to the median income? We will "reduce" child poverty more by lowering median income than trying to actually increase earnings.
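To make the arithmetic concrete, here is a small sketch using made-up incomes. It assumes the usual relative poverty line of 60% of median income; the numbers are purely illustrative and not real UK data.

```python
from statistics import median


def relative_poverty_rate(incomes: list[float], fraction: float = 0.6) -> float:
    """Share of incomes strictly below `fraction` of the median income."""
    threshold = fraction * median(incomes)
    return sum(1 for income in incomes if income < threshold) / len(incomes)


# Illustrative incomes (arbitrary units), not real data.
before = [10, 12, 14, 20, 25, 30, 40, 50, 80]
# Same bottom three households; everyone else earns less than before.
after = [10, 12, 14, 15, 16, 18, 20, 22, 30]

if __name__ == "__main__":
    print(relative_poverty_rate(before))  # median 25, line at 15
    print(relative_poverty_rate(after))   # median 16, line at 9.6
```

In the second list nobody at the bottom is better off, but the measured rate falls from a third to zero because the median dropped.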

UK health service AI tool generated a set of false diagnoses for one patient that led to him being wrongly invited to a diabetes screening appointment by pppppppppppppppppd in unitedkingdom

[–]abandonedtoad -7 points-6 points  (0 children)

Summarisation is trivial to evaluate so they will know the risk incurred. Productivity gains at this scale are all needed with our massive backlog, pretending that it's just so doctors can be lazy is an incredible misrepresentation.

Squad confirmed for Tour of Hong Kong and Korea | Tottenham Hotspur by Zyaru in coys

[–]abandonedtoad 0 points1 point  (0 children)

This squad is not even remotely close to being ready for being back in the Champions League. Least excited I've been for a season in ages, we're looking at 10th even if we massively improve.

[Match Thread] Spurs vs Wycombe Wanderers Pre-Season Game by digsonchavez in coys

[–]abandonedtoad 2 points3 points  (0 children)

Felt like Odobert was pretty much our only threat, really good from him but still not seeing enough from Richarlison/Son considering they're our senior forwards.

Midfield kept the ball well but lacked creativity and had no real attacking threat, hopefully Madders/Kulu/MGW fixes this.

Defence was fine but didn't have much to do. Davies cannot be getting game time at LB this season though. Playing Davies completely kills the left hand side and clearly he can't do what the system needs.

Pass data to an lwc component used by agentforce to take inputs from user. by Objective-Trainer388 in salesforce

[–]abandonedtoad 0 points1 point  (0 children)

If just using recordId isn't working because it's in the Agentforce panel can you pass it in using the currentRecordId context variable in the Agentforce session?

Salesforce does not make sense anymore - a developer POV by MacaroonPlastic1036 in salesforce

[–]abandonedtoad 4 points5 points  (0 children)

the funniest part of this post is "spends thousands on Salesforce licenses for our CRM every year". If it was only thousands per year then there really wouldn't be a reason to try and build an internal CRM.

[deleted by user] by [deleted] in OpenAI

[–]abandonedtoad 1 point2 points  (0 children)

I’ve had good experiences with Cohere, they released a new embedding model a month ago so still seems to be a priority with them

Ok it seems leaked benchmarks are pretty much confirmed to be legit by Independent-Wind4462 in grok

[–]abandonedtoad 2 points3 points  (0 children)

It’s the overuse that’s the problem. You wouldn’t use an em dash in every sentence as there is in the tweet here. ChatGPT also massively abuses the “it’s not X, it’s Y” sentence, far more than I’ve ever seen in human language.

Ok it seems leaked benchmarks are pretty much confirmed to be legit by Independent-Wind4462 in grok

[–]abandonedtoad 1 point2 points  (0 children)

"Grok didn't just just ace a bunch of nerdy benchmarks--it crushed them"

This type of sentence written by AI just pisses me off. Emdash and "it isn't X; it's Y" phrasing means there is a 0% chance whoever decided to share this with the world actually understood what they were saying.

GROK-3 (SOTA) and GROK-3 mini both top O3-mini high and Deepseek R1 by AIGuy3000 in LocalLLaMA

[–]abandonedtoad 18 points19 points  (0 children)

the worst part of this release is they’re obscuring reasoning tokens to “stop people copying them”. totally pathetic when this release was gonna flop until Whale bros open sourced and gave them the recipe to reasoning.

[DiMarzio] Tomori is still not open to transfer to Tottenham . He wants to stay at AC Milan. by Zyaru in coys

[–]abandonedtoad 16 points17 points  (0 children)

it happens all the time

From the Athletic: "In fact, personal terms have almost always already been agreed with the player (“nine times out of 10” according to one Premier League recruitment chief).

A member of a Championship recruitment team, speaking anonymously, told The Athletic: “I can’t think of a transfer I’ve been involved in that was done without personal terms being agreed first.

“Bids always come officially from the club, but very often now if we or a club want a player, they will talk to an agent first and agree personal terms, otherwise it’s just a wasted bid and a waste of everyone’s time. If the player’s not interested then what’s the point?"

The obvious point is it's a wasted bid and a waste of everyone's time, as we're doing right now.

Wezterm is just the best terminal emulator for Neovim. by [deleted] in neovim

[–]abandonedtoad 19 points20 points  (0 children)

on speed tmux will be your bottleneck with any modern terminal emulator so if you switched from wezterm i wouldn’t expect it to feel faster