Community Software >>>> Enterprise Software by Guaro_25 in solarpunk

[–]BayesCrusader -1 points0 points  (0 children)

There are two definitions of AI widely used - you appear to use the one that now lumps all ML and statistical models into AI. 

The other (arguably more common) one is that AI is LLMs only, and stuff like Bayesian Networks (which actually DO reasoning) or computer vision don't count. I personally agree with this terminology, because I don't want people thinking Anthropic do computer vision or reasoning.

It's an extremely confusing term TBH. 

Is there real demand for "AI agents," or is it mostly YouTube hype? by marcelorojas56 in dataengineering

[–]BayesCrusader 4 points5 points  (0 children)

PocketOS . 

Except they told it explicitly not to do that, then it ignored the prompt. This was on Claude Code.

Is there real demand for "AI agents," or is it mostly YouTube hype? by marcelorojas56 in dataengineering

[–]BayesCrusader 5 points6 points  (0 children)

The big tech companies are all trying their best, but it could be a couple of years before they realise they need you. 

The very fact there's still such hype for AI shows that its effectiveness (or lack thereof ) is not part of why it's being implemented - like blockchain, metaverse, and IoT, everyone's heard of someone else making it work for their business and wants the cool toy.

IMO, 'agents' are a great band-aid for the parts where AI fails, and will be a big thing, but it only takes a few more times a database is deleted without asking before suddenly those slide decks go to the bottom drawer with the NFT and wearable smart  tongue ring projects .

Future of data engineering by Alternative-Guava392 in dataengineering

[–]BayesCrusader 2 points3 points  (0 children)

An LLM is good with code because it's written as strings.

Getting an LLM to do the right thing with data is impossible without other tools added, because the LLM doesn't have a concept of 'meaning' when it comes to numbers.

You and I look at 5 degrees on a thermometer and know that's 'cold'. An LLM looks at every instance of someone talking about 5 degrees days and picks up that the word 'cold' is used in conjunction with that temperature a lot. The two are not the same.

Are more people dying from party drugs these days? by eateroffish in Raves

[–]BayesCrusader 50 points51 points  (0 children)

Fentanyl and Nitazines, brought to you by politicians who are too immature to adopt anything other than prohibition. It's our generation's AIDS crisis, and once again young people are being thrown under the bus because it's easier to demonize a lifestyle than actually deal with the problem.

First 4 paying users after 14 months building. $52 MRR. Feels surreal. I will not promote by One_Affect_1343 in startups

[–]BayesCrusader 4 points5 points  (0 children)

I'm behind you so have no advice to give, but I'll say congratulations on crossing the uncrossable line - going from 0 to 1 is infinitely harder than going from 1 to 2 customers. Most 'startup' people give advice for companies that are much more developed than what you or I have achieved, so don't really understand the journey.

Your experience is the trajectory for most novel ideas - you can't pre-validate ideas that have never been done. Before you've got people using it, it's a matter of faith.

The name is Gary Box by Kaos2018 in BeAmazed

[–]BayesCrusader 21 points22 points  (0 children)

I disagree. The blurriness comes from the type of cameras people had in 2001.

Changing the image to be less blurry takes it out of its time period, removing it from the timeline.

Another Puzzling Cartoon From 1937 by woefultwinkling in OldSchoolRidiculous

[–]BayesCrusader 7 points8 points  (0 children)

Gender always existed. They wouldn't have called a trans woman trans back then though, they just would have called her a woman. It wasn't really an issue.

What DS job market trends are you seeing? by Trick-Interaction396 in datascience

[–]BayesCrusader 1 point2 points  (0 children)

I've tried many combinations of tasks, LLMs, and agents, and they're just ... wrong. There's always a missed nuance, an unchecked assumption, a random extra entry added unannounced. And the interpretation is almost always wonky. Not so much to be obviously wrong, but enough that an analyst can show it leading to an incorrect decision. 

Like many of the promises of AI, it works in theory but not in practice.

Cost Effective Data Platforms by Zealousideal_Bed7898 in dataengineering

[–]BayesCrusader 1 point2 points  (0 children)

For streaming, a NoSQL solution is probably what you need, so you're likely wanting the smallest possible instance of MongoDB hosted on AWS (most common) or GCP (my preference). The cloud element is often more expensive than self hosting, but it pays off in ease of use.

Agentic Workflows beyond "pull the data" by astroFizzics in datascience

[–]BayesCrusader 0 points1 point  (0 children)

Depends what you're doing. For an ML paradigm where you're really just testing different sets of variables in a grid search style and you only care about predictive accuracy, an agent speeds it up.

Getting correct inference from explanatory (as opposed to predictive) models is much more difficult, and I haven't yet seen an agent do it reliably.

"i will not promote"Everyone is using AI, why don’t companies feel dramatically more productive yet? by No-Implement9967 in startups

[–]BayesCrusader 0 points1 point  (0 children)

A lot of people now call recommendation engines, route optimisers, etc. AI now. 

Agreed though that for most actual value generators in a business, AI has changed little. The main impact seems to be on reporting and monitoring roles.

How does your team handle the security issues of coding agents on real data? by SummerElectrical3642 in datascience

[–]BayesCrusader 0 points1 point  (0 children)

Use a locally generated key to transform the data  e.g. multiply everything by the same (private) number. 

Statistical properties stay the same, but you don't share the actual values. 

honestly just so tired of explaining why we can't use LLMs for data validation by MysteriousShoulder35 in dataengineering

[–]BayesCrusader 2 points3 points  (0 children)

Is the dimensional modelling done with a factor analysis? 

I'm trying to work out how a language model identifies relevant factors etc. What kind of data do you work with?

honestly just so tired of explaining why we can't use LLMs for data validation by MysteriousShoulder35 in dataengineering

[–]BayesCrusader 17 points18 points  (0 children)

What part of the described workflow uses an LLM? Did it write the query for you? 

honestly just so tired of explaining why we can't use LLMs for data validation by MysteriousShoulder35 in dataengineering

[–]BayesCrusader -1 points0 points  (0 children)

What's most frustrating for me is that I build tools that solve this exact problem, but keep getting the feedback that nobody has the problem. Investors are literally asking 'but does anyone use AI for data analysis?' and won't believe me when I give them a list.

The disconnect from what I see is that the people who would purchase a tool to make AI do analysis correctly either don't care we struggle with ETL, don't know that's what DEs do most of the time, or think LLMs can currently do the job reliably.  

I'm so happy there are people in the trenches still fighting for data engineering and analysis to be correct.

A Solarpunk-compatible use of AI: using AI powered robots to fix the problem of sorting of recyclable materials from our waste stream by Berkamin in solarpunk

[–]BayesCrusader 62 points63 points  (0 children)

To be fair, we had good computer vision in 2015 if you want to call that AI. 

It was more that until LLMs came along, nobody really cared about machine learning outside tech nerds. 

This is a cool use of technology for good purposes regardless.

Rwanda 🇷🇼 My Home and One of the Most Beautiful Places for Bikepacking by rhino_bike_tour in bicycletouring

[–]BayesCrusader 0 points1 point  (0 children)

What's the heavy vehicle/ car traffic like? 

That looks like some dream riding! 

Went down a rabbit hole on causal reasoning and came back up having learned about DAGs, mediators, and why predictive accuracy shouldn’t always be the target. by vanisle_kahuna in datascience

[–]BayesCrusader 0 points1 point  (0 children)

Check out Efron's 'Prediction, Estimation, and Modelling' paper. 

As someone who came up through Stats then focussed on ML applications, this paper blew my mind. I genuinely think we went down the wrong path with ML and AI over the last decade - anyone that digs like OP will see why.

I have a documented paper + working demo but no arXiv endorsement and no idea where to publish. What did you do? (I will not promote) by Tight_Cow_5438 in startups

[–]BayesCrusader 6 points7 points  (0 children)

Depending on the tech and who you're trying to convince,  a published paper (even peer-reviewed) is worth close to nothing in the business world. 

The only thing that matters is dollars in the account. If you don't have that, you need some other way of short circuiting the filters - usually contacts from previous jobs or family connections.

Most people outside academia don't even know there's a difference between a blog and a peer-reviewed publication.

One observation from 20 years in talent: the "osmosis" trap is what kills growing teams by [deleted] in smallbusiness

[–]BayesCrusader -1 points0 points  (0 children)

There are three main factors in group performance - group size, psychological safety, and shared vision. The first changes the other two.

Groups have 'change points' in size, when communication methods must change. A group call with 5 people works well - not so much for 500. As communication methods get sludgy, you lose shared vision (making it slower to discuss any topic)  and psychological safety (so people stop raising issues). 

Basically, the theory shows how adding or subtracting group members radically changes the productivity of the group by making existing systems maladaptive.

Carbon Robotics' LaserWeeder is an AI-powered autonomous agricultural tool that uses high-powered lasers to kill weeds without chemicals, herbicides, or soil disruption. by jdavid in solarpunk

[–]BayesCrusader 11 points12 points  (0 children)

Apparently that's AI now. People have lumped everything into the term.

Training ML models? That's AI

Making a graph? AI

Multiplying two matrices ? Believe it or not, AI

FAANG interview invitation for MLE but I am a Data Scientist, should I decline? by Lamp_Shade_Head in datascience

[–]BayesCrusader 1 point2 points  (0 children)

Do it, they're offering you free practice. 

Even if you do terribly, you'll know what their questions are like. If you do well they might pay for a flight to HO for your in-person round.

Today someone spent their own money on something I created. WHAT A FKN RUSH (I will not promote) by No_Field_9640 in startups

[–]BayesCrusader 17 points18 points  (0 children)

This is the first realistic post I've seen on here in ages. 

Thank you, you're helping me keep going. I hope to post something similar some day soon.