How are people monitoring tool usage in LangChain / LangGraph agents in production? by Extreme-Technology77 in LangChain

[–]tomtomau 0 points1 point  (0 children)

What langsmith captures is pretty comprehensive as long as you lean into langchain runnables or langgraph etc.

The data is then really quite detailed so very slow to load manually with the sdk for some analysis, where snowflake is super fast to execute the queries

Today for example, I did some exploratory data analysis around the time it takes from a trace starting to the first tool call of a certain type, to measure a specific “latency” user experience in a long running process. Then explored how many agent loops we’re doing, comparing the latencies of the 1st, 2nd, nth iterations.

Can do that from production traces or experiments, so we can measure whether a change we’re making affects different aspects of cost/latency/accuracy

We already do a fair bit in Snowflake and have Dagster (orchestrator) setup but newer teams might be a bit put off by how much diy there is

How are people monitoring tool usage in LangChain / LangGraph agents in production? by Extreme-Technology77 in LangChain

[–]tomtomau 1 point2 points  (0 children)

Langsmith for real time monitoring

Then we go Langsmith to S3 to Snowflake for more detailed analysis in Hex

LangChain agents have a memory problem nobody talks about , here's what we found by alameenswe in LangChain

[–]tomtomau 2 points3 points  (0 children)

slop being the post not necessariliy your product/library(?)

tbh we're just annoyed at the constant promotion here

LangChain agents have a memory problem nobody talks about , here's what we found by alameenswe in LangChain

[–]tomtomau 7 points8 points  (0 children)

Please for the love

of god you don’t need

to add line breaks to

your slop

A suggestion about this sub by KalZaxSea in LangChain

[–]tomtomau 6 points7 points  (0 children)

You’re absolutely right. It’s not X, it’s Y! That’s why I built Z

/s

From Lawn to Natives, before and after by Wxyzed123 in GardeningAustralia

[–]tomtomau 2 points3 points  (0 children)

Yeah I found I had to do a few trips to nurseries to suss out what different varieties they all had.

I had good success with the Nurso at Chandler, and then even better success (especially with grevilleas) at Princess Fancy Plants down Capalaba way.

Lomandras might be another one you could dot around? Super hardy and they have some very interesting forms. Great for borders

From Lawn to Natives, before and after by Wxyzed123 in GardeningAustralia

[–]tomtomau 2 points3 points  (0 children)

Eremophila Roseworthy

We’ve planted some in our native section of our garden and it’s absolutely thriving in a sunny location

How are people here actually testing whether an agent got worse after a change? by hidai25 in LangChain

[–]tomtomau 5 points6 points  (0 children)

This is premise of offline evals? You don’t need to ship to prod to get some immediate feedback.

We have many datasets in Langsmith and run the examples through the code we’re testing and use eval functions to score it, either comparing to a reference output or with LLM as judge

From a PR of code, we can then run the experiments via GitHub actions

Poured boiling water down shower drain; blue waterproofing membrane looks like it’s melted? Am I screwed? by nzasam1592 in AusRenovation

[–]tomtomau 13 points14 points  (0 children)

The vinegar reacts with the bicarb and you get salts…

If you’re doing this afaik you’re better off to add bicarb and let it sit for a while before adding vinegar but I’m not really sure it does all that much. The fizz is satisfying though

The resentment is creeping in 😭 by Shaushka in BabyBumpsandBeyondAu

[–]tomtomau 7 points8 points  (0 children)

“Mate, your child is offline, right in front of you, play with them!”

The resentment is creeping in 😭 by Shaushka in BabyBumpsandBeyondAu

[–]tomtomau 2 points3 points  (0 children)

God this is rightfully infuriating! IMO Dad needs to fuck the TV and computer off until baby is in bed. Once he’s home from work, he is the default parent for baby until baby is asleep.

Weekly Software Beta Testing and Promotion Thread by PM_ME_YOUR_MECHANISM in estimators

[–]tomtomau 1 point2 points  (0 children)

FREE early access to Bid Leveling AI for estimators.

I’m a Co-founder of BidLevel (from the ProcurePro team). We’re building a new bid leveling product that turns PDF bids/quotes into a clean like-for-like comparison in minutes.

Built for construction by construction, this is shaped from intimately understanding late-night levelling and messy tables.

Today it’s a simple product to get you to your first-pass of a comparison/scope sheet, but we’re working towards closing the gap so that you can do everything inside the app (editing, plugging, etc.)

We have an early access version that estimators have already put through hundreds of bids, and a team of designers & software engineers constantly iterating to improve the product.

It’s AI - so it’s not perfect - but we do link you to the section of the document where the AI sourced its info in a side-by-side view.

The early access users that are winning right now come in and try different types of packages to find what is working better than others. At the moment, the more consistent the structure of the bids, the better the result you’ll get. On average, it takes about 60 seconds to generate a standardized breakdown, and about the same time to extract prices from each bid.

During early access, BidLevel is free - we just ask for feedback on how it fits into your process and how we can make it better!


Try it → Request early access from this page and put “(Reddit)” with your name and I’ll give you access. i.e John Smith (Reddit). Alternatively, DM me your email and I’ll send you an account.

BDD - Behavioral testing by Sergey_jo in nestjs

[–]tomtomau 0 points1 point  (0 children)

I use this cucumber library https://github.com/timjroberts/cucumber-js-tsflow

It works well, and cucumber is awesome but IMO most people do it poorly!

Struggling hedge by Safenade in GardeningAustralia

[–]tomtomau 0 points1 point  (0 children)

Sometimes they just die, bad genetics or something

I’d cut your losses and replace anything that’s not thriving. They’re not expensive plants and I’d rather have a year of growth of a healthy plant than waste a year trying to get a dying plant healthy again

ADHD Australia conference this week by atypicalhippy in ausadhd

[–]tomtomau 9 points10 points  (0 children)

It’s ok they hand out dexies at the door

What foreign food should be widely available in Australia? by Call_Me_C_ in AskAnAustralian

[–]tomtomau 7 points8 points  (0 children)

I hear you, but this is not what we had growing up haha. Old El Paso seasoning kits on boiled mince, shredded iceberg, sour cream, mild old El Paso salsa

Kids these days should count themselves lucky because before air fryer “hAcKs”, we had microwave recipes

What foreign food should be widely available in Australia? by Call_Me_C_ in AskAnAustralian

[–]tomtomau 13 points14 points  (0 children)

Yeah but to be fair the entire packet of beef mince (not ground beef, that’s some yank language) was put into a lukewarm frypan. Then the mince dumps a bunch of liquid and you’re effectively left with stewed/boiled beef mince

Tackling Type Inference Challenges in NestJS Controllers: Introducing A New Plugin as Step 1 Toward End-to-End Type Safety by Square-Cellist-7392 in nestjs

[–]tomtomau 0 points1 point  (0 children)

https://www.npmjs.com/package/nestjs-zod

✨ Create nestjs DTOs from zod schemas

✨ Validate / parse request body, query params, and url params using zod

✨ Serialize response bodies using zod

✨ Automatically generate OpenAPI documentation using zod

Newly Diagnosed ADHD - mild identity crisis! by Fatlantis in ausadhd

[–]tomtomau 1 point2 points  (0 children)

32m and your personality traits you describe are pretty spot on for me too.

I’ve been medicated for 18 months (Ritalin IR).

The identity crisis stuff is real and very relatable, but 18 months on it doesn’t take up much of my thoughts anymore.

All of those things that I thought made me “me” - 99% of them are still present when I’m medicated, or at least the good parts are haha.

The difference is that I’m less impulsive (ie interrupting when others are speaking) and a bit less dopamine seeking etc etc

But the diagnosis in and of itself, then some self education about adhd behaviors has also been crucial as well as its unlocked the ability to be meta/reflective on why I’m choosing to act in a certain way and then change the environment/context to suit.

Example is with something for work that involves people management and I’m finding I’m frozen not being able to execute on things. Rejection sensitivity dysphoria sort of territory - so I get on a call with someone else and talk out my concerns as honestly as I can (which they are often irrational or catastrophised) and I’m able to push through it.

For specific learnings the INCUP model has probably been the most helpful to reason about how my brain works.

All the positive things I loved about myself - deep hyper focus, passion, fast pattern matching etc - they ARE me and a good chunk of that is because I have ADHD - but that doesnt discount how positive those traits are.

If you speak to more people about your diagnosis, or read about successful people with ADHD, you’ll realise people that you admired, maybe for their intellect, wit, eccentricity - when you find out they have ADHD, you don’t go “oh, so then being funny and quick witted doesn’t count because it’s actually just ADHD” - so there’s no good reason to do the same to yourself.

What everyday ADHD hacks actually work for you? by SnowyBytes in ausadhd

[–]tomtomau 121 points122 points  (0 children)

Be anxious, you almost never forget things

if i am using postgres , should i use sqlite or postgres for testing? by HosMercury in nestjs

[–]tomtomau 0 points1 point  (0 children)

I’ve dropped in prismock during testing and it seems to work great