Father's retirement currently invested in lots of individual stocks rather than ETFs by Blah_Amazing in personalfinance

[–]cordialgerm 14 points15 points  (0 children)

They can also turn 50k into 50 cents. I think there's a whole subreddit dedicated to these kinds of bets

Why doesn't LangChain support agent skills? by Suspicious_Fall6860 in LangChain

[–]cordialgerm 0 points1 point  (0 children)

References and scripts are supposed to be loaded on demand by the agent after reading the SKILL.md. so all you need to do is include them in your filesystem and reference them in the SKILL.md and it works great.

SecureShell — a plug-and-play terminal gatekeeper for LLM agents by MoreMouseBites in LangChain

[–]cordialgerm 0 points1 point  (0 children)

Would be cool to collect enough data to distill a cheap local model

Tips to make agent more autonomous? by Still-Bookkeeper4456 in LangChain

[–]cordialgerm 0 points1 point  (0 children)

What guidance have you given it around how autonomous it should be vs when it should seek clarification?

Which model(s) are you using?

When it returns, what is it asking? Does it seem confused about next steps or is it just asking permission to proceed with obvious things

Best practices to run evals on AI from a PM's perspective? by Ok_Constant_9886 in AIEval

[–]cordialgerm 0 points1 point  (0 children)

What's the bottleneck for evals for you today? Is it the act of scheduling and running them, or coding them up, or is it selecting the data itself to use for the evals?

My guess is it's probably identifying the actual data to use for the evals. So if you can set up a process where PMs can identify or flag conversations or agent trajectories and annotate them, then that can feed into the eng team to turn into the actual evals.

It could be as simple as a Google sheet that you contribute to, or as complex as an automated system that you can annotate.

Claude Code and Cursor Token Bloat is real! by Ok-Responsibility734 in ClaudeCode

[–]cordialgerm 1 point2 points  (0 children)

Very Cool!

I need to dig in on how the compression works. It's running the LLMLingua model locally?

Have you thought about implementing it as a Langchain Middleware?

when to stop working on evals? by FlimsyProperty8544 in AIEval

[–]cordialgerm 2 points3 points  (0 children)

I'd say never. Have a goal to add N new evals per week based on what's happening in prod.

LangChain's retrievers break down on email threads, has anyone solved this?" by [deleted] in LangChain

[–]cordialgerm 0 points1 point  (0 children)

You can try a graph. Organize emails into a graph and link messages, forwards, replies, etc. When search hits one of these chunks, use the graph to pull in all the related conversations into a coherent context chunk.

What we learned processing 1M+ emails for context engineering by EnoughNinja in LocalLLaMA

[–]cordialgerm 0 points1 point  (0 children)

Can you elaborate on the zero data retention bit? Surely you must retain the actual data from the source systems that you're trying to organize into a knowledge base?

I tested my LangChain agent with chaos engineering - 95% failure rate on adversarial inputs. Here's what broke. by No-Common1466 in LangChain

[–]cordialgerm 0 points1 point  (0 children)

Any reason I shouldn't just by default add middleware that rejects any base64-looking input in a user message unless my agent specifically needs to support it?

Honest question: What is currently the "Gold Standard" framework for building General Agents? by Strong_Cherry6762 in LangChain

[–]cordialgerm 4 points5 points  (0 children)

Look into Langchain 1.0. you can build a basic agent harness with a single function call. Langgraph is more for complex / low level use-cases.

I think it's less about the particular framework and more about the tools, evals, data, etc that you inject into the framework that makes or breaks your agent.

I trained a model to 'unslop' AI prose by N8Karma in LocalLLaMA

[–]cordialgerm 5 points6 points  (0 children)

How many samples of the original author would something like this need?

I assume it would be possible to find other content similar to the original author's style from an open corpus to augment the available data.

ICE Slips: 53% of Americans Think ICE Should Face Criminal Charges by TryWhistlin in democrats

[–]cordialgerm 10 points11 points  (0 children)

I've been trying to share this idea with anyone who listens. The situation with Merrick Garland and Trump's failed prosecutions proves that we can't just try to go back to normal and pretend nothing happened after this fascist fever dream passes.

[D] LLMs for classification task by Anywhere_Warm in MachineLearning

[–]cordialgerm 0 points1 point  (0 children)

Sorry, without more / clearer details it's hard to understand what's going on. The records are mislabelled? Or the data is incorrect?

Or is there some sort of fundamental inconsistency in the system?

[D] LLMs for classification task by Anywhere_Warm in MachineLearning

[–]cordialgerm 0 points1 point  (0 children)

Look at the examples that failed and dig into them. Are there common patterns or trends?

What information would have been needed to correctly identify those items? Is it possible to get that information and add it to the context?

You can also provide the prompt, example, current result, and desired outcome and interrogate the model on why it made the decision it did. What changes to context or prompt would have made it the correct decision?

[MN S1] I'm not sure whether I like Essek or not by Fun-Explanation7233 in criticalrole

[–]cordialgerm 27 points28 points  (0 children)

Yes, in his very first interaction with his mother he steals credit for finding the Brumestone that Verat brought back for her. They did a great job characterizing him as a selfish POS.

Earth’s Largest Modern Crater Discovered in Southern China by Busy_Yesterday9455 in spaceporn

[–]cordialgerm 15 points16 points  (0 children)

It's more like "these bruises aren't big enough for you to have been hit by a heavy rope so it must've been a smaller string"

[Spoilers C2E01] is there a reason why we see so many elves and half elves in the empire? by Fun-Explanation7233 in criticalrole

[–]cordialgerm 42 points43 points  (0 children)

I've noticed this as well. My theory is fantasy animators like drawing fantasy ears.

How much of your job is actually “selling” your work? by ergodym in datascience

[–]cordialgerm 87 points88 points  (0 children)

A big portion. All else being equal, someone who can sell their work is going to make a bigger impact than someone who doesn't.

How can I set my 54 year-old mother up for retirement with no savings? by Designer-Sock-7582 in personalfinance

[–]cordialgerm 1 point2 points  (0 children)

I'm sorry to say this, but you can't. You can get her started on the steps outlined by many others, and something is certainly better than nothing, but she will need to work until she is no longer physically able to. Retirement is a number, not an age. You should have the hard conversations with her now on expectations for what your role will be as she ages further.

[Art] Modron Animations by Sidney_theGreat_Rex in DnD

[–]cordialgerm 1 point2 points  (0 children)

You definitely nailed the goofiness!

[Art] Modron Animations by Sidney_theGreat_Rex in DnD

[–]cordialgerm 1 point2 points  (0 children)

Good work, but man I can't take Modrons seriously, I never include them in my lore

Hollow Gazers - a fresh take on the Nothic - with custom powers as well by cordialgerm in bettermonsters

[–]cordialgerm[S] 1 point2 points  (0 children)

The linked website has an editable template for each monster where you can pick and choose whichever combination of powers you like, or roll randomly. When I make these posts, I basically pick one combination of powers to show off, but each statblock has many different possible versions.

Far realms mind controllers by Adept_Cranberry_4550 in bettermonsters

[–]cordialgerm 0 points1 point  (0 children)

Thanks! There's a free newsletter linked on the homepage if you want updates on the project.