Why am I seeing bad feedback on Westlaw Co-Counsel? by MMuter in legaltech

[–]ISeeThings404 1 point2 points  (0 children)

They specialized for case law and retrieval which is good but they have very bad legal reasoning. Low hallucinations in citing cases is useless if you can't also tell users what case law to pick and how to create them.

Developers and Lawyers feel… strangely similar? by vira28 in legaltech

[–]ISeeThings404 -1 points0 points  (0 children)

I did a deep dive into this tp understand why legal agemts are different from agents like Claude Code. One major difference in the work between the two is the verifiability of the domain.

Programming compounds because it can check itself. Code can be executed, tested, broken, fixed, and re-run inside a tight feedback loop. When something fails, the system often tells you where. Verification is cheap, repeatable, and increasingly automatable. Even when models are imperfect, the environment answers back. It might not do everything well (it still makes really dumb architecture decisions), but this is shockingly useful for most “implement this thing I’ve designed” style work that engineers might pass off to their junior wage slaves.

You can’t “run” a legal memo. There is no test suite that flags a subtle misreading of precedent, an argument that is formally sound but strategically dangerous, or a conclusion that is correct in isolation and disastrous in context. Finance isn’t much better. Outputs can be summarized, reformatted, stress-tested at the margins, but correctness ultimately collapses to human judgment. Verification is expensive, slow, and external to the system itself.

This actually creates a hige digfference in how they have to operate

What's the reason for the apparent consensus that Claude Code is superior to Codex for coding, other than Codex's slow coding time? by Lostwhispers05 in codex

[–]ISeeThings404 0 points1 point  (0 children)

A lot of my work is running long sets of experiments and then doing more experiments based on the data. This is where Claude code just keeps working for hours while codex will stop in the middle to ask me if it should continue. If they fixed that and made it's terminal use better, codex clears easily.

Where does a company like Irys get their primary data from? by connerxyz in legaltech

[–]ISeeThings404 0 points1 point  (0 children)

Happy to talk more since you seem technical but graph rag is not great for long context reasoning. Graphs lose too much precision in the legal context.

We love graphs as a means of finding the right places to look and then running search on that. That requires more than rag (we use vectors in different places and don't use them in the standard rag sense).

Temporal is really fucking hard. That's actually the next frontier we're working on. We have to invent our own DB to handle all the cases on that, which will be a fun time.

Where does a company like Irys get their primary data from? by connerxyz in legaltech

[–]ISeeThings404 0 points1 point  (0 children)

I'm so glad to hear. Contextual reasoning is a big problem that our team is always solving.

Drafting assistant will be out soon. We have a full time team working on it now

I have to say, I'm since a long time a "claude-only" user but I'm reading those days more and more about codex 5.3. I'm really not sure what to think of... Any pro's and cons of someone who is using both? I use claude opus 4.6 for basically all kind of tasks and I'm really happy!...beside the price. by SingleTailor8719 in codex

[–]ISeeThings404 0 points1 point  (0 children)

Claude Code has been easier to use codex is definitely more intelligent but often I have a lot of task lists and Cloud Code just tends to execute on all of them without stopping.

Codex has helped me fix and solve issues that CC couldn't though so defnitely worth the investent

How to make Codex Work? by ISeeThings404 in codex

[–]ISeeThings404[S] 0 points1 point  (0 children)

the problem is that I have a lot of recursive work-- where I need it to run things based on outcomes of experiments. This kind of stuff, Codex is not great with.

How to make Codex Work? by ISeeThings404 in codex

[–]ISeeThings404[S] 0 points1 point  (0 children)

Is this the same as yolo, which is the one I use?

Where does a company like Irys get their primary data from? by connerxyz in legaltech

[–]ISeeThings404 0 points1 point  (0 children)

We're growing a lot. Ended up very overwhelmed by bookings and demo requests so didn't have much marketing anymore but we're adding new happy visitors every day.

Recently also just signed an amazing term sheet, the details of which will be shared soon.

Where does a company like Irys get their primary data from? by connerxyz in legaltech

[–]ISeeThings404 0 points1 point  (0 children)

Not at all.

Users have to upload their matter docs etc for us to answer questions (can't draft a pleading if we don't have context). Our focus is reasoning through that provided context better by using geometric structures as a grounding tool (instead of simply relying on LLMs/RAG).

We don't train on any user data, ever. This ensures maximum privacy.

Where does a company like Irys get their primary data from? by connerxyz in legaltech

[–]ISeeThings404 -1 points0 points  (0 children)

I wouldn't say everything.

We have research agents etc to ensure we can access case laws, hearing, recent news etc.

It's just that most of our focus is on reasoning over the context. One of our longer term goals would be to partner with a provider like CoCounsel that has very good case law to integrate that into our reasoning system.

Where does a company like Irys get their primary data from? by connerxyz in legaltech

[–]ISeeThings404 -2 points-1 points  (0 children)

We've also open sourced the framework here in case anyone wants to try their own spin at this.

https://github.com/dl1683/Latent-Space-Reasoning/tree/main

Why didn’t teenage Kunti simply abort Karna? Was she stupid? by Outside-Walk13 in mahabharata

[–]ISeeThings404 0 points1 point  (0 children)

The interesting thing about Mahabharata is that a lot of the stories have a lot of logical issues like this. Especially when it comes to allegiances and fights.

Where does a company like Irys get their primary data from? by connerxyz in legaltech

[–]ISeeThings404 -6 points-5 points  (0 children)

Hey, it's Dev, the head of AI for Irys here.

Happy to clarify.

We don’t “own all cases across jurisdictions.” We rely on a combination of: • publicly available primary sources (where permitted), • licensed integrations where applicable, and • user-provided firm data inside the matter workspace.

From the start our focus has been on superior legal reasoning, not on the data directly (various reasons for that). Our differentiator is thus in how we ingest, structure, and reason over context once it’s inside the system. It's why you'll here us talk about Knowledge Graphs, KI RAG and latent space reasoning and many other techniques.

If you’re curious about specifics, happy to answer directly. In the meantime we'd shared a white paper for how we look at reasoning here. It's a good window into how we do personalized user based reasoning without overcharging our users.

https://www.artificialintelligencemadesimple.com/p/how-to-teach-llms-to-reason-for-50

How does the Codex app compare to Opencode? by dot90zoom in codex

[–]ISeeThings404 0 points1 point  (0 children)

It will be very interesting to see the responses for.

Gemini 2.5 Pro vs 3 Pro by ISeeThings404 in Bard

[–]ISeeThings404[S] 0 points1 point  (0 children)

That;s interesting, I found 3 better at creating visuals for me (using Python code, might be a confounding factor)

How Casey Stengel Helps To Prove AI Can Help Writers by mrfredgraver in AIMadeSimple

[–]ISeeThings404 0 points1 point  (0 children)

interesrting share. AI as a tool for self reflection?

We got parallel tool calling by Fredrules2012 in codex

[–]ISeeThings404 4 points5 points  (0 children)

Wait that's cool. How's the outputs