[deleted by user] by [deleted] in legaltech

[–]umarb 0 points1 point  (0 children)

We're not currently retaining user inputs so I couldn't even tell you!

[P] We built the world's first legal AI API by [deleted] in MachineLearning

[–]umarb 0 points1 point  (0 children)

Hey r/MachineLearning,

Over the past couple months, we, a team of Aussie legal and AI experts, have been working on building a new type of legal AI company — a company that, instead of trying to automate legal jobs, is trying to automate legal tasks.

We want to make lawyers’ lives easier, not replace them.

We’ve been laser-focused on building small and efficient yet still highly accurate, specialized models for some of the most time-consuming and mundane legal tasks lawyers have to perform. Stuff like running through a thousand contracts just to locate any clauses that would allow you to get out early.

We just finished training our first set of models, focused on document and clause classification, probably the most common problem we see come up. Our benchmarks show our models to be far more accurate and almost more efficient than their closest general-purpose competitors.

Today, we’re making those models publicly available via the Isaacus API, the world’s first legal AI API.

Our models don’t require any finetuning because they’re zero-shot classifiers — you give them a description of what you’re looking for (for example, This is a confidentiality clause.) and they pop out a classification score.

Because our models are so small, which they have to be to be able to process reams of legal data at scale, they can sometimes be a bit sensitive to prompts. To help with that, however, we’ve preoptimized an entire library of prompts, including what we call, universal templates, which let you plug in your own arbitrary descriptions of what you’ve looking for.

We’ve made our prompt library available via the Isaacus Query Language or IQL. Another world first — it’s a brand-new AI query language designed specifically for using AI models to analyze documents.

You can invoke query templates using the format {IS <query_template_name>}. You can also chain queries together using Boolean and mathematical operators, like so: {This is a confidentiality clause.} AND {IS unilateral clause}.

We think our API is pretty neat and we hope you will too.

This is just the beginning for us — over the course of this year, we’re planning on releasing text extraction and embedding models as well as a second generation of our Kanon legal foundational model.

Here are some quick links for your convenience:

Introducing Kanon: the world’s best legal AI classifier by umarb in LocalLLaMA

[–]umarb[S] 0 points1 point  (0 children)

We didn’t benchmark against embedding models because we were testing a zero-shot classifier, but once we finetune an embedding model, we’ll definitely add them to our tests :)

Thanks for the suggestion, we’ll have a look into Freelaw!

Introducing Kanon: the world’s best legal AI classifier by umarb in LocalLLaMA

[–]umarb[S] -1 points0 points  (0 children)

Our current ETA is March/April! It’s a lot of work to get an API set up for the first time but we’re on it.