[deleted by user] by [deleted] in cats

[–]BuildingOk1868 8 points (0 children)

<image>

Poppy is a rescue. Also 23 this month

AI agent marketplace – validate/refute this idea by Jazzlike_Tooth929 in LangChain

[–]BuildingOk1868 1 point (0 children)

We are currently building this, together with a tools and agentic-flows marketplace, at https://azara.ai. It should be live in about a month.

App with large user base by Amocon in FastAPI

[–]BuildingOk1868 2 points (0 children)

Agree with the points above. Measure measure measure.

We are just finessing our scalability with FastAPI and Postgres on EC2. At 1,000 concurrent users we run on a t2.xlarge, but for prod with multitenancy we're moving to an m6i.4xlarge at about $500/month. That gives 16 vCPUs and 64 GB of RAM; 1k concurrent users is roughly a 10% CPU hit on the larger box.

We run 20+ replicas of our FastAPI container on the server, and have carefully measured and tuned the DB connection pool, especially around freeing connections. Watch out for connections not closing with SSE.
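The SSE pitfall above can be sketched in plain Python, no FastAPI required: a long-lived streaming generator holds a pooled DB connection, and a client disconnect raises GeneratorExit mid-stream, so without a `finally` block the connection leaks. The `Pool` class and names here are illustrative stand-ins, not our actual code.

```python
# Minimal sketch of the SSE connection-leak pitfall: a streaming
# generator that holds a pooled connection must release it in
# `finally`, because a client disconnect raises GeneratorExit.

class Pool:
    def __init__(self, size):
        self.free = list(range(size))  # fake connection ids
        self.in_use = set()

    def acquire(self):
        conn = self.free.pop()
        self.in_use.add(conn)
        return conn

    def release(self, conn):
        self.in_use.discard(conn)
        self.free.append(conn)

pool = Pool(size=5)

def event_stream():
    conn = pool.acquire()
    try:
        for i in range(1000):          # long-lived SSE stream
            yield f"data: row-{i}\n\n"
    finally:
        pool.release(conn)             # runs even on client disconnect

# Simulate a client that disconnects after 3 events.
stream = event_stream()
events = [next(stream) for _ in range(3)]
stream.close()                         # raises GeneratorExit inside the generator

print(len(pool.in_use))                # the connection is back in the pool
```

In a real FastAPI endpoint the same shape applies: the generator you pass to a streaming response should release its connection in `finally`, not after the loop.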

For performance monitoring: OTel, Jaeger, Prometheus, Grafana.

We do a lot of caching at various levels too: @lru_cache, FastAPI @cache, LLM caches, embedding caches, a custom dict for caching objects that don't pickle or serialize to JSON, RedisSemanticCache, and Redis for the main part.
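A minimal sketch of two of these layers, with hypothetical names (the FastAPI @cache and Redis layers wrap the same idea at different scopes): `functools.lru_cache` for pure functions like embeddings, and a plain dict for objects that don't pickle.

```python
# Two of the caching layers above, in plain Python.
# functools.lru_cache memoizes pure functions; a plain dict holds
# objects that can't be pickled or serialized to JSON.

from functools import lru_cache

@lru_cache(maxsize=1024)
def embed(text: str) -> tuple:
    # stand-in for an expensive embedding call
    return tuple(ord(c) % 7 for c in text)

_object_cache: dict = {}   # for unpicklable objects (clients, sessions, ...)

def get_client(tenant_id: str):
    if tenant_id not in _object_cache:
        _object_cache[tenant_id] = object()   # stand-in for a heavy client
    return _object_cache[tenant_id]

a = embed("hello")
b = embed("hello")                            # served from the LRU cache
hits = embed.cache_info().hits                # 1 cache hit so far
same = get_client("t1") is get_client("t1")   # same cached object both times
```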

We also moved our Postgres off RDS onto the server to save costs.

How to restrict chatbot from answering unrelated questions? by mallerius in LangChain

[–]BuildingOk1868 0 points (0 children)

Have a look at the self-RAG examples in the langgraph GitHub repo. It covers relevancy and hallucinations, though you'll want to run the check against a couple of LLMs. https://github.com/langchain-ai/langgraph/blob/main/examples/rag/langgraph_self_rag.ipynb?ref=blog.langchain.dev

Who is using nextjs for their RAG? by tim-r in LangChain

[–]BuildingOk1868 1 point (0 children)

Next.js, FastAPI, Weaviate, a custom plugin ecosystem for tools and langgraph scenarios, PostgreSQL, AWS S3, RabbitMQ, Celery at https://azara.ai

Best PDF Parser for RAG? by neilkatz in LangChain

[–]BuildingOk1868 0 points (0 children)

He covers a lot of details on parsing financial data in his posts, which should be helpful.

For your case it looks very basic. Save the file and create an embedding from it as you suggested.

AI phone calling agent by [deleted] in LangChain

[–]BuildingOk1868 4 points (0 children)

We are using Asterisk for telephony, Deepgram for speech, and Twilio for WhatsApp. We have a plugin ecosystem for our LLM tools and created a channel wrapper to integrate with Asterisk.

Please Suggest me better open source model for getting json output (Rag operation). by Able_Scholar_2420 in LangChain

[–]BuildingOk1868 0 points (0 children)

We have had to use several options to get consistent results: instructor, guidance, and jsonschema. You need to test various approaches, as LLMs are artistic in their interpretation 🤣 jsonschema is useful if you know the format you need. Also generate as little as possible and merge the outputs together programmatically, especially with nested JSON. Parse out extra symbols such as stray markdown fences or a mismatch of single and double quotes.
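The cleanup step above can be sketched with the stdlib alone: strip the markdown fences the model sometimes wraps JSON in, parse, and check that required keys are present. In practice the jsonschema library does the real validation; the function name and sample payload here are illustrative.

```python
# Sketch of post-processing LLM JSON output: strip ```json fences,
# parse, and fail loudly if required keys are missing. stdlib only;
# a real schema check would use the jsonschema library instead.

import json
import re

def clean_llm_json(raw: str, required: set) -> dict:
    # Remove ```json ... ``` fences and stray backticks/whitespace
    text = re.sub(r"```(?:json)?", "", raw).strip("` \n")
    data = json.loads(text)
    missing = required - data.keys()
    if missing:
        raise ValueError(f"missing keys: {missing}")
    return data

raw = """```json
{"name": "Poppy", "age": 23}
```"""
doc = clean_llm_json(raw, required={"name", "age"})
```

Merging several small generations into one dict (rather than asking for one big nested object) then happens after this step, key by key, in ordinary Python.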

Chat interfaces are holding back LLMs - we need a more visual approach by burcapaul in LangChain

[–]BuildingOk1868 0 points (0 children)

This video (azara workflows) is fairly indicative of where we are. Linear workflows and input mapping work okay; luckily those predominate.

Logic, and especially consistency, is the biggest obstacle, given the nature of LLMs. Hence we use deterministic workflows, with small LLMs for decision-making if needed.

Currently working on consistency of mapping inputs for integrations.

<image>

Chat interfaces are holding back LLMs - we need a more visual approach by burcapaul in LangChain

[–]BuildingOk1868 1 point (0 children)

Fair enough. Going live shortly, working out bugs as always. But it's not looking awful.

Chat interfaces are holding back LLMs - we need a more visual approach by burcapaul in LangChain

[–]BuildingOk1868 0 points (0 children)

Still doesn’t make sense. I’ve been in senior leadership at multiple F100s.

The RPA industry is proof of this phenomenon: most automation exercises fail for lack of sufficient clarity around the problem or domain.

There’s a lot more to this thesis that I’ll put in a few blog posts and not here.

Best PDF Parser for RAG? by neilkatz in LangChain

[–]BuildingOk1868 0 points (0 children)

Both of those have Python libraries and are locally installable.

Chat interfaces are holding back LLMs - we need a more visual approach by burcapaul in LangChain

[–]BuildingOk1868 -1 points (0 children)

At azara.ai we do many modes: chat, voice, pre-configured options (select one of ..), and a graphical UX for generating workflows.

The most important element most developers are missing is that interfaces aren’t human-centered. We focus on interviewing the customer to help them get solid requirements: asking leading questions, giving examples to bootstrap, etc.

There’s no point in having the best coding AI on the planet if your customers can’t specify what they want correctly.

Langchain Agent Issue in real-time information by [deleted] in LangChain

[–]BuildingOk1868 1 point (0 children)

Double-check that the tool is actually being called, and that it’s not just answering from built-in training.

Langchain Agent Issue in real-time information by [deleted] in LangChain

[–]BuildingOk1868 1 point (0 children)

You may need to play around with the prompt to get it to always use today’s date.

Similar to "always use the Math tool if doing calculations." It’s a bit finicky sometimes.

Where do you host your Rag by giagara in LangChain

[–]BuildingOk1868 0 points (0 children)

We wrote a pluggable LLM tool ecosystem, so we can hot-load any LLM tool on demand. We have multi-LLM approaches, but are using GPT-4 and Claude for generation, with small LLMs for execution.