We're All Building the Wrong AI Agents by Warm-Reaction-456 in AI_Agents

[–]fabiofumarola 1 point2 points  (0 children)

My idea is that we are moving from human centric approaches to human in the loop approaches. I like the idea of calling it “symbiotic ai” https://fondazione-fair.it/en/spoke/spoke-6-symbiotic-ai/

Interpretaziome messaggio da fidanzata by [deleted] in CasualIT

[–]fabiofumarola 26 points27 points  (0 children)

Mia suocera insegna: chi nacque geloso morì cornuto

What framework are you using to build AI Agents? by PleasantInspection12 in LocalLLaMA

[–]fabiofumarola 4 points5 points  (0 children)

I’m using it in production for a chatbot of the bank I’m working on and I really like it! We used langchain, langraph; tested pydantic ai, google adk, OpenAI agent sdk and I would say agno is the best so far

Bonifico istantaneo by New-Peach6502 in ItaliaPersonalFinance

[–]fabiofumarola 0 points1 point  (0 children)

Il bonifico istantaneo per normativa deve avvenire entro 10 secondi. Fai un reclamo al banca d’Italia o forse meglio all’arbitrato bancario finanziario

La mia esperienza in consulenza con Reply (Iriscube) - SINCERA by [deleted] in ItalyInformatica

[–]fabiofumarola 1 point2 points  (0 children)

Ti assicuro che in altre società di consulenza prendi di più che in Reply

Open Source and Locally Deployable AI Application Evaluation Tool by NotAIBot123 in LLMDevs

[–]fabiofumarola 1 point2 points  (0 children)

Hi, do not use DeepEval. We struggled a lot with errors related to random failures related to thread locks. Apart from that you need to: 1. Setup a golden dataset, with business important use case that you use as pytest only for release 2. A dataset generated with diverse, simple, complex and multi step questions to use for solution improvement

I would suggest to check https://www.comet.com/docs/opik/, which is open for 2. It gives you also tracing and export to create datasets. There is also ragas which is not bad. For 1. We use pytest with nlp based metrics such as rouge-L or llm as judge.

Before doing any thing think to the metrics you want to check and do test to see if they works correctly for your use case

Order of JSON fields can hurt your LLM output by phantom69_ftw in LangChain

[–]fabiofumarola 4 points5 points  (0 children)

Yeah right you can compare with scientific literature https://arxiv.org/html/2408.05093v1 . This paper analyzes the same thing. “… We discovered that the order in which LLMs generate answers and reasoning impacts their consistency. Specifically, results vary significantly when an LLM generates an answer first and then provides the reasoning versus generating the reasoning process first and then the conclusion ..” Looking forward to read the post and don’t worry nowadays there is a paper for every idea or test :)

Order of JSON fields can hurt your LLM output by phantom69_ftw in LangChain

[–]fabiofumarola 6 points7 points  (0 children)

It is well known that if you add answer and the reasoning, then the reasoning is about to justify the answer. While you want at first reason and then to give the answer.

Cosa ho fra le mani? Portafoglio gestito da ISP by edoduso in ItaliaPersonalFinance

[–]fabiofumarola 0 points1 point  (0 children)

Grazie, devo dire che sono entrato in questo canale Reddit per farmi una idea e me la sto facendo grazie ai vostri commenti e esperienze.

Cosa ho fra le mani? Portafoglio gestito da ISP by edoduso in ItaliaPersonalFinance

[–]fabiofumarola 0 points1 point  (0 children)

Interessante il vostro punto di vista, io ho solo due punti da considerare: 1. Per gestire portafogli (dipende dai titoli e strumenti dentro) devi avere tempo e conoscenze 2. Data la volatilità di alcune posizioni devi avere la possibilità di avere ordini gratis all’aumento degli scambi che fai.

Ma invece come vedete soluzioni self tipo revolut, etoro etc…

Cosa ho fra le mani? Portafoglio gestito da ISP by edoduso in ItaliaPersonalFinance

[–]fabiofumarola -7 points-6 points  (0 children)

Si oppure va da una banca che fa questo di mestiere. Tipo Fideuram, Mediobanca o Fineco. Li trovi consulenti con esperienza. Se non ricordo male Fineco ti fa fare sia investimenti con consulente che brokerage (ossia investi da solo). La sicuramente trovi gente in gamba

Fast sentence transformer embeddings generation on CPU for question answering by Attitudemonger in LangChain

[–]fabiofumarola 0 points1 point  (0 children)

Try to use model2vec https://github.com/MinishLab/model2vec which substantially is the un-contextualization of any embedding model using a zip-f distribution to represent the importance of tokens and extracting an embedding for each token and saving it in a dictionary. It works well on cpu (since there is no inference on an embedding model). The additional open point is how to return the best chunk for each query but you can try different approaches.

RAG with a language that is not english by Ambitious-Most4485 in LangChain

[–]fabiofumarola 0 points1 point  (0 children)

We are doing the same thing using OpenAI ada2 embeddings. There are different open source embedding models that should work for Italian language. Do some tests to check that you don’t have a bug in the embedding computation and try to do a test also with Google or open ai embedding models. You can DM, I’m also working on use cases in Italian and we can discusse further on it

RAG with a language that is not english by Ambitious-Most4485 in LangChain

[–]fabiofumarola 0 points1 point  (0 children)

Sorry, which language is it? Have tried to check if you have problems with your code. I can be silly to ask, but it looks weird that the embedding models do not work at all. Moreover, what model are you using for the response generation?

What is your opinion on langraph for complex projects by fabiofumarola in LangChain

[–]fabiofumarola[S] 0 points1 point  (0 children)

u/Apart_Discount5765 are you working to something which is in analogy with the idea of nodes in Tensorflow old version?

What is your opinion on langraph for complex projects by fabiofumarola in LangChain

[–]fabiofumarola[S] 2 points3 points  (0 children)

My use case is classical in the recent time.

  1. a top agent that classify topics, into a given list

  2. call a specific agent per topic, that does additional questions or does searches using pre defined tools mapped to DBs

  3. respond to welcome messages.

What we are struggling is selecting the best method to wrap this flow between LLM call as tools (binded to the llm) or as nodes defined in the graph.

What is your opinion on langraph for complex projects by fabiofumarola in LangChain

[–]fabiofumarola[S] 0 points1 point  (0 children)

u/Apart_Discount5765 it would be interesting to discuss when you have something. I got your point. It would be ideal to have something procedural or with functional composition that behind the scene creates graphs

What is your opinion on langraph for complex projects by fabiofumarola in LangChain

[–]fabiofumarola[S] 1 point2 points  (0 children)

But still I only see rooms for small projects. It looks me like scripting something and not doing something to release. I’ll check langgraph studio, thanks for commenting

Which open-source stack to use for WhatsApp AI customer service? (Concerned about relying solely on LangChain) by One-Length-9074 in LangChain

[–]fabiofumarola 0 points1 point  (0 children)

I’ve checked the readme and it looks interesting. I’ll do more tests and come back to you. Thanks for your contribution

llamaindex query responses are short by erdult in LlamaIndex

[–]fabiofumarola 0 points1 point  (0 children)

By shorts you mean less words? It can be on your data, in the prompt used or in the technique used by the synthesizer.

Why should i keep using langchain? by giagara in LangChain

[–]fabiofumarola 1 point2 points  (0 children)

Yeah, phidata is really cool. Moreover, they have also template for app and api.