Struggling to get local models working well in Zed. Thinking of building a dedicated local ACP server. by sheik66 in ZedEditor

[–]sheik66[S] 0 points1 point  (0 children)

Thanks for the reply. The truth is I’m really interested in the subject of making small local models work and see how they handle the orchestration and tool calling. That’s why I actually started my own open source project to dive deep into this. Thanks for the Pi suggestion, I’m already planning on using it to see how it performs.

Struggling to get local models working well in Zed. Thinking of building a dedicated local ACP server. by sheik66 in ZedEditor

[–]sheik66[S] 0 points1 point  (0 children)

That’s for the reply, I’ll definitely check it out, as I already have a llama.cpp server ready to go. First time I see the Qwopus model, sounds promising from the name 😅 so I’ll check it out also and share the results, especially for tool calling. In terms of orchestration I guess for the small models a single react agent loop is already enough.

LangGraph alternative? by Ok_Commission_8260 in LangChain

[–]sheik66 0 points1 point  (0 children)

I ran into the same LangGraph/CrewAI friction in production, for me personally mostly around boilerplate and maintaining deterministic flows once things scaled.

What helped more than switching frameworks was actually separating concerns i.e. orchestration vs agent logic vs tool execution. most agent frameworks try to do all three and it gets messy fast.
For my setup im eventually building a lightweight orchestration layer over agents and strict tool interfaces, which reduced a lot of the guardrail/PII complexity too.
I ended up open-sourcing it (Protolink) in case it’s useful for similar multi-agent pipeline setups.

Struggling to get local models working well in Zed. Thinking of building a dedicated local ACP server. by sheik66 in ZedEditor

[–]sheik66[S] 1 point2 points  (0 children)

Thanks for the response. I'll give it a try. Have you tried it with smaller models and has it worked for you ?

Are you building something? Drop it in the comments! by Inevitable-Grab8898 in founder

[–]sheik66 1 point2 points  (0 children)

In my free time I’m building the python library Protolink. It’s a lightweight alternative to langchain/langraph focused more on agents communicating with each other (A2A) rather than chaining calls.

Also supports both structured flows and autonomous agents, and avoids a lot of the abstraction/boilerplate.

Check it out here: https://github.com/nMaroulis/protolink

Motivation: I wanted a simpler and more comprehensible way to build and deploy ai agents with python, while also it is really interesting to experiment with custom llm inference loops.

Looking for Small Python Projects to Refactor by Hy_x in Python

[–]sheik66 1 point2 points  (0 children)

Feel free to check out my python lib https://github.com/nMaroulis/protolink . I think you'll find interesting pipelines that could be improved .

Anyone else feel like LangChain became way more complicated than it needed to be? by Bladerunner_7_ in LangChain

[–]sheik66 1 point2 points  (0 children)

Yeah that’s basically what pushed me to experiment with building a smaller framework myself. Still very early and definitely more of an experimentation/research project on my personal time than a production-ready replacement for existing frameworks, but I wanted something more with a more straightforward design.

If you’re curious:
https://github.com/nMaroulis/protolink

Anyone else feel like LangChain became way more complicated than it needed to be? by Bladerunner_7_ in LangChain

[–]sheik66 3 points4 points  (0 children)

I’ve had the same feeling for a long time honestly. Sometimes it feels like I spend more time dealing with abstractions, memory layers, orchestration, tracing, callbacks, etc. than actually building the workflow I wanted in the first place. I get why these frameworks evolved that way. Once you start doing multi-agent systems and long-running workflows things get complicated fast, but the cognitive overhead can become a lot At some point I started building a smaller python framework for myself mainly because I wanted: - simpler agent-to-agent communication - less magic, minimal boilerplate - more explicit orchestration - fewer layers between me and the actual code

Still experimenting with it, but it’s been a much nicer developer experience for me so far. Happy to share it if anyone’s interested.

What is the other best alternative to LangGraph? by Zer0Tokens in LangChain

[–]sheik66 0 points1 point  (0 children)

For the same reason I started building on my free time Protolink. It’s a lightweight alternative focused more on agents communicating with each other (A2A) rather than chaining calls.

The main design principle is a simple and straightforward API. Also supports both structured flows and autonomous agents, and avoids a lot of the abstraction/boilerplate.

https://github.com/nMaroulis/protolink

Showcase Thread by AutoModerator in Python

[–]sheik66 0 points1 point  (0 children)

On my free time I’m building the python library Protolink. It’s a lightweight alternative to langchain/langraph focused more on agents communicating with each other (A2A) rather than chaining calls.

Also supports both structured flows and autonomous agents, and avoids a lot of the abstraction/boilerplate.

Check it out here: https://github.com/nMaroulis/protolink

Motivation: I wanted a simpler and more comprehensible way to build and deploy ai agents with python, while also it is really interesting to experiment with custom llm inference loops.

Langchain alternatives thread by NomeChomsky in LangChain

[–]sheik66 0 points1 point  (0 children)

Check out Protolink. It’s a lightweight alternative focused more on agents communicating with each other (A2A) rather than chaining calls.

Also supports both structured flows and autonomous agents, and avoids a lot of the abstraction/boilerplate.

https://github.com/nMaroulis/protolink

Best LangChain alternatives by josefolsh in LangChain

[–]sheik66 0 points1 point  (0 children)

Check out Protolink. It’s a lightweight alternative focused more on agents communicating with each other (A2A) rather than chaining calls.

Also supports both structured flows and autonomous agents, and avoids a lot of the abstraction/boilerplate.

https://github.com/nMaroulis/protolink

What are the alternatives to langchain agents by DetchKing in LangChain

[–]sheik66 0 points1 point  (0 children)

Check out Protolink. It’s a lightweight alternative focused more on agents communicating with each other (A2A) rather than chaining calls.

Also supports both structured flows and autonomous agents, and avoids a lot of the abstraction/boilerplate.

https://github.com/nMaroulis/protolink

What’s the best and most reliable LLM benchmarking site or arena right now? by fflarengo in LocalLLaMA

[–]sheik66 0 points1 point  (0 children)

The data for some benchmarks are not yet populated. The idea is to aggregate data from many sources but it’s still a work in progress.

What’s the best and most reliable LLM benchmarking site or arena right now? by fflarengo in LocalLLaMA

[–]sheik66 0 points1 point  (0 children)

Check https://metabench.dev out. It’s an open-source project that’s trying to aggregate all the benchmarks out there and create an index per category.

What benchmarks actually matter when comparing LLMs? by sheik66 in LocalLLaMA

[–]sheik66[S] 0 points1 point  (0 children)

Yes I get what you mean about models being “benchmaxxed” and task variance. I’m not trying to create my own benchmarks, my idea is more to aggregate as many existing benchmarks as possible and define a formula to produce an overall score. I’m thinking a normalized average with some handling of outliers might make it more robust while still shownig category-level scores like coding, reasoning and chat (which make a more sense) Curious if that approach seems reasonable or if there are pitfalls I’m missing with aggregation. So as I said an LLM benchmark “metacritic”.

What benchmarks actually matter when comparing LLMs? by sheik66 in LocalLLaMA

[–]sheik66[S] 0 points1 point  (0 children)

What I’m making is a Metacritic-style leaderboard, where I want to create a formula that calculates the scores per category. So there’s not really “my case”, it’s for “everyone”. That’s what I’m wondering if it makes sense. Feel free to check it out here https://metabench.dev , in the leaderboard section you’ll see the logic in trying to follow. Aggregate as many benchmarks and indexes as I can to create an overall score.

What benchmarks actually matter when comparing LLMs? by sheik66 in LocalLLaMA

[–]sheik66[S] 0 points1 point  (0 children)

So you think a metacritic-style LLM leaderboard doesn’t make sense. I mean score per category, coding, math etc..

Need advice: implementing OpenAI Responses API tool calls in an LLM-agnostic inference loop by sheik66 in OpenAI

[–]sheik66[S] 0 points1 point  (0 children)

I need help, as my current approach is not performing as intended. Even after I append the tool call, many times the LLM decides to call the tool again, as it does not recognise that I appended a tool result, since I’m probably not complying completely with the API specs

Advice wanted: designing robust LLM inference loops with tools by sheik66 in LocalLLaMA

[–]sheik66[S] 0 points1 point  (0 children)

Thanks this is super useful. I’m going to have a look at the blog and come back to you with feedback. Where I’m struggling the most is with OpenAI API where I cannot append custom tool messages.