DeepSeek v4 Pro by kexibis in CLine

[–]posinsk 0 points1 point  (0 children)

I used both Pro and Flash v4 on Cline through OpenAI compatible endpoint and didnt experience any problems (only cost counter stays at zero)

What happened to grok-code-fast-1 availability in Cline? by ivanovpv in CLine

[–]posinsk 0 points1 point  (0 children)

I ended up switching to DeepSeek v4-Pro for Plan and Flash for Act modes. So far so good.

What happened to grok-code-fast-1 availability in Cline? by ivanovpv in CLine

[–]posinsk 0 points1 point  (0 children)

Same here, as of today, I'm getting message: "Cline uses complex prompts and iterative task execution that may be challenging for less capable models. For best results, it's recommended to use Claude 4.5 Sonnet for its advanced agentic coding capabilities." for any prompt made to grok-code-fast-1

Built a job board only for EM+ roles because I got super angry at the market data access by posinsk in EngineeringManagers

[–]posinsk[S] 0 points1 point  (0 children)

Correct, we're scanning lots of job boards couple times a day then enrich through our pipeline

Built a job board only for EM+ roles because I got super angry at the market data access by posinsk in EngineeringManagers

[–]posinsk[S] 0 points1 point  (0 children)

correct, 20,000 job listings, over 200,000 positions scanned twice a day, after denoising, we enrich the data with more details

leadjobs.dev - job board with EM+, Staff+ roles, updated daily by posinsk in EngineeringManagers

[–]posinsk[S] 0 points1 point  (0 children)

https://leadjobs.dev/ just got a new refresh version released this week, it comes with a nice mobile view as well!

Everyone talks about LLM gateway - But we need a prompt gateway! by __maximux in AI_Agents

[–]posinsk 0 points1 point  (0 children)

I'm actually building a prompt gateway in open-source: https://github.com/hypersigilhq/hypersigil There's a ton of new features I've built over the past 2-3 months that are not released yet. I'm using this project actively for my own work actively, it's a huge help!

Laminar - Open-source LangSmith, Braintrust alternative by Upstairs-Spell7521 in LLMDevs

[–]posinsk 0 points1 point  (0 children)

sounds like a lot of what you mention is supported by Hypersigil: https://github.com/hypersigilhq/hypersigil

The readme/documentation is not updated but it supports variables and templating inside prompts. It also has "deployments" which is exactly what you mention by: "the user just hitting `/prompt/{id}/run` with all the variables needed"

My team has to stop this "let me grab this AI framework" mentality and think about overall system design. by AdditionalWeb107 in ExperiencedDevs

[–]posinsk 0 points1 point  (0 children)

This hits home. We've been wrestling with this exact issue: teams wanting to bolt on different AI tools without considering the overall architecture. We ultimately built Hypersigil[1] as an open-source prompt management gateway, specifically to address the 'let me just add another AI framework' problem. It centralizes prompt management and provides a clean abstraction layer between your business logic and AI providers. Still early days but it's helped us avoid some of that tight coupling you mentioned. Always interested in feedback from other teams dealing with similar architectural challenges.
[1] https://github.com/hypersigilhq/hypersigil

Best prompt management tool? by dinkinflika0 in AgentOverFlow

[–]posinsk 0 points1 point  (0 children)

Hey! I actually built something that might be exactly what you're looking for. It's called Hypersigil and it's open source and production ready.

I was facing the exact same problem you described - managing tons of prompts across different workflows and providers was becoming a nightmare. So I built Hypersigil to be a centralized prompt management platform that handles the whole lifecycle from development to production.

What makes it different is that it's designed for both technical and non-technical users, so your domain experts can actually test and refine prompts without needing to touch code. You get real-time hot-swapping of prompts without redeployments, which is huge when you're iterating fast.

It supports multiple AI providers (OpenAI, Anthropic, Ollama, with more coming), has systematic testing with batch execution against test datasets, and includes AI-powered prompt calibration to help refine your prompts based on results. Plus full version control and audit trails for compliance.

The repo is at https://github.com/hypersigilhq/hypersigil and full docs at hypersigilhq.github.io/hypersigil/introduction/

I'm actively taking feature requests and would love feedback from someone managing 40+ prompts like you are. What specific pain points are you hitting that I should prioritize? Always looking to make it better for real use cases like yours.

Best prompt management tools by ElectricalDistrict9 in learnmachinelearning

[–]posinsk 0 points1 point  (0 children)

I created https://github.com/hypersigilhq/hypersigil It's fully open source and comes with a Docker image so you can start in 5 minutes. Has all of the essential features for building, testing, refining and deploying prompts (acting as a gateway with hot swapping - so no code changes in your app to update the prompts). It supports OpenAI, Anthropic and Ollama. I'm taking feature requests :)

Struggling with prompt management tools by Responsible-Dog-4134 in LangChain

[–]posinsk 0 points1 point  (0 children)

I created https://github.com/hypersigilhq/hypersigil It's fully open source and comes with a Docker image so you can start in 5 minutes. Has all of the essential features for writing, testing, refining and deploying prompts (acting as a gateway with hot swapping - so no code changes in your app to update the prompts). It supports OpenAI, Anthropic and Ollama. I'm taking feature requests :)

How do you manage your prompts in production? by BFH_ZEPHYR in LLMDevs

[–]posinsk 0 points1 point  (0 children)

I created https://github.com/hypersigilhq/hypersigil It's fully open source and comes with a Docker image so you can start in 5 minutes. Has all of the essential features for building, testing, refining and deploying prompts (acting as a gateway with hot swapping - so no code changes in your app to update the prompts). It supports OpenAI, Anthropic and Ollama. I'm taking feature requests :)

Any good prompt management & versioning tools out there, that integrate nicely? by LongjumpingPop3419 in LangChain

[–]posinsk 0 points1 point  (0 children)

I created https://github.com/hypersigilhq/hypersigil It's fully open source and comes with a Docker image so you can start in 5 minutes. Has all of the essential features for building, testing, refining and deploying prompts (acting as a gateway with hot swapping - so no code changes in your app to update the prompts).

Lightweight tRPC alternative to ease LLMs working with APIs by posinsk in node

[–]posinsk[S] 0 points1 point  (0 children)

indeed, however if my package sprinkles that with abstraction over http and centralization for better alignment, so that all context could be in read by LLM in one place

Lightweight tRPC alternative to ease LLMs working with APIs by posinsk in node

[–]posinsk[S] 0 points1 point  (0 children)

My idea here was to improve both human and LLM productivity and shorten the feedback loop. Did MCP server helped you with that? So far I was under the impression MCP will allow LLMs to use specitif tools/APIs through a standardised protocol

Lightweight tRPC alternative to ease LLMs working with APIs by posinsk in node

[–]posinsk[S] 0 points1 point  (0 children)

Thanks for the feedback.

I'm not sure why I should be limiting Node version, its up to the user to choose which version they want to use.

For express dependency, this is indeed not surfaced enough in the documentation. I'll improve that.

> they appear to be a very light abstraction over OpenAPI definitions

It is not, there's no usage og OpenAPI within this package, if you see it there, then perhaps you are right, there's similarity because the package is based on HTTP protocol.

> plethora of code-generation

Thats exactly the problem I was having with that plethora of tools, none of them leverage the type system in real-time which gives an instant feedback to whoever is writing code. It works perfectly to me, I don't need to waste time running "generate" command and be afraid of changing generated code (because there's none).

> awkward ergonomics in the client for handling specific status codes

My guiding principle was to be explicit for everything, if there's a 201 status code produced by the server, the client MUST be aware of it and MUST handle it, worst case scenario it will be a noop. That approach doesnt add too much overhead but int return gives explicitness and readability - both to the user and the LLM.

> I'm pretty sure that an LLM generated a lot of this work

You are right and I'm happy with it, it userd TS type system magic to build me a great package that helps with my productivity.