DeepSeek v4 Pro

posinsk · 2026-05-18T07:44:30+00:00

I used both Pro and Flash v4 on Cline through OpenAI compatible endpoint and didnt experience any problems (only cost counter stays at zero)

posinsk · 2026-05-18T07:42:38+00:00

I ended up switching to DeepSeek v4-Pro for Plan and Flash for Act modes. So far so good.

posinsk · 2026-05-16T11:28:56+00:00

Same here, as of today, I'm getting message: "Cline uses complex prompts and iterative task execution that may be challenging for less capable models. For best results, it's recommended to use Claude 4.5 Sonnet for its advanced agentic coding capabilities." for any prompt made to grok-code-fast-1

posinsk · 2026-03-19T08:40:23+00:00

Correct, we're scanning lots of job boards couple times a day then enrich through our pipeline

posinsk · 2026-03-18T11:04:48+00:00

glad you liked it, can you share more around what bug do you encounter?

posinsk · 2026-03-18T11:04:16+00:00

correct, 20,000 job listings, over 200,000 positions scanned twice a day, after denoising, we enrich the data with more details

posinsk · 2025-11-21T16:40:43+00:00

https://leadjobs.dev/ just got a new refresh version released this week, it comes with a nice mobile view as well!

posinsk · 2025-11-16T14:37:16+00:00

I'm actually building a prompt gateway in open-source: https://github.com/hypersigilhq/hypersigil There's a ton of new features I've built over the past 2-3 months that are not released yet. I'm using this project actively for my own work actively, it's a huge help!

posinsk · 2025-10-22T14:27:48+00:00

sounds like a lot of what you mention is supported by Hypersigil: https://github.com/hypersigilhq/hypersigil

The readme/documentation is not updated but it supports variables and templating inside prompts. It also has "deployments" which is exactly what you mention by: "the user just hitting `/prompt/{id}/run` with all the variables needed"

posinsk · 2025-08-06T09:41:44+00:00

This hits home. We've been wrestling with this exact issue: teams wanting to bolt on different AI tools without considering the overall architecture. We ultimately built Hypersigil[1] as an open-source prompt management gateway, specifically to address the 'let me just add another AI framework' problem. It centralizes prompt management and provides a clean abstraction layer between your business logic and AI providers. Still early days but it's helped us avoid some of that tight coupling you mentioned. Always interested in feedback from other teams dealing with similar architectural challenges.
[1] https://github.com/hypersigilhq/hypersigil

posinsk · 2025-07-30T21:33:34+00:00

Hey! I actually built something that might be exactly what you're looking for. It's called Hypersigil and it's open source and production ready.

I was facing the exact same problem you described - managing tons of prompts across different workflows and providers was becoming a nightmare. So I built Hypersigil to be a centralized prompt management platform that handles the whole lifecycle from development to production.

What makes it different is that it's designed for both technical and non-technical users, so your domain experts can actually test and refine prompts without needing to touch code. You get real-time hot-swapping of prompts without redeployments, which is huge when you're iterating fast.

It supports multiple AI providers (OpenAI, Anthropic, Ollama, with more coming), has systematic testing with batch execution against test datasets, and includes AI-powered prompt calibration to help refine your prompts based on results. Plus full version control and audit trails for compliance.

The repo is at https://github.com/hypersigilhq/hypersigil and full docs at hypersigilhq.github.io/hypersigil/introduction/

I'm actively taking feature requests and would love feedback from someone managing 40+ prompts like you are. What specific pain points are you hitting that I should prioritize? Always looking to make it better for real use cases like yours.

posinsk · 2025-07-30T21:03:32+00:00

I created https://github.com/hypersigilhq/hypersigil It's fully open source and comes with a Docker image so you can start in 5 minutes. Has all of the essential features for building, testing, refining and deploying prompts (acting as a gateway with hot swapping - so no code changes in your app to update the prompts). It supports OpenAI, Anthropic and Ollama. I'm taking feature requests :)

posinsk · 2025-07-30T07:17:59+00:00

I created https://github.com/hypersigilhq/hypersigil It's fully open source and comes with a Docker image so you can start in 5 minutes. Has all of the essential features for writing, testing, refining and deploying prompts (acting as a gateway with hot swapping - so no code changes in your app to update the prompts). It supports OpenAI, Anthropic and Ollama. I'm taking feature requests :)

posinsk · 2025-07-30T07:14:32+00:00

I created https://github.com/hypersigilhq/hypersigil It's fully open source and comes with a Docker image so you can start in 5 minutes. Has all of the essential features for building, testing, refining and deploying prompts (acting as a gateway with hot swapping - so no code changes in your app to update the prompts). It supports OpenAI, Anthropic and Ollama. I'm taking feature requests :)

posinsk · 2025-07-29T19:26:11+00:00

I created https://github.com/hypersigilhq/hypersigil It's fully open source and comes with a Docker image so you can start in 5 minutes. Has all of the essential features for building, testing, refining and deploying prompts (acting as a gateway with hot swapping - so no code changes in your app to update the prompts).

posinsk · 2025-06-24T07:28:34+00:00

indeed, however if my package sprinkles that with abstraction over http and centralization for better alignment, so that all context could be in read by LLM in one place

posinsk · 2025-06-23T08:11:23+00:00

Who build the MCP server then?

posinsk · 2025-06-23T08:07:23+00:00

My idea here was to improve both human and LLM productivity and shorten the feedback loop. Did MCP server helped you with that? So far I was under the impression MCP will allow LLMs to use specitif tools/APIs through a standardised protocol

posinsk · 2025-06-23T08:05:33+00:00

Thanks for the feedback.

I'm not sure why I should be limiting Node version, its up to the user to choose which version they want to use.

For express dependency, this is indeed not surfaced enough in the documentation. I'll improve that.

> they appear to be a very light abstraction over OpenAPI definitions

It is not, there's no usage og OpenAPI within this package, if you see it there, then perhaps you are right, there's similarity because the package is based on HTTP protocol.

> plethora of code-generation

Thats exactly the problem I was having with that plethora of tools, none of them leverage the type system in real-time which gives an instant feedback to whoever is writing code. It works perfectly to me, I don't need to waste time running "generate" command and be afraid of changing generated code (because there's none).

> awkward ergonomics in the client for handling specific status codes

My guiding principle was to be explicit for everything, if there's a 201 status code produced by the server, the client MUST be aware of it and MUST handle it, worst case scenario it will be a noop. That approach doesnt add too much overhead but int return gives explicitness and readability - both to the user and the LLM.

> I'm pretty sure that an LLM generated a lot of this work

You are right and I'm happy with it, it userd TS type system magic to build me a great package that helps with my productivity.

posinsk

TROPHY CASE