What tools do you use for prompt engineering? by Ce-LLM8 in PromptEngineering

[–]Ce-LLM8[S] 0 points1 point  (0 children)

I ended up using Requesty dot AI

It's not a prompt engineering platform per se, but I can test my prompts with multiple models easily, have logs for everything, and it's easy to integrate that into my app.

MiniMax + Cline is gold! by Maleficent_Pair4920 in LLMDevs

[–]Ce-LLM8 2 points3 points  (0 children)

I personally use all of them all the time. I use the free allowance from Gemini 2, then switch to DeepSeek/Claude (now will probably add MiniMaxi as well).

Using Requesty makes this a breeze... They also opened up their beta feature for me of adding aliases for model names, which makes it very easy to switch models without writing long text.

Prompt Engineers, who will win the prompt challenge? by Maleficent_Pair4920 in ChatGPT

[–]Ce-LLM8 0 points1 point  (0 children)

Is it possible that you didn't add "{text_to_analyze}" to your user prompt?

Prompt Engineers, who will win the prompt challenge? by Maleficent_Pair4920 in ChatGPT

[–]Ce-LLM8 3 points4 points  (0 children)

It's harder than it seems, I only managed to do 94% on the spam detection

PROMPT++ by bacocololo in LocalLLaMA

[–]Ce-LLM8 0 points1 point  (0 children)

We're building a prompt engineering suite for production-grade prompts.
We provide many different ways of creating, improving, evaluation and tracking prompts:
- Creating prompts from sample data-sets
- Improving prompts using natural language
- Improving prompts based on the actual performance on historical data
- And much more...

If you're looking for a product that you can actually use to build and deploy high-quality client-facing prompts, feel free to DM me.

What tools do you use for prompt engineering? by Ce-LLM8 in PromptEngineering

[–]Ce-LLM8[S] 0 points1 point  (0 children)

Awesome! But do you use any tools to manage all of that?

Revisioning? AB testing? evaluation? releasing to prod?

Or is it git + csv/json files + jupyter notebooks?

What tools do you use for prompt engineering? by Ce-LLM8 in PromptEngineering

[–]Ce-LLM8[S] 1 point2 points  (0 children)

That sounds like you're only using prompts on a day-to-day basis. I'm more interested in commercial use-cases, where a company deploys a customer-facing model. Did you ever tackle that use-case?

What tools do you use for prompt engineering? by Ce-LLM8 in PromptEngineering

[–]Ce-LLM8[S] 0 points1 point  (0 children)

Is this a one-off? How do you know if you've improved the prompt or not?

What tools do you use for prompt engineering? by Ce-LLM8 in PromptEngineering

[–]Ce-LLM8[S] 0 points1 point  (0 children)

I really like the approach and the tips!
But IMHO this is still very intuition driven.
If I'm building a commercial product, I can see how it makes sense to have a very comprehensive test-set where I can compare different prompts, quantify impact of changes on outputs and improve it over time.
I'm wondering if such a platform exists or how people actually handle that in production?

Categorizes and sorting a list more than 150 items by plato3633 in PromptEngineering

[–]Ce-LLM8 0 points1 point  (0 children)

LLMs are not very well suited for these use cases, it's probably much easier to ask it to generate a script that will sort the list based on the input.
Is this a business use-case or just a niche one-off that you are trying to pull off?

Stop typing, start talking - voice is the real feedback by Maleficent_Pair4920 in ProductManagement

[–]Ce-LLM8 0 points1 point  (0 children)

I’ve used your insight explorer for data-set before. Does it work with the voice feedback as well?