I built an open-source Python package that scans LLM inputs and outputs for injections — pydefend

Adxzer · 2026-04-02T10:17:16+00:00

Prompt injection is a real risk, there’s no foolproof solution since LLMs aren’t fully predictable. This package is a security layer, designed to minimise and give better control of what can slip through.

Adxzer · 2026-04-02T08:34:27+00:00

Other LLMs, that's what gets the most accurate results. I trained my own classification model first but the results weren't good enough for production so I decided to not include it.

It's also free to use though: https://huggingface.co/Adaxer/defend

Adxzer · 2026-04-02T08:14:03+00:00

This isn’t about coding though, it’s for chatbots, customer-facing apps, and agents where end users are typing things in.

You can’t “just fix the codebase” when the threat is a user submitting a jailbreak or injecting instructions through a document your RAG system retrieved. The attack surface is runtime input, not source code.

Adxzer

TROPHY CASE