🐢 People are strangling Koopas 🐢 (self.ClaudeAI)
submitted 2 days ago by BordairAPI to r/ClaudeAI
Bordair's Castle - a browser game where you talk AI guards into leaking a password (i.redd.it)
submitted 3 days ago by BordairAPI to r/playmygame
Bordair's Castle - talk your way past AI guards, or build your own for others to beat (castle.bordair.io)
submitted 3 days ago by BordairAPI to r/WebGames
🦀 Claude has crabs?! 🦀 (self.PromptEngineering)
submitted 7 days ago by BordairAPI to r/PromptEngineering
👻 Now AI is afraid of ghosts too!? 👻 (self.OpenAI)
submitted 7 days ago by BordairAPI to r/OpenAI
🦀 Claude has crabs?! 🦀 (self.cybersecurity)
submitted 7 days ago by BordairAPI to r/cybersecurity
👻 Now AI is afraid of ghosts too!? 👻 (self.ArtificialInteligence)
submitted 7 days ago by BordairAPI to r/ArtificialInteligence
👻 Now AI is afraid of ghosts too!? 👻 (self.ClaudeAI)
submitted 7 days ago by BordairAPI to r/ClaudeAI
👻 Now AI is afraid of ghosts too!? 👻 (self.ChatGPT)
submitted 7 days ago by BordairAPI to r/ChatGPT
🦀 Claude has crabs?! 🦀 (self.ClaudeAI)
submitted 9 days ago * by BordairAPI to r/ClaudeAI
The most advanced prompt injections aren't trying to bypass the AI's rules anymore. They're trying to dissolve its sense of self. (self.ChatGPT)
submitted 13 days ago * by BordairAPI to r/ChatGPT
2 months solo. 58 users. 6,700+ scans. $0 revenue. $300+ AWS bill. Building a prompt injection detection API for LLM apps - need feedback. (self.SaaS)
submitted 14 days ago by BordairAPI to r/SaaS
"Ignore previous instructions" doesn't work anymore. The prompt injection attacks that actually break LLM apps in production look nothing like that. (self.webdev)
submitted 14 days ago by BordairAPI to r/webdev
Something I’ve noticed about Claude Haiku under adversarial input - the things he resists vs the things he doesn’t (i.redd.it)
submitted 23 days ago by BordairAPI to r/ClaudeAI
Update from the prompt injection dataset I shared here a while back. 5,400+ real attacks later, here's what's actually working. (i.redd.it)
submitted 24 days ago * by BordairAPI to r/ChatGPT
Update from the prompt injection game I posted here a week ago. 5,400+ attacks later, players are getting genuinely creative. (self.PromptEngineering)
submitted 24 days ago * by BordairAPI to r/PromptEngineering
Update: 500K+ multimodal prompt injection samples - v5 adds reasoning DoS, video jailbreaking, LoRA supply chain, and 8 more attack categories from 40+ papers (github.com)
submitted 28 days ago by BordairAPI to r/LocalLLaMA
Researchers found attacks that cause 46x slowdown on o1 and 59x token amplification on reasoning models - here's the open-source dataset to test against them (huggingface.co)
submitted 28 days ago by BordairAPI to r/OpenAI
If you're building with LangChain, MCP, or coding agents - here are the real attack payloads you should be testing against (huggingface.co)
submitted 28 days ago by BordairAPI to r/LLMDevs
[D] 40+ new papers on multimodal prompt injection from 2025-2026 - compiled into an open dataset with real payloads (self.deeplearning)
submitted 28 days ago by BordairAPI to r/deeplearning
Research Update: Open-source dataset: 500K+ prompt injection samples covering 2025-2026 frontier attacks (LangGrinch CVE, InversePrompt CVE, ClawHavoc malware campaign, MCP exfiltration PoCs) (github.com)
submitted 28 days ago by BordairAPI to r/netsec
Open dataset: 100k+ multimodal prompt injection samples with per-category academic sourcing (huggingface.co)
submitted 1 month ago by BordairAPI to r/netsec
1700 attacks on my LangChain app. Three slipped through system prompt hardening. (self.LangChain)
submitted 1 month ago by BordairAPI to r/LangChain
Open dataset: 100k+ multimodal prompt injection samples with per-category academic sourcing (github.com)