DEV Community

이령

Korean indie hacker 💪 Gym + Code every day. Obsessed with digging deep into AI security, one deterministic proof at a time.

Your AI agent's leak risk depends more on the model than the prompt

5 min read

I tested whether "just paste the leak into your AI to fix it" actually works. It depends on the model — here's what broke.

4 min read
What my leak scanner catches — and the exact line where it stops

3 min read
My AI agent leaked a secret in a way my own scanner missed. Here's what I learned about what these tools can and can't catch.

2 min read
What an AI agent leak looks like — and what my scanner can (and can't) catch

4 min read
I tested 5 LLMs for prompt-injection leaks. Same code, 0% to 90%.

3 min read
A real prompt-injection case — and the blind spot it exposed in my own scanner

1 min read
Three AI assistants, three vendors, one bug — the confused-deputy pattern that keeps shipping

4 min read
rojaprove now ships two live targets you can test it against before trusting it

4 min read
Your user typed nothing malicious. Your AI leaked their data anyway.

2 min read