Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

How about fixing the most basic things first? Claude is very vulnerable when it comes to injections. Very scary for data processing. How corps dares to use Cloud code is mind-boggling. I mean, you can give Claude simple tasks but if the context is like "Name my cat" it gets derailed immediately no matter what the system prompt is.




“Name my cat” is a very common prompt in corps

It is a test to see if you can break out of the prompt. You have a system prompt like. Bla bla you are a pro AI-translator bla bla bullet points. But then it breaks when the context is like "name my cat" or whatever. It follows those instructions...

I know, I was being facetious - do not put that in the prompt :)



Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: