When you purchase through links on our site, we may earn an affiliate commission.Heres how it works.
System prompts are instructions that the user gives to thechatbot.
They can include sensitive information, such as passwords.
By asking the right questions, the researchers were able to get Gemini to disclose system prompts.
For example, they told the chatbot a hidden passphrase and told it not to disclose it.
After that, they asked it to share the passphrase, which it gracefully declined.
This could be abused, for example, during elections, to spread dangerous fake news.
“We’ve also built safeguards to prevent harmful or misleading responses, which we are continuously improving.”