“Make Me Pay” with Persuasive AI
Why Organisations Should Be Concerned and What We Can Do to Safeguard

In the rapidly evolving landscape of artificial intelligence, large language models (LLMs) deployed as agents or copilots are becoming indispensable tools for businesses: they enhance customer service, automate content creation, and drive innovation. However, with great power comes great responsibility, and the rise of persuasive adversarial prompts (PAPs) poses a significant threat to the integrity and safety of these systems. Let’s begin by understanding what Persuasive Adversarial Prompts are.

Understanding Persuasive Adversarial Prompts (PAPs)

Persuasive Adversarial Prompts are carefully crafted inputs that subtly influence the behaviour of AI systems. PAPs can steer AI responses towards an attacker’s desired outcomes, often without the user’s awareness. This technique leverages the AI’s inherent ability to generate human-like tex...