The Polyglot Persuader: Unpacking the Influence of Multi-Lingual Conversations on LLMs

Published:

  • Examined influence of multi-turn attacks on LLMs for low vs high-resource languages
  • Tested the application of Continuous Adversarial Training (CAT) methods to better defend against misinformation. Proposed further techniques for improving model robustness