- Published on
LLM-based applications face security challenges in form of prompt injections and jailbreaks. This project reviews the key architectural improvements underpinning ModernBERT, and implements fine-tuning for discriminating malicious prompts. PangolinGuard closely approximates the performance of Claude 3.7 on a mixed benchmark, while maintaining low latency (< 40ms).