LLM-based applications face critical security challenges in the form of prompt injections and jailbreaks. This project outlines the key architectural improvements underpinning ModernBERT and demonstrates how to fine-tune it to discriminate malicious prompts from benign ones. PangolinGuard closely approximates the performance of Claude 3.7 and Gemini Flash 2.0 on a mixed benchmark while maintaining low latency (<40ms).