2606.05566
2026-06-05
cs.AI
cs.CR
GuardNet: Ensemble Strategies of Shallow Neural Networks for Robust Prompt Injection and Jailbreak Detection
Paulo Ricardo Ferreira Neves, Edson Rodrigues da Cruz Filho, Paulo Henrique Eleuterio Falsetti, João Vitor Pavan, Ian Degaspari, Henrique Vieira Laturrague, Patrick Vieira Laturrague, Guilherme Nielsen Dias, Marccello Wilson Perez Berto, Gustavo Voltani Von Atzingen
Affiliations
Quickium Technology Ltd.; Federal University of São Carlos (UFSCar); Federal Institute of Education, Science and Technology of São Paulo (IFSP)
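AI Summary
Proposes GuardNet, a guardrail system built on an ensemble of shallow neural networks (BiLSTM). It achieves adversarial robustness through diverse example coverage and threshold calibration, and reaches performance competitive with lightweight detectors at low latency.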