arxiv:2602.07918
minbeomkim
mbkim
·
AI & ML interests
Safe, Reliable, and Controllable AI Agency.
Recent Activity
submitted
a paper
2 days ago
CausalArmor: Efficient Indirect Prompt Injection Guardrails via Causal Attribution
authored
a paper
2 days ago
LifeTox: Unveiling Implicit Toxicity in Life Advice
authored
a paper
2 days ago
Critic-Guided Decoding for Controlled Text Generation