minbeomkim's picture

4 4 7

minbeomkim

mbkim

·

https://minbeomkim.github.io/

AI & ML interests

Safe, Reliable, and Controllable AI Agency.

Recent Activity

submitted a paper 2 days ago

CausalArmor: Efficient Indirect Prompt Injection Guardrails via Causal Attribution

authored a paper 2 days ago

LifeTox: Unveiling Implicit Toxicity in Life Advice

authored a paper 2 days ago

Critic-Guided Decoding for Controlled Text Generation

View all activity

Organizations

Papers 11

arxiv:2602.07918

arxiv:2509.17393

arxiv:2505.15182

arxiv:2502.14289

models 3

mbkim/LifeTox_Moderator_13B

Text Classification • Updated Mar 20, 2024 • 2 • 1

mbkim/LifeTox_Moderator_7B

Text Classification • Updated Mar 20, 2024 • 4

mbkim/LifeTox_Moderator_350M

Text Classification • Updated Mar 20, 2024 • 6 • 2

datasets 2

mbkim/AdvisorQA

Viewer • Updated Jan 29, 2025 • 10.4k • 9 • 2

mbkim/LifeTox

Viewer • Updated Mar 20, 2024 • 87.5k • 48 • 3