Article: Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model (30 days ago • 18)