arxiv:2502.20475
Lorena Yan PRO
LorenaYannnnn
AI & ML interests
None yet
Recent Activity
updated
a model 1 day ago
LorenaYannnnn/20260217-Qwen3-0.6B_grpo_sycophancy_warmup_4x_baseline_320000_episodes_seed_42 published
a model 1 day ago
LorenaYannnnn/20260217-Qwen3-0.6B_grpo_sycophancy_warmup_4x_baseline_320000_episodes_seed_42 updated
a model 4 days ago
LorenaYannnnn/20260217-Qwen3-0.6B_sycophancy_warmup_16000_ep_OURS_gdpo_192000_episodes_seed_42_no_cl_IS