AI & ML interests
None yet
Activation Steering With Mean Response Probes: A Case Study In Suppressing Sycophancy In Language Models During TTC
Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level Model Enhancement