·
AI & ML interests
I train and eval pretty ok
Recent Activity
Organizations
-
-
-
-
-
-
-
-
-
-
-
published an
article about 1 month ago view article Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments
view article Budget Alignment: Making Models Reason in the User’s Language
published an
article over 1 year ago view article What We Learned About LLM/VLMs in Healthcare AI Evaluation: