Several trained models to compare the differences between each method. Each model has a complete description of hyperparams with wandb reports.
G
G-reen
AI & ML interests
Currently working on https://github.com/Green0-0/propagate (ES Trainer for LLMs). Also interested in TPU training, and building better synthetic datasets.
Recent Activity
updated
a model
about 3 hours ago
G-reen/gemma-2-2b-it-fft-simpo-adj
published
a model
about 3 hours ago
G-reen/gemma-2-2b-it-fft-simpo-adj
updated
a model
about 5 hours ago
G-reen/gemma-2-2b-it-fft-simpo
Organizations
None yet