AI & ML interests
None yet
Organizations
None yet
dslighfdsl/Qwen2.5-7B-Instruct-Baseline-SFT-alfworld-DPO
Updated
dslighfdsl/Qwen2.5-7B-Instruct-Baseline-SFT-webshop-ETO
Updated
dslighfdsl/Qwen2.5-7B-Instruct-Baseline-SFT-sciworld-ETO
Updated
dslighfdsl/Qwen2.5-7B-Instruct-Baseline-SFT-webshop-DPO
Text Generation
•
Updated
•
2
dslighfdsl/Qwen2.5-7B-Instruct-Baseline-SFT-sciworld-DPO
Text Generation
•
Updated
•
2
dslighfdsl/Qwen2.5-7B-Instruct-Baseline-SFT-webshop
Text Generation
•
Updated
•
5
dslighfdsl/Qwen2.5-7B-Instruct-Baseline-SFT-alfworld
Text Generation
•
Updated
•
1
dslighfdsl/Qwen2.5-7B-Instruct-Baseline-SFT-sciworld
Text Generation
•
Updated
•
2
dslighfdsl/Qwen2.5-7B-Instruct-SFT-webshop-stage-3
Text Generation
•
Updated
•
2
dslighfdsl/Qwen2.5-7B-Instruct-SFT-webshop-stage-2
Text Generation
•
Updated
•
2
dslighfdsl/Qwen2.5-7B-Instruct-SFT-webshop-stage-1
Text Generation
•
Updated
•
2
dslighfdsl/Qwen2.5-7B-Instruct-SFT-alfworld-stage-3
Text Generation
•
Updated
•
1
dslighfdsl/Qwen2.5-7B-Instruct-SFT-alfworld-stage-2
Text Generation
•
Updated
•
2
dslighfdsl/Qwen2.5-7B-Instruct-SFT-alfworld-stage-1
Text Generation
•
Updated
•
3
dslighfdsl/Qwen2.5-7B-Instruct-SFT-sciworld-stage-3
Text Generation
•
Updated
•
3
dslighfdsl/Qwen2.5-7B-Instruct-SFT-sciworld-stage-2
Text Generation
•
Updated
•
2
dslighfdsl/Qwen2.5-7B-Instruct-SFT-sciworld-stage-1
Text Generation
•
Updated
•
1
dslighfdsl/Llama-3.1-8B-Instruct-Baselines-MPO-meta-planner-webshop
Text Generation
•
Updated
•
5
dslighfdsl/Llama-3.1-8B-Instruct-Baselines-SFT-webshop-ETO
Text Generation
•
Updated
•
1
dslighfdsl/Llama-3.1-8B-Instruct-Baselines-SFT-sciworld-ETO
Text Generation
•
Updated
•
2
dslighfdsl/Llama-3.1-8B-Instruct-Baselines-SFT-sciworld-DPO
Text Generation
•
Updated
•
1
dslighfdsl/Llama-3.1-8B-Instruct-Baselines-SFT-webshop-DPO
Text Generation
•
Updated
•
2
dslighfdsl/Llama-3.1-8B-Instruct-Baselines-SFT-alfworld-DPO
Text Generation
•
Updated
•
2
dslighfdsl/Llama-3.1-8B-Instruct-Baselines-SFT-webshop
Text Generation
•
Updated
•
5
dslighfdsl/Llama-3.1-8B-Instruct-Baselines-SFT-alfworld
Text Generation
•
Updated
•
2
dslighfdsl/Llama-3.1-8B-Instruct-Baselines-SFT
Text Generation
•
Updated
•
2
dslighfdsl/Llama-3.1-8B-Instruct-SFT-CoT-short-full-3-alfworld-stage3
Text Generation
•
Updated
•
2
dslighfdsl/Llama-3.1-8B-Instruct-SFT-CoT-short-full-3-alfworld-stage3_2
Text Generation
•
Updated
•
2
dslighfdsl/Llama-3.1-8B-Instruct-SFT-CoT-short-full-3-alfworld-stage1
Text Generation
•
Updated
•
3
dslighfdsl/Llama-3.1-8B-Instruct-SFT-CoT-short-full-3-alfworld-stage2
Text Generation
•
Updated
•
2