Agent research, tool calling and ReAct
Note A merged model Qwen3-4B-I-1209 with the base model improve on agent task.
Note Fine tuned with grpo