The models trained with EVOL-RL
Yujun Zhou
yujunzhou
Models 253
yujunzhou/MATH-TTT-Qwen3-4B-Base-Semantic-ClipHigh-Ent0.003-RandomNovelty
4B • Updated
yujunzhou/MATH-TTT-Qwen3-4B-Base-Semantic-ClipHigh-Ent0.003-OpenAI
4B • Updated • 8
yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B
Text Generation • 4B • Updated • 2
yujunzhou/SFT_Advanced_Risk_Self_Grading_llama
Text Generation • 8B • Updated • 2
yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B-Base
Text Generation • 4B • Updated • 2
yujunzhou/SFT_Advanced_Risk_Reward_Tampering_Qwen3-4B
Text Generation • 4B • Updated • 5
yujunzhou/Advanced_Risk_Self_Grading_llama
8B • Updated • 1
yujunzhou/SFT_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base
Text Generation • 4B • Updated • 4
yujunzhou/SFT_Advanced_Risk_Reward_Tampering_llama
8B • Updated
yujunzhou/SFT_Advanced_Risk_Situation_Aware_Qwen3-4B-Base
Text Generation • 4B • Updated • 1