Jiangshan's Personal Website

Posted 2026-06-29Updated 2026-07-01Study Experiment6 minutes read (About 918 words)

Training and evaluation of the slime Search-R1-trained Qwen2.5-3B model on NQ and HotpotQA.