经过计算最优抽样训练大言语模型推理器 更弱 却更优 更小
一、论断写在前面论文题目,Smaller,Weaker,YetBetter,TrainingLLMReasonersviaCompute,OptimalSampling论文链接,https,arx...
共1页 1条
一、论断写在前面论文题目,Smaller,Weaker,YetBetter,TrainingLLMReasonersviaCompute,OptimalSampling论文链接,https,arx...