Smaller, Weaker, Yet Better- Training LLM Reasoners via Compute-Optimal Sampling.pdf 1.11MB Smaller, Weaker, Yet Better- Training LLM Reasoners via Compute-Optimal Sampling.pptx 0.83MB