Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Paper • 2502.05171
GenQA is a large-scale synthetic instruction-tuning dataset created with a novel generator prompt strategy designed to ensure quality and diversity.