Surprisal Guided Selection
Training at test time for kernel optimization.
Surprisal-Guided Selection: Compute-Optimal Test-Time Strategies for Execution-Grounded Code Generation • Paper • 2602.07670 • Published 5 days ago • 1
Jarrodbarnes/KernelBench-RLVR-120b • Text Generation • 117B • Updated 2 days ago • 30 • 1

OpenSec: Incident Response Agent Calibration
OpenSec is a dual-control RL environment, dataset, and evaluation suite that measures agent calibration on incident response tasks.
OpenSec: Measuring Incident Response Agent Calibration Under Adversarial Evidence • Paper • 2601.21083 • Published 15 days ago • 1
Jarrodbarnes/opensec-seeds • Viewer • Updated 3 days ago • 380 • 122 • 1
Jarrodbarnes/opensec-gdpo-4b • Text Generation • 4B • Updated about 3 hours ago • 74
RL OpenSec Environment 🔐 • Sleeping