Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal Reasoning Paper • 2606.31825 • Published 5 days ago • 17
MRPO Collection This collection hosts the MRPO series introduced in the paper Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal Reasoning • 4 items • Updated 1 day ago
ToxReason: A Benchmark for Mechanistic Chemical Toxicity Reasoning via Adverse Outcome Pathway Paper • 2604.06264 • Published Apr 7 • 4
Learning from Negative Samples in Generative Biomedical Entity Linking Paper • 2408.16493 • Published Aug 29, 2024 • 1
ASGuard: Activation-Scaling Guard to Mitigate Targeted Jailbreaking Attack Paper • 2509.25843 • Published Apr 14 • 20
The Curious Case of Analogies: Investigating Analogical Reasoning in Large Language Models Paper • 2511.20344 • Published Nov 25, 2025 • 14
Thinking Sparks!: Emergent Attention Heads in Reasoning Models During Post Training Paper • 2509.25758 • Published Sep 30, 2025 • 25
HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches Paper • 2508.08088 • Published Aug 11, 2025 • 29