Your Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systems Paper • 2605.26302 • Published 6 days ago • 27
Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability Paper • 2306.03715 • Published Jun 6, 2023
Exploring Model Dynamics for Accumulative Poisoning Discovery Paper • 2306.03726 • Published Jun 6, 2023
Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales? Paper • 2410.23856 • Published Oct 31, 2024 • 5
DeepInception: Hypnotize Large Language Model to Be Jailbreaker Paper • 2311.03191 • Published Nov 6, 2023 • 2
Co-Reward: Self-supervised Reinforcement Learning for Large Language Model Reasoning via Contrastive Agreement Paper • 2508.00410 • Published Aug 1, 2025 • 1
AgentForesight: Online Auditing for Early Failure Prediction in Multi-Agent Systems Paper • 2605.08715 • Published 22 days ago • 8
Towards Effective Evaluations and Comparisons for LLM Unlearning Methods Paper • 2406.09179 • Published Feb 25, 2025
Micro-Defects Expose Macro-Fakes: Detecting AI-Generated Images via Local Distributional Shifts Paper • 2605.09296 • Published 21 days ago • 4
Rethinking How to Remember: Beyond Atomic Facts in Lifelong LLM Agent Memory Paper • 2605.19952 • Published 12 days ago • 9
AgentHijack: Benchmarking Computer Use Agent Robustness to Common Environment Corruptions Paper • 2605.25707 • Published 6 days ago • 2
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory Paper • 2605.15128 • Published 17 days ago • 62
MEMO: Memory-Augmented Model Context Optimization for Robust Multi-Turn Multi-Agent LLM Games Paper • 2603.09022 • Published Mar 9 • 24