Xinfeng Li
LetterJohn
AI & ML interests
Trustworthy AI, AI for Security & Privacy
Recent Activity
upvoted a paper 29 days ago
Internal Safety Collapse in Frontier Large Language Models upvoted a paper about 2 months ago
BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning upvoted a paper 3 months ago
AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security