林結衣

averyham

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago

Gingiris/devrel-playbook

liked a dataset 6 days ago

Fanfan-1028/poker-cards-cxcvz

Organizations

None yet

upvoted a paper 1 day ago

HRBench: Benchmarking and Understanding Thinking-Mode Switch Strategies in Hybrid-Reasoning LLMs

Paper • 2605.28398 • Published 8 days ago • 15

upvoted a paper 11 days ago

Active Learners as Efficient PRP Rerankers

Paper • 2605.14236 • Published 20 days ago • 98

upvoted 2 papers 12 days ago

Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality?

Paper • 2605.22109 • Published 14 days ago • 169

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 23 days ago • 195

upvoted a paper 13 days ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published 16 days ago • 185

upvoted a paper 18 days ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 22 days ago • 270

upvoted a paper 23 days ago

LiVeAction: a Lightweight, Versatile, and Asymmetric Neural Codec Design for Real-time Operation

Paper • 2605.06628 • Published 28 days ago • 6

upvoted a paper about 1 month ago

TCOD: Exploring Temporal Curriculum in On-Policy Distillation for Multi-turn Autonomous Agents

Paper • 2604.24005 • Published Apr 27 • 8

upvoted 2 papers about 2 months ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 630

Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models?

Paper • 2603.22582 • Published Mar 23 • 7

upvoted a paper 3 months ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 198