openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 6.15M • • 3.09k
chen25star/paraphrase-multilingual-MiniLM-L12-v2 Sentence Similarity • 0.1B • Updated 18 days ago • 19 • 1
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 29 days ago • 204
Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining Paper • 2605.14747 • Published May 14 • 145
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published May 12 • 195
Stage-adaptive Token Selection for Efficient Omni-modal LLMs Paper • 2605.20035 • Published about 1 month ago • 5
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published May 13 • 271
Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation Paper • 2605.11739 • Published May 13 • 59
Pion: A Spectrum-Preserving Optimizer via Orthogonal Equivalence Transformation Paper • 2605.12492 • Published May 12 • 6