From Pixels to Concepts: Do Segmentation Models Understand What They Segment? Paper • 2605.09591 • Published 5 days ago • 1
The Extrapolation Cliff in On-Policy Distillation of Near-Deterministic Structured Outputs Paper • 2605.08737 • Published 6 days ago • 2
FeatCal: Feature Calibration for Post-Merging Models Paper • 2605.13030 • Published 2 days ago • 4
Orthrus: Memory-Efficient Parallel Token Generation via Dual-View Diffusion Paper • 2605.12825 • Published 3 days ago • 7
Many-Shot CoT-ICL: Making In-Context Learning Truly Learn Paper • 2605.13511 • Published 2 days ago • 27
One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue Paper • 2605.05630 • Published 3 days ago • 10
Debiased Model-based Representations for Sample-efficient Continuous Control Paper • 2605.11711 • Published 3 days ago • 8
Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs Paper • 2605.12460 • Published 3 days ago • 15
PASA: A Principled Embedding-Space Watermarking Approach for LLM-Generated Text under Semantic-Invariant Attacks Paper • 2605.10977 • Published 6 days ago • 9
LoopUS: Recasting Pretrained LLMs into Looped Latent Refinement Models Paper • 2605.11011 • Published 5 days ago • 9
Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States Paper • 2605.07579 • Published 7 days ago • 13
δ-mem: Efficient Online Memory for Large Language Models Paper • 2605.12357 • Published 3 days ago • 101
Releasing the largest multilingual open pretraining dataset Article • Pclanglais • Nov 13, 2024 • 107
TextLDM: Language Modeling with Continuous Latent Diffusion Paper • 2605.07748 • Published 7 days ago • 23
SpecBlock: Block-Iterative Speculative Decoding with Dynamic Tree Drafting Paper • 2605.07243 • Published 7 days ago • 4