DONGRYEOLLEE

drlee1

DONGRYEOLLEE1

AI & ML interests

None yet

Recent Activity

upvoted a paper about 12 hours ago

Learning from the Self-future: On-policy Self-distillation for dLLMs

upvoted a paper 2 days ago

FastContext: Training Efficient Repository Explorer for Coding Agents

upvoted a paper 3 days ago

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

View all activity

Organizations

None yet

upvoted a paper about 12 hours ago

Learning from the Self-future: On-policy Self-distillation for dLLMs

Paper • 2606.18195 • Published 3 days ago • 68

upvoted a paper 2 days ago

FastContext: Training Efficient Repository Explorer for Coding Agents

Paper • 2606.14066 • Published 7 days ago • 81

upvoted a paper 3 days ago

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published 8 days ago • 89

upvoted a paper 4 days ago

MiniMax Sparse Attention

Paper • 2606.13392 • Published 8 days ago • 138

liked a model 4 days ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

Text Generation • 32B • Updated Mar 15 • 1.31M • • 768

liked a model 7 days ago

jinaai/jina-embeddings-v5-text-small

Feature Extraction • 0.6B • Updated Apr 15 • 361k • 178

upvoted a paper 7 days ago

MoBA: Mixture of Block Attention for Long-Context LLMs

Paper • 2502.13189 • Published Feb 18, 2025 • 19

upvoted a paper 8 days ago

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Paper • 2606.09730 • Published 11 days ago • 50

liked a dataset 9 days ago

m-a-p/CodeFeedback-Filtered-Instruction

Viewer • Updated Feb 26, 2024 • 157k • 16.9k • 204

liked a model 9 days ago

ny1031/Qwen3-1.7B-SFT-RLVR-IF

Text Generation • 2B • Updated May 6 • 6 • 1

liked a dataset 9 days ago

allenai/tulu-3-sft-mixture

Viewer • Updated Dec 2, 2024 • 939k • 20.8k • 251

upvoted 2 papers 13 days ago

Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning

Paper • 2605.30039 • Published 21 days ago • 20

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Paper • 2606.02437 • Published 18 days ago • 230

upvoted a paper 16 days ago

K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts

Paper • 2606.02404 • Published 18 days ago • 56

liked a model 17 days ago

Qwen/Qwen3.5-2B

Image-Text-to-Text • 2B • Updated Mar 2 • 1.55M • • 306

upvoted 2 papers 24 days ago

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

Paper • 2605.19577 • Published about 1 month ago • 59

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps

Paper • 2605.16928 • Published May 16 • 96

liked a model 28 days ago

bytedance-research/Lance

Any-to-Any • Updated 21 days ago • 3.09k • 1.05k

upvoted a paper 28 days ago

Orthrus: Memory-Efficient Parallel Token Generation via Dual-View Diffusion

Paper • 2605.12825 • Published May 12 • 12

upvoted a paper 29 days ago

Refusal in Language Models Is Mediated by a Single Direction

Paper • 2406.11717 • Published Jun 17, 2024 • 14

DONGRYEOLLEE

AI & ML interests

Recent Activity

Organizations

drlee1's activity