AceCoder

community

https://jdf-prog.github.io/

DongfuJiang

jdf-prog

Recent Activity

DongfuJiang

authored 3 papers 18 days ago

RewardHarness: Self-Evolving Agentic Post-Training

Paper • 2605.08703 • Published May 9 • 10

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 23 days ago • 133

AutoLab: Can Frontier Models Solve Long-Horizon Auto Research and Engineering Tasks?

Paper • 2606.05080 • Published 21 days ago • 30

DongfuJiang

authored 4 papers about 1 month ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

Paper • 2604.05117 • Published Apr 6 • 36

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 265

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2604.12374 • Published Apr 14 • 37

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Paper • 2605.05242 • Published May 3 • 126

JasperHaozhe

authored 13 papers 2 months ago

Dr. Bench: A Multidimensional Evaluation for Deep Research Agents, from Answers to Reports

Paper • 2510.02190 • Published Jan 29 • 20

Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing

Paper • 2510.15349 • Published Oct 17, 2025

TR-DQ: Time-Rotation Diffusion Quantization

Paper • 2503.06564 • Published Mar 9, 2025

From Illusion to Intention: Visual Rationale Learning for Vision-Language Reasoning

Paper • 2511.23031 • Published Nov 28, 2025 • 1

CogDoc: Towards Unified thinking in Documents

Paper • 2512.12658 • Published Dec 14, 2025

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Paper • 2601.15876 • Published Jan 22 • 92

Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining

Paper • 2603.11103 • Published Mar 11 • 9

SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding

Paper • 2603.16124 • Published Mar 17 • 3

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 148

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published Apr 13 • 103

Team members 4
