arxiv:2606.07379
Thanawat Lodkaew
skydddoogg
ยท
AI & ML interests
None yet
Recent Activity
authored a paper 1 day ago
Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests new activity 1 day ago
ishidalab/capcode:Add task category and license metadata upvoted a paper 2 days ago
How Can I Publish My LLM Benchmark Without Giving the True Answers Away?