Shiting Huang
chocckaka
AI & ML interests
None yet
Recent Activity
authored a paper 14 days ago
SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents authored a paper 14 days ago
SaaSBench: Exploring the Boundaries of Coding Agents in Long-Horizon Enterprise SaaS Engineering upvoted a paper about 1 month ago
AsyncTool: Evaluating the Asynchronous Function Calling Capability under Multi-Task Scenarios