Sequoia China launches xbench, a new AI benchmark! They've released a paper, "xbench: Tracking Agents Productivity, Scaling with Profession-Aligned Real-World Evaluations." This benchmark features a dual-track evaluation system and a long-term evaluation mechanism.
Editor:
Gao Han