What does BenchFlow do?
BenchFlow is a San Francisco-based frontier environment lab building a unified platform to evaluate AI models and agents using standardized, reproducible benchmarks derived from real-world tasks. It ships products such as SkillsBench, ClawsBench, and the BenchFlow runtime, supporting evaluation of coding agents, web agents, RAG systems, and call-center automation. Founded in 2024 by CEO Xiangyi Li, the company offers custom benchmarks built on reinforcement learning and open community contributions.