New AI Benchmark: Measuring Real-World Job Performance