About 13,800,000 results
Open links in new tab
  1. New secret math benchmark stumps AI models and PhDs alike

  2. Testing AI systems on hard math problems shows they still …

  3. AI’s math problem: FrontierMath benchmark shows how far …

  4. FrontierMath | Epoch AI

  5. [2411.04872] FrontierMath: A Benchmark for Evaluating Advanced ...

  6. Epoch AI Launches FrontierMath AI Benchmark to Test …

  7. FrontierMath: Evaluating Advanced Mathematical Reasoning in AI …

  8. FrontierMath: A Benchmark for Evaluating Advanced …

  9. FrontierMath: A benchmark for evaluating advanced ... - Hacker News

  10. Some results have been removed