Meta, OpenAI, Microsoft, and other AI companies create their own internal benchmarks as new models approach or exceed 90% accuracy on existing public tests (Cristina Criddle/Financial Times)

1 week ago 3

Cristina Criddle / Financial Times:
Meta, OpenAI, Microsoft, and other AI companies create their own internal benchmarks as new models approach or exceed 90% accuracy on existing public tests — Rapidly advancing technology is surpassing current methods of evaluating and comparing large language models

Read Entire Article