Practical superiority on current benchmarks reflects benchmark limitations rather than genuine seman...