Barney Gossett
@barneygossett Member since November 25, 2024
Wormer, Netherlands
genmoai They introduced a new benchmarking approach called "functional variants," which tests a model’s ability to reason instead of just memorize.
@barneygossett Member since November 25, 2024
genmoai They introduced a new benchmarking approach called "functional variants," which tests a model’s ability to reason instead of just memorize.