Epoch AI has introduced MirrorCode, a new benchmark designed to evaluate whether AI models can reconstruct entire software programs without seeing the original source code. This test requires models to recreate complex codebases based only on high-level descriptions and functional requirements. While current models show promise in smaller tasks,...
Läs hela artikeln hos källan.
Kommentarer (0)
Inga kommentarer ännu. Bli först med att kommentera!