Science ❯ Computer Science ❯ Data Structures ❯ Graphs
Results cap below 90% overall accuracy, signaling room to improve grounded reasoning, table use, abstention.