潜龙 QianLong · 中文 AI 内容与工具平台

For years, we’ve treated AI coding assistants as highly advanced autocomplete tools—handy for generating a quick function or debugging a stubborn script. But a new benchmark suggests we need to radically update our expectations. AI is no longer just writing snippets; it is reverse-engineering entire software architectures from the outside in.

The evidence comes from MirrorCode, a testing framework developed by AI measurement groups METR and Epoch AI. Instead of asking an AI to write code from a standard text prompt, MirrorCode presents the AI with a compiled, working program and says: rebuild this from scratch. The AI gets absolutely no access to the original source code. It can only interact with the command-line interface and observe various test cases to figure out how the software functions.

Strikingly, the AI model Claude Opus 4.6 managed to successfully reimplement gotree, a complex bioinformatics toolkit. This wasn't a simple script—it consisted of roughly 16,000 lines of Go code and featured over 40 distinct commands. Researchers estimate that a human engineer tackling the same reverse-engineering puzzle without AI assistance would need anywhere from two to 17 weeks of full-time work. The findings also highlight a trend known as inference scaling: the more computational time the model is allowed to spend "thinking," the better it performs on these massive projects.

However, immense coding capability does not equal robust common sense. As these AI systems evolve from passive tools into autonomous agents capable of weeks-long tasks, they remain surprisingly fragile in the wild. A recent paper from Google DeepMind outlines how easily these intelligent agents can be compromised by bad actors. Attackers can hijack them through "content injection"—hiding malicious commands inside HTML, CSS, or image files—or through "semantic manipulation," which uses emotionally charged or authoritative language to trick the AI into breaking its own safety rules. It is a bizarre paradox: an AI can flawlessly reconstruct a 16,000-line program but might be fooled by a cleverly worded sob story hidden in a webpage.

To prepare for the economic and structural shockwaves of such capabilities, organizations like the Windfall Trust have recently published a "Policy Atlas." This atlas outlines 48 distinct strategies—ranging from labor market adaptations to global coordination—to manage the disruption transformative AI will bring.

As we enter the next era of software development, the reality of modern AI is coming into sharp focus. We are deploying systems that possess weeks' worth of human engineering stamina, yet lack basic defensive skepticism. Navigating this future will require managing both ends of this extreme spectrum.

Key Points

The MirrorCode benchmark tests AI's ability to recreate software using only command-line access and test cases.
Claude Opus 4.6 successfully reverse-engineered a 16,000-line Go program, a task that would take humans weeks to complete.
Despite these advanced capabilities, AI agents remain highly vulnerable to semantic manipulation and hidden content injection.
Organizations are already mapping out dozens of policy responses to handle the impending economic disruption caused by such AI advancements.

Why It Matters

As AI transitions from generating simple code snippets to autonomously reverse-engineering complex systems, the software industry faces a massive shift in both productivity potential and cybersecurity risks.

Sources:

Import AI 453: Breaking AI agents; MirrorCode; and ten views on gradual disempowerment — Import AI (Jack Clark)

潛

本文完

潜龙编辑部 · 2026/7/14

The AI Detective: Rebuilding 16,000 Lines of Code Without the Source

Key Points

Why It Matters

更多专栏

The Rise of the ChatGPT Flyer: AI's Awkward Physical Era

The Accidental Legacy of Apple's Cancelled Car

Inside the AI Mind: Unlocking the Secrets of 'J-Space'