Princeton researchers build AI system that autonomously runs quantum materials experiments in a robotic lab

Princeton's Qumus AI system autonomously designs, runs, and analyzes quantum materials experiments using robotics and computer vision. It built graphene flakes and a working transistor with minimal human involvement.

Categorized in: AI News Science and Research
Published on: May 27, 2026
Princeton researchers build AI system that autonomously runs quantum materials experiments in a robotic lab

Autonomous Lab System Conducts Quantum Materials Research Without Human Intervention

Researchers at Princeton University have built an AI system that autonomously designs, executes and analyzes laboratory experiments in quantum materials. The platform, called Qumus, combines large language models, robotics and computer vision to operate as what the team describes as the first "AI quantum materials experimentalist."

In demonstrations, Qumus created graphene flakes and fabricated atomically thin transistors inside a robotic mini-lab with minimal human involvement. A user could request "a graphene flake," and the system would interpret the request, check its material inventory, and autonomously carry out the fabrication steps until it produced the requested sample.

How the System Works

Qumus operates like a small research group. A lead AI agent orchestrates the work while specialized sub-agents handle planning, laboratory monitoring, device design and physical processing. The system consults experimental history, evaluates available materials and instruments, designs fabrication plans and executes them through robotic hardware.

The physical setup includes robotic arms, microscope systems, temperature-controlled stages and machine-vision systems that identify microscopic material flakes. Computer-vision models monitor the workspace continuously, allowing the system to detect problems in real time.

In one task, researchers asked Qumus to create a graphene flake larger than 200 square micrometers and erased its experimental history, forcing the system to start fresh. The AI independently explored fabrication parameters-substrate temperature, heating time, massage cycles and tape peel-off speed-over more than four hours until it succeeded. The system generated hypotheses, evaluated failures and refined parameters based on observations, similar to how an experienced human experimentalist works.

Error Detection and Correction

The system demonstrated closed-loop reasoning when handling unexpected problems. In one test, a researcher intentionally removed a chip that Qumus was actively processing. The system detected the missing chip using computer vision, confirmed the problem and generated a new plan to restart the experiment.

In another error scenario, a language model incorrectly labeled hexagonal boron nitride as graphene-a hallucination error common in generative AI. Qumus identified the inconsistency and restarted the process until it successfully produced the correct material.

Building a Transistor

The most complex demonstration involved fabricating a graphene transistor. In response to a request for a "graphene transistor," Qumus designed a multilayer device architecture using graphene and boron nitride flakes placed onto metal electrodes. The system searched its inventory, generated a device layout, selected suitable flakes, aligned them and performed dry-transfer stacking to assemble the device.

The entire process took about 90 minutes and involved roughly 30 procedural steps and 18 decision-making calls among AI agents. The resulting structure functioned as an atomically thin field-effect transistor, a basic building block of modern electronics.

Why This Matters for Quantum Materials

Two-dimensional quantum materials-ultrathin crystals only atoms thick-show unusual electrical and quantum behaviors. Since graphene's discovery in 2004, scientists have identified thousands of layered materials that could be peeled into atomically thin sheets and stacked into engineered structures.

Progress has been constrained by labor-intensive workflows. Producing usable material flakes often involves repeated cycles of mechanical exfoliation, microscope inspection, alignment and transfer. The process remains difficult to scale and heavily dependent on expert judgment. Qumus attempts to automate that entire chain.

Language Models Show Different Personalities

The researchers tested Qumus with language models from OpenAI, Google, Anthropic, xAI, Alibaba and DeepSeek. While all successfully completed experiments, they behaved differently in terms of caution, efficiency and willingness to act quickly.

Some models spent more time reasoning and checking conditions before acting, while others moved more aggressively into execution. The researchers quantified these differences using metrics such as "bias for action," "caution" and "token efficiency"-behavioral patterns that resembled differences among human researchers.

Current Limitations

The system remains constrained by hardware speed rather than AI reasoning. Much of the total runtime was consumed by physical processes such as robotic movement, microscope focusing and thermal stabilization rather than language-model computation.

Qumus operates in a highly specialized environment focused on two-dimensional materials. Extending the approach to broader scientific disciplines would require substantial customization of both robotic hardware and AI workflows.

Hallucination errors from large language models remain a challenge. Although Qumus corrected some mistakes autonomously, the study showed that AI-generated errors can still disrupt experiments and require additional validation layers.

The current demonstrations remain relatively simple compared with broader ambitions for autonomous scientific discovery. Producing graphene flakes and basic transistor structures represents a major engineering achievement, but it does not yet represent independent scientific insight or discovery of fundamentally new materials.

What Comes Next

Researchers report the system establishes a framework that could evolve as AI models and robotic systems improve. Future versions could operate inside inert-atmosphere gloveboxes, allowing handling of air-sensitive quantum materials that degrade when exposed to oxygen or moisture.

The researchers also envision networks of AI laboratories coordinating experiments across different scientific domains. The broader implication is that AI systems may increasingly move beyond analyzing scientific data and into physically conducting experiments themselves.

This transition could prove especially important in quantum materials research, where experimentation is often bottlenecked by scarce human expertise and labor-intensive procedures. If embodied AI systems can reliably automate those tasks, researchers may explore vastly larger combinations of materials, geometries and fabrication methods than human teams alone can practically manage.

For professionals in research roles, understanding how AI for Science & Research applies to laboratory automation and experimental design offers practical insight into how these systems may reshape scientific workflows.

Note: This research appears on arXiv, a preprint server. The work has not yet undergone formal peer review, which is an important step in the scientific process to verify results.


Get Daily AI News

Your membership also unlocks:

700+ AI Courses
700+ Certifications
Personalized AI Learning Plan
6500+ AI Tools (no Ads)
Daily AI News by job industry (no Ads)