AI Agents Sandbox: Outcome of Trial and Emerging Considerations

20 May 2026

On 20 May 2026, the outcome of the global-first Artificial Intelligence (“AI“) Agents Sandbox that was launched in August 2025, was published. Results showed strong potential for automation and citizen services, while highlighting risks in oversight, cybersecurity, privacy, and governance for future agentic AI systems.

The sandbox was a collaboration between Google and the Singapore Government, specifically the Cyber Security Agency of Singapore (“CSA“), Government Technology Agency of Singapore, and the Infocomm Media Development Authority. It aimed to better understand how agents operate in real-world settings, and to use those insights to inform how they are developed, deployed, and governed. The sandbox was conducted over approximately four months.

Outcomes from the Sandbox

Automated quality assurance: The sandbox explored how AI agents could support and automate quality assurance testing of government digital services, thereby improving reliability and freeing up engineering resources.
- Outcome: The agent successfully evaluated government websites, testing response times, search functionality, and page integrity. Using natural language understanding, it correctly identified intentionally seeded inactive pages, filler text, and staging Uniform Resource Locator (URL) mismatches. This demonstrates the broader potential of agentic AI in software quality assurance.

AI safety testing: The sandbox explored automating the safety testing of AI software like chatbots to ensure that they meet the government’s requirements prior to deployment.
- Outcome: The trial showed that AI agents can reliably perform large-scale safety testing across various languages and formats, significantly reducing the manual effort needed for chatbot assessments. While the implementation was not entirely error-free, this approach offers a more scalable and consistent way to strengthen AI assurance as the technology evolves.

Social assistance applications: The sandbox explored assisting citizens in navigating and applying for social assistance programmes, thereby helping to streamline complex processes.
- Outcome: The trial demonstrated the agent’s ability to guide applicants or social workers through complex social assistance application processes, potentially reducing the need for substantial resources devoted to in-person assistance, helplines, and manual follow-ups to address errors, omissions, and incomplete submissions.

Risks and Challenges that Emerged from the Sandbox

Human oversight: A key risk lies in ensuring sufficient control and accountability, particularly where decisions have real-world consequences for individuals.

Customisation and control: Another challenge includes balancing flexibility with safeguards, especially in testing or evaluation environments.

Cybersecurity risks: These risks include, most prominently, indirect prompt injection, where an agent could be deceived into performing unintended actions.

Data protection and privacy: Risks arise where agents interact directly with personal data, including potential privacy breaches or data leakages.

Preparing for a Future with AI Agents

Two broad sets of considerations emerged for AI agent use, including: (i) near-term considerations, e.g. the kinds of concrete measures, controls, and design choices that organisations may want to consider; and (ii) longer-term issues that may require deeper study as agentic technologies evolve.

Strengthening trust and resilience for AI agents today
- Choosing where to start with AI agents: The sandbox highlighted the importance of controlled testing and incremental real-world deployment as a means of building confidence and trust in AI agents.
- Calibrating the level of human oversight: Oversight should be risk-based to balance control and autonomy, with safeguards distributed across the system, the organisation, and the user. Higher-risk actions may require pre-approval, while lower-risk actions can proceed with post-hoc review where outcomes are reversible and redress mechanisms exist.
- Keeping AI agents secure: Developing and deploying AI agents securely is a shared responsibility. Safeguards should be distributed across the platform/model, system/organisational, and end-user levels, depending on which actors are best placed to anticipate and manage different risks.
- Balancing flexibility and control in AI agents: Systems should be safe and secure by default, while allowing for calibrated flexibility and customisation if appropriate. Such flexibility should, however, remain bounded so that experimentation does not inadvertently introduce new risks.

Exploring the horizons of an agentic future
- Addressing current technical limitations: The sandbox demonstrated both the potential of agentic systems and what may become possible as the technology matures. It also highlighted areas for further exploration, e.g. how screenshot-based perception could be complemented by alternative techniques in scenarios requiring higher accuracy, particularly when handling information-dense content.
- Potential of multi-agent approaches: During the sandbox, participants discussed the potential for multiple agents to collaborate to review, critique, and refine outputs. Such approaches could unlock new capabilities but also bring interoperability and governance challenges into sharper focus. If agents developed by different organisations are expected to interact, coalition-led open standards and common foundations will be increasingly important.
- Building the digital infrastructure for agentic AI: The sandbox highlighted a mismatch between today’s digital environment that is largely designed around human users, and one that could reliably support agentic interaction at scale. Elements of the underlying ecosystem, from identity and authentication frameworks to permission and access controls, may need to evolve to accommodate more autonomous, agent-driven interactions.
- Balancing privacy and personalisation with AI agents: As AI agents become more autonomous and gain greater access to users’ personal context, the tension between the benefits of personalisation and privacy becomes more pronounced. The challenge is in ensuring that agents can continue to leverage data to deliver value, while maintaining meaningful user control, minimising unnecessary data use, and exploring privacy-enhancing approaches that move beyond a zero-sum trade-off between utility and protection.

Click on the following link for more information:

CSA Press Release titled “AI Agents: Insights from the Singapore Government and Google Sandbox” (available on the CSA website at www.csa.gov.sg)

Disclaimer

Rajah & Tann Asia is a network of member firms with local legal practices in Cambodia, Indonesia, Lao PDR, Malaysia, Myanmar, the Philippines, Singapore, Thailand and Vietnam. Our Asian network also includes our regional office in China as well as regional desks focused on Brunei, Japan and South Asia. Member firms are independently constituted and regulated in accordance with relevant local requirements.

The contents of this publication are owned by Rajah & Tann Asia together with each of its member firms and are subject to all relevant protection (including but not limited to copyright protection) under the laws of each of the countries where the member firm operates and, through international treaties, other countries. No part of this publication may be reproduced, licensed, sold, published, transmitted, modified, adapted, publicly displayed, broadcast (including storage in any medium by electronic means whether or not transiently for any purpose save as permitted herein) without the prior written permission of Rajah & Tann Asia or its respective member firms.

Please note also that whilst the information in this publication is correct to the best of our knowledge and belief at the time of writing, it is only intended to provide a general guide to the subject matter and should not be treated as legal advice or a substitute for specific professional advice for any particular course of action as such information may not suit your specific business and operational requirements. You should seek legal advice for your specific situation. In addition, the information in this publication does not create any relationship, whether legally binding or otherwise. Rajah & Tann Asia and its member firms do not accept, and fully disclaim, responsibility for any loss or damage which may result from accessing or relying on the information in this publication.

CONTACTS

Rajesh Sreenivasan

Brunei, Singapore,

+65 6232 0751

Steve Tan

Singapore,

+65 6232 0786

Benjamin Cheong 张汶权

China, Singapore,

+65 6232 0738

Country

Singapore

EXPERTISE

RELATED VIEWPOINT

Legal Updates

MOT Consults on Proposed Legislative Framework for Autonomous Vehicles Regarding Liability, Insurance, and Cybersecurity

9 Jun 2026

Legal Updates

Court of Appeal Clarifies Statutory Limits on Damages for Trade Mark Infringement

9 Jun 2026

Legal Updates

Striking the Balance between Criminal and Insolvency Regimes: High Court Sets Out Approach for Recovery Claims in Liquidation

9 Jun 2026

Countries

Regional Desks

PRACTICES

PEOPLE SEARCH

News

RESPONSIBLE BUSINESS

Countries

Regional Desks

PRACTICES

PEOPLE SEARCH

News

RESPONSIBLE BUSINESS

Countries

Regional Desks

Countries

Regional Desks

PRACTICES

PEOPLE SEARCH

News

RESPONSIBLE BUSINESS

Countries

Regional Desks

PRACTICES

PEOPLE SEARCH

News

RESPONSIBLE BUSINESS

Countries

Regional Desks

AI Agents Sandbox: Outcome of Trial and Emerging Considerations

CONTACTS

Country

EXPERTISE

Share

RELATED VIEWPOINT

Rajah & Tann Singapore LLP

Our Offices

Our Regional Desks

Our AI Journey

Rajah & Tann Asia is a network of legal practices based in Asia.

Rajah & Tann Singapore LLP

Our Offices

Our Regional Desks

Our AI Journey

Rajah & Tann Asia is a network of legal practices based in Asia.