Skip to main content
Posted 30 June, 2026
Accenture

AI Infrastructure Architect

London, United Kingdom Full Time

YOU ARE

As a hands-on Infrastructure Architect, you are an experienced engineer with several years in infrastructure engineering who now takes on more complex, higher-impact work designing and optimizing the AI and machine learning infrastructure that powers real-world applications. Working alongside senior architects and engineers - and increasingly leading your own workstreams - you apply proven skills in coding, testing, configuring, deploying, monitoring, and troubleshooting AI systems and the infrastructure they run on. Day to day, you architect and optimize infrastructure components, write and review code and deployment scripts, design and tune cloud and on-premises compute resources such as GPU clusters and distributed training environments, deploy AI systems and models into production, and build and optimize data pipelines that feed AI and ML workflows. You optimize the computational stack for performance, cost, power, and scalability, monitor AI systems and infrastructure health across both InfraOps and MLOps disciplines, perform AI monitoring to track model and system performance, and independently troubleshoot and resolve complex issues across the stack. You also mentor junior engineers, contribute to architectural decisions, and help establish best practices. This is a hands-on, ownership-driven role where you apply and deepen your expertise across modern tools and platforms - including container orchestration, model serving, CI/CD pipelines, InfraOps, MLOps, and AI monitoring - while making meaningful contributions to infrastructure that enables AI-driven business outcomes.

THE WORK

  • Write, review, and debug code, scripts, and infrastructure-as-code for AI infrastructure, automation, and tooling, setting standards for quality across the team.


  • Architect, configure, and provision compute resources across cloud and on-premises environments, including GPU clusters and distributed training setups, optimizing for performance and utilization.


  • Design and maintain deployment automation and CI/CD pipelines to support reliable, repeatable releases of AI systems, models, and applications.


  • Deploy AI systems, models, and data pipelines into production, defining and improving the processes and best practices others follow.


  • Lead container orchestration and model serving using tools such as Docker, Kubernetes, and model deployment frameworks.


  • Architect and optimize the computational stack for performance, power, cost, and scalability, balancing trade-offs against business goals.


  • Evaluate and select tools, frameworks, and platforms, making recommendations that shape the infrastructure roadmap.


  • Integrate AI models and systems into existing enterprise systems, ensuring interoperability, security, and regulatory compliance.


  • Own AI monitoring and infrastructure health across InfraOps and MLOps, tracking performance, reliability, and utilization, and driving remediation.


  • Independently troubleshoot and resolve complex issues across the computational stack - hardware, networking, software, and models - and lead root-cause analysis.


  • Mentor junior engineers and lead code reviews, providing technical direction and supporting their growth.


  • Define and document architecture standards, processes, and procedures, and apply security, cost-efficiency, and scalability best practices across the infrastructure.