AKINDOYIN Akinbiyi, PhD

Customer-facing Solutions and Security Architect, AI Security Specialist

About

Highly accomplished Customer-facing Solutions and Security Architect with over a decade of experience, specializing in leading the design, build, integration production of large-scale AI/HPC GPU infrastructure and GenAI application stacks across multi-cloud and on-prem environments, DevSecOps, and regulatory compliance in highly regulated industries (financial services, insurance, energy). Experienced in embedding defense-in-depth, least privilege, and secure-by-design principles into CI/CD workflows (using terraform), with practical expertise in CSPM/CNAPP, threat detection, and audit-ready security documentation. Expertly advises enterprise customers and cloud partners, multi-cloud LZ platform deployment, accelerating GPU platform adoption and consumption through advanced Python engineering, deep learning fundamentals (PyTorch/TensorFlow), and MLOps, consistently delivering impactful solutions and high-quality technical content.

Work

EY
|

Senior Director - Senior Cloud & Enterprise Architect

Houston, Texas, UK

Summary

Led multi-stakeholder technical engagements, producing reference architectures and implementation roadmaps for secure cloud platforms and distributed applications across healthcare, insurance, and energy sectors.

Highlights

Led multi-stakeholder technical engagements across architecture, build planning, integration, and go-live readiness, aligning engineering, product, and business leaders to achieve strategic delivery and technical requirements.

Developed comprehensive reference architectures, decision records, and implementation roadmaps, accelerating secure cloud platform adoption and distributed application patterns.

Managed end-to-end customer lifecycle, leading multi-disciplinary engagements from discovery to production cutover, resolving critical issues and driving feature discussions.

Designed and deployed large-scale GPU clusters for training and inference, delivering hardened reference designs with robust capacity planning, HA, and operational runbooks.

Standardized production serving patterns using NVIDIA stack (Triton, TensorRT-LLM, CUDA-X), operationalizing GPU telemetry with DCGM dashboards and alerts.

Optimized distributed training performance by tuning NCCL and topology-aware configurations, resolving critical bottlenecks across network, storage, and framework layers.

Co-developed partner-ready solution kits, including sizing guides, Terraform modules, and validation playbooks, accelerating solution adoption and consumption.

Led cloud security architecture for a multi-region data lakehouse platform, ensuring compliance with NIST 800-53, PCI-DSS, and GDPR.

Converted successful LLM fine-tuning and inference PoCs into production deployments, incorporating robust security controls, SLAs, and runbooks.

Implemented LoRA/QLORA pipelines and integrated model stacks with NeMo for enterprise RAG, enhancing model fine-tuning and deployment workflows.

Deployed and optimized GPU workloads across Kubernetes and SLURM, configuring scheduling, quotas, and multi-tenant controls for enterprise developer teams.

Established and executed benchmark methodology for training/inference throughput, identifying and resolving scaling bottlenecks through load tests and profiling.

Enabled enterprise developers by authoring tutorials, "golden path" templates, and reference repositories, significantly unblocking adoption.

Secured AWS infrastructure for forecasting and optimization workloads by defining robust VPC architectures, private subnets, security groups, and private link endpoints.

Designed and built secure, scalable multi-region architectures with network segmentation, IAM, encryption, and audit logging for high-throughput workloads.

Delivered IaC-based cluster provisioning and standardized environments, implementing policy guardrails and operational readiness checks.

Enhanced platform stability by establishing SLOs, telemetry, incident playbooks, and capacity forecasting, leading to proactive monitoring and automated remediation.

Automated security workflows using AWS Lambda and Step Functions, remediating open security groups, unencrypted S3 buckets, and non-compliant IAM policies.

Integrated vulnerability findings and misconfiguration alerts into ServiceNow, streamlining end-to-end remediation workflows with operations teams.

Authored high-quality architecture diagrams, runbooks, and security playbooks, enhancing incident response and operational procedures in a remote-first environment.

Lloyds Banking Group
|

Group Enterprise Cloud Architect

London, England, UK

Summary

Defined strategic roadmaps and provided architectural guidance for public cloud adoption, ensuring compliance with regulatory requirements and operational excellence.

Highlights

Defined strategic roadmaps and influenced investment prioritization for public cloud adoption, guiding strategy and design.

Conducted regular vulnerability assessments, ensuring compliance with SOC2, ISO 27001, and other regulatory requirements.

Played a pivotal role in incident response, leveraging Prisma Cloud to rapidly identify and remediate security incidents in the cloud.

Maintained the enterprise architecture repository, ensuring consolidation and accuracy of various teams' architectural artifacts.

Triaged large programs of work to identify broad enterprise architecture impact and effort.

Reviewed and assisted teams and line architects in aligning and cohesive local architecture strategies and roadmaps.

HSBC
|

Lead Cloud Architect, Digital Cloud and Platform Service (DCPS)

London, England, UK

Summary

Led the design and implementation of cloud security patterns, DevOps practices, and scalable cloud platforms for seamless migration and robust digital data solutions.

Highlights

Developed and designed cloud security patterns, enabling seamless migration of workloads to public cloud for cross-functional teams.

Developed proofs of concept for automations, enabling cloud-native services compliant with security, operational, and financial best practices.

Implemented DevOps practices, including IaC (Terraform, CloudFormation), CI/CD, and automated deployment.

Deployed Kubernetes container orchestration on AWS EKS and GCP GKE.

Implemented and managed continuous integration and delivery systems and methodologies.

Implemented and automated security controls, governance processes, and compliance frameworks/validation.

Provided technical input and expertise to architecture governance processes for new GCP and AWS cloud services.

Defined and deployed comprehensive monitoring, metrics, and logging cloud systems.

Implemented highly available, scalable, and self-healing systems for digital cloud data platforms, aligning with global blueprints (AWS and GCP WAF policies).

Delivered multiple production products, including NodeJS application migration to AWS, Container Scanning Solution, Service Hosted Platform, Automated AMI Bakery, and Digital Splunk on AWS.

Presented product solutions at KubeSec Europe 2020, showcasing expertise in cloud security and platforms.

Documented program-level design decisions and open items, enhancing transparency and alignment with key stakeholders.

Collaborated with cross-functional teams (Platform, Cloud, Security, Architecture), ensuring project deadlines and business requirements were consistently met.

Price Waterhouse Coopers LLP
|

Digital Technology Consultant

London, England, UK

Summary

Provided digital technology consulting, leveraging machine learning, cloud infrastructure design, and migration strategies to deliver automated solutions and best practices for diverse clients.

Highlights

Led internal and external projects, developing solutions for document/text ingestion, NLP/NLG, knowledge representation, and automated report generation.

Applied supervised and unsupervised machine learning methods to solve complex analytical problems.

Built and automated consistent AWS infrastructure solutions, distributing them across multiple project accounts.

Developed architecture blueprints and detailed documentation, including Service Catalog and bills of materials for AWS services (EC2, S3).

Created and promoted cloud security best practice blueprints, ensuring alignment and compliance with the cloud security team.

Built new SharePoint sites and Microsoft Teams to meet diverse customer requirements.

Supported and administered SharePoint (on-premise and online) and O365, including Teams and Office Apps.

Designed and executed comprehensive cloud migration strategies, including delivery architecture, migration plans, orchestration, and runbooks.

Identified and evangelized appropriate AWS and GCP architectural best practices across projects.

Designed and deployed scalable, highly available, and resilient solutions on GCP and AWS.

Mapped business requirements to technical solutions, developing business cases to capture ROI and cost implications.

Created and maintained optimal AWS data pipelines using Python.

Utilized a wide array of cloud big data tools, including AWS Lambda, EMR, Redshift, and SageMaker, for data processing.

Developed and trained machine learning models using AWS SageMaker.

Built infrastructure for optimal ETL from diverse data sources using SQL and AWS big data technologies.

Identified, designed, and implemented internal process improvements, automating manual processes and optimizing data delivery for scalability.

Automated EC2 AMI generation/deletion and provisioned lifecycle policies for S3 and AWS RDS.

Integrated AWS RDS (PostgreSQL) with MS Power BI to build analytics tools for actionable data insights.

Collaborated with market risk managers to test and validate models, facilitating independent review processes for CSDR.

Conducted regulatory analysis on market risk models using Python for CSDR implementation.

Supported risk managers in understanding model behavior and impact on specific portfolios.

Developed models (Excel, VBA, Access, SQL) to estimate P&L impact of T2S and buy-in penalties for CSDR implementation.

Developed and implemented an FRTB risk calculations framework, including models, calculation modules, and data analysis tools (Qlik, Alteryx, Tableau).

Designed and implemented a BluePrism RPA Proof of Concept to automate invoice statement generation, consolidating data from 5 disparate systems and applying logical business rules.

Education

Imperial College London
London, England, UK

PhD

Electrical and Electronic Engineering

Grade: 4.0/4.0

Courses

Thesis: "Uncertainties and Antenna Array geometry Selection". Industrial Research Collaboration with National Instrument.

Dissertation: Spiral: Uncertainties and array geometry selection (imperial.ac.uk)

University of Leeds
Leeds, England, UK

MSc

Broadband Wireless and Optical Communications

Grade: 86% (Distinction)

Courses

Ranked Top 1% in the class of 70 students.

Dissertation: "M.Sc. Thesis on: Physical Layer Security Using Artificial Noise and Spatial Beamforming". Available on Semantic Scholars via: https://pdfs.semanticscholar.org/dddd/9fb972a0182e3b50ad81cee6476ac9bbdde4.pdf

Obafemi Awolowo University
Ile-Ife, Osun, Nigeria

BSc (hons)

Computer Engineering

Grade: 4.7/5.0 (First Class Honors)

Courses

Best Graduating Student out of a Class of 160.

Dissertation: An Unpublished B.Sc. Thesis on Development of a Local Content ISM Band Helical Antenna for Rural Internet Development".

Publications

Recent Capital One Data Breach - Technical Analysis and Lessons Learnt

Summary

A. Akindoyin. Recent Capital One Data Breach - Technical Analysis and Lessons Learnt, Available at: https://www.linkedin.com/pulse/recent-capital-one-data-breach-technical-analysis-akindoyin-ph-d-/, August 2019.

Subframe resource optimization for massive machine device access in LTE networks

Summary

A. Ilori, A. Akindoyin, Z. Tang, J He. Subframe resource optimization for massive machine device access in LTE networks, https://arxiv.org/abs/1904.07966, March 2019.

Comparative Study of 2D Grid Antenna Array Geometries for Massive Array Systems

Published by

Proceedings IEEE GlobeComm

Summary

T. Gabillard, V. Sridhar, A. Akindoyin, A. Manikas. Comparative Study of 2D Grid Antenna Array Geometries for Massive Array Systems. Proceedings IEEE GlobeComm, Dec. 2015, San Diego USA.

Modelling and Estimation of Carrier Frequency and Phase Uncertainties in Large Aperture Arrays

Published by

Proceedings IEEE International Conference on Communications

Summary

A. Akindoyin. Modelling and Estimation of Carrier Frequency and Phase Uncertainties in Large Aperture Arrays. Proceedings IEEE International Conference on Communications, June 2015, London, UK.

Localization and Array Shape Estimation Using Software Defined Radio Array Test Bed

Published by

Proceedings of the Eight IEEE Sensor Array and Multichannel Signal Processing (SAM)

Summary

A. Akindoyin, M. Willerton and A. Manikas. Localization and Array Shape Estimation Using Software Defined Radio Array Test Bed. Proceedings of the Eight IEEE Sensor Array and Multichannel Signal Processing (SAM), June 2014, Corunda, Spain.

Languages

English

Certificates

NVIDIA Certified AI Infrastructure and Operations

Issued By

NVIDIA

GCP Professional Cloud Security Engineer (re-certified)

Issued By

Google Cloud Platform

AZ-305 Designing Microsoft Azure Infrastructure Solutions

Issued By

Microsoft Azure

HashiCorp Certified Terraform Associate

Issued By

HashiCorp

GCP Certified Machine Learning Engineer

Issued By

Google Cloud Platform

GCP Certified Digital Leader

Issued By

Google Cloud Platform

TOGAF Certified Enterprise Architect

Issued By

TOGAF

GCP Certified Professional Architect

Issued By

Google Cloud Platform

AWS SME in Solution Architecture and Security

Issued By

Amazon Web Services

Certified SAFE 5 Agilist

Issued By

Scaled Agile, Inc.

AWS Certified Machine Learning Specialty

Issued By

Amazon Web Services

AWS Certified Big Data Specialty

Issued By

Amazon Web Services

AWS Certified Solution Architect - Professional

Issued By

Amazon Web Services

AWS Certified Cloud Practitioner

Issued By

Amazon Web Services

Certified Information Systems Security Professional (CISSP)

Issued By

ISC²

Certified Kubernetes Application Developer (CKAD)

Issued By

Cloud Native Computing Foundation

Certified Kubernetes Administrator (CKA)

Issued By

Cloud Native Computing Foundation

AWS Certified Security Specialty

Issued By

Amazon Web Services

GCP Certified Associate Engineer

Issued By

Google Cloud Platform

AWS Certified Developer – Associate

Issued By

Amazon Web Services

AWS Certified SysOps Administrator

Issued By

Amazon Web Services

AWS Certified Solution Architect – Associate

Issued By

Amazon Web Services

Implementing Microsoft Azure Infrastructure Solutions – Microsoft Certified Professional (70-533)

Issued By

Microsoft Azure

ITIL V3 Foundation

Issued By

AXELOS Global Best Practice

Cisco Certified Internetworking Expert Data Center 350-080 (CCIE – written)

Issued By

Cisco

Building Cisco Multilayer Switched Network (BCMSN)

Issued By

Cisco

Microsoft Certified Technologist (MCTS) – SQL Server Database Developer (70-433)

Issued By

Microsoft

Microsoft Certified Technologist Specialist (MCTS) – Window Server 2008 Network Configuring (70-642)

Issued By

Microsoft

Cisco Certified Network Associate (CCNA) 640-802

Issued By

Cisco

Skills

GPU Cloud Infrastructure & HPC

GPU cluster design, Compute, Networking, Storage, Capacity planning, Reliability engineering, HA/DR, Upgrade/patch strategy, RDMA/InfiniBand/Ethernet, High-throughput east/west patterns, Load balancing, Segmentation, NCCL, DCGM, UFM, Mission Control, Base Command Manager.

Cloud & Orchestration

Kubernetes at scale (EKS/AKS/GKE), GPU Operator, Autoscaling, Multi-tenant isolation, SLURM cluster orchestration (partitions/queues, scheduling policies, accounting), K8s batch frameworks, Distributed cloud architectures, Hybrid connectivity, Infrastructure automation.

Programming & Debugging

Strong Python engineering, Performance profiling, Software design, Observability-first troubleshooting, PyTorch/TensorFlow, Distributed training, Data pipelines, Reproducibility, Experiment tracking.

Customer & Partner Leadership

Primary technical driver across lifecycle, Technical discovery, PoCs, Architecture reviews, Debugging sessions, Executive communication, Workshops, Tutorials, Reference architectures, Partner enablement.

Technical Skills

Python (expert), Bash, SQL, Go/C++ (nice-to-have), PyTorch, TensorFlow, Hugging Face ecosystem, Distributed training patterns, CUDA, NCCL, DCGM, Triton, TensorRT/TensorRT-LLM, NeMo, Dynamo, NeMo Retriever, RAPIDS, CI/CD, Model registry, Experiment tracking, Feature stores, Monitoring, Canary/blue-green deployments, Kubernetes (EKS/AKS/GKE), SLURM, Helm, GitOps, AWS, Azure, GCP, VPC/VNet, IAM, Managed storage, Observability, Security controls, Profiling, Load testing, GPU utilization tuning, AI benchmarking (e.g., MLPerf-style methodology).