AKINDOYIN Akinbiyi, PhD
Customer-facing Solutions and Security Architect, AI Security Specialist
About
Highly accomplished Customer-facing Solutions and Security Architect with over a decade of experience, specializing in leading the design, build, integration production of large-scale AI/HPC GPU infrastructure and GenAI application stacks across multi-cloud and on-prem environments, DevSecOps, and regulatory compliance in highly regulated industries (financial services, insurance, energy). Experienced in embedding defense-in-depth, least privilege, and secure-by-design principles into CI/CD workflows (using terraform), with practical expertise in CSPM/CNAPP, threat detection, and audit-ready security documentation. Expertly advises enterprise customers and cloud partners, multi-cloud LZ platform deployment, accelerating GPU platform adoption and consumption through advanced Python engineering, deep learning fundamentals (PyTorch/TensorFlow), and MLOps, consistently delivering impactful solutions and high-quality technical content.
Work
EY
|Senior Director - Senior Cloud & Enterprise Architect
Houston, Texas, UK
→
Summary
Led multi-stakeholder technical engagements, producing reference architectures and implementation roadmaps for secure cloud platforms and distributed applications across healthcare, insurance, and energy sectors.
Highlights
Led multi-stakeholder technical engagements across architecture, build planning, integration, and go-live readiness, aligning engineering, product, and business leaders to achieve strategic delivery and technical requirements.
Developed comprehensive reference architectures, decision records, and implementation roadmaps, accelerating secure cloud platform adoption and distributed application patterns.
Managed end-to-end customer lifecycle, leading multi-disciplinary engagements from discovery to production cutover, resolving critical issues and driving feature discussions.
Designed and deployed large-scale GPU clusters for training and inference, delivering hardened reference designs with robust capacity planning, HA, and operational runbooks.
Standardized production serving patterns using NVIDIA stack (Triton, TensorRT-LLM, CUDA-X), operationalizing GPU telemetry with DCGM dashboards and alerts.
Optimized distributed training performance by tuning NCCL and topology-aware configurations, resolving critical bottlenecks across network, storage, and framework layers.
Co-developed partner-ready solution kits, including sizing guides, Terraform modules, and validation playbooks, accelerating solution adoption and consumption.
Led cloud security architecture for a multi-region data lakehouse platform, ensuring compliance with NIST 800-53, PCI-DSS, and GDPR.
Converted successful LLM fine-tuning and inference PoCs into production deployments, incorporating robust security controls, SLAs, and runbooks.
Implemented LoRA/QLORA pipelines and integrated model stacks with NeMo for enterprise RAG, enhancing model fine-tuning and deployment workflows.
Deployed and optimized GPU workloads across Kubernetes and SLURM, configuring scheduling, quotas, and multi-tenant controls for enterprise developer teams.
Established and executed benchmark methodology for training/inference throughput, identifying and resolving scaling bottlenecks through load tests and profiling.
Enabled enterprise developers by authoring tutorials, "golden path" templates, and reference repositories, significantly unblocking adoption.
Secured AWS infrastructure for forecasting and optimization workloads by defining robust VPC architectures, private subnets, security groups, and private link endpoints.
Designed and built secure, scalable multi-region architectures with network segmentation, IAM, encryption, and audit logging for high-throughput workloads.
Delivered IaC-based cluster provisioning and standardized environments, implementing policy guardrails and operational readiness checks.
Enhanced platform stability by establishing SLOs, telemetry, incident playbooks, and capacity forecasting, leading to proactive monitoring and automated remediation.
Automated security workflows using AWS Lambda and Step Functions, remediating open security groups, unencrypted S3 buckets, and non-compliant IAM policies.
Integrated vulnerability findings and misconfiguration alerts into ServiceNow, streamlining end-to-end remediation workflows with operations teams.
Authored high-quality architecture diagrams, runbooks, and security playbooks, enhancing incident response and operational procedures in a remote-first environment.
Lloyds Banking Group
|Group Enterprise Cloud Architect
London, England, UK
→
Summary
Defined strategic roadmaps and provided architectural guidance for public cloud adoption, ensuring compliance with regulatory requirements and operational excellence.
Highlights
Defined strategic roadmaps and influenced investment prioritization for public cloud adoption, guiding strategy and design.
Conducted regular vulnerability assessments, ensuring compliance with SOC2, ISO 27001, and other regulatory requirements.
Played a pivotal role in incident response, leveraging Prisma Cloud to rapidly identify and remediate security incidents in the cloud.
Maintained the enterprise architecture repository, ensuring consolidation and accuracy of various teams' architectural artifacts.
Triaged large programs of work to identify broad enterprise architecture impact and effort.
Reviewed and assisted teams and line architects in aligning and cohesive local architecture strategies and roadmaps.
HSBC
|Lead Cloud Architect, Digital Cloud and Platform Service (DCPS)
London, England, UK
→
Summary
Led the design and implementation of cloud security patterns, DevOps practices, and scalable cloud platforms for seamless migration and robust digital data solutions.
Highlights
Developed and designed cloud security patterns, enabling seamless migration of workloads to public cloud for cross-functional teams.
Developed proofs of concept for automations, enabling cloud-native services compliant with security, operational, and financial best practices.
Implemented DevOps practices, including IaC (Terraform, CloudFormation), CI/CD, and automated deployment.
Deployed Kubernetes container orchestration on AWS EKS and GCP GKE.
Implemented and managed continuous integration and delivery systems and methodologies.
Implemented and automated security controls, governance processes, and compliance frameworks/validation.
Provided technical input and expertise to architecture governance processes for new GCP and AWS cloud services.
Defined and deployed comprehensive monitoring, metrics, and logging cloud systems.
Implemented highly available, scalable, and self-healing systems for digital cloud data platforms, aligning with global blueprints (AWS and GCP WAF policies).
Delivered multiple production products, including NodeJS application migration to AWS, Container Scanning Solution, Service Hosted Platform, Automated AMI Bakery, and Digital Splunk on AWS.
Presented product solutions at KubeSec Europe 2020, showcasing expertise in cloud security and platforms.
Documented program-level design decisions and open items, enhancing transparency and alignment with key stakeholders.
Collaborated with cross-functional teams (Platform, Cloud, Security, Architecture), ensuring project deadlines and business requirements were consistently met.
Price Waterhouse Coopers LLP
|Digital Technology Consultant
London, England, UK
→
Summary
Provided digital technology consulting, leveraging machine learning, cloud infrastructure design, and migration strategies to deliver automated solutions and best practices for diverse clients.
Highlights
Led internal and external projects, developing solutions for document/text ingestion, NLP/NLG, knowledge representation, and automated report generation.
Applied supervised and unsupervised machine learning methods to solve complex analytical problems.
Built and automated consistent AWS infrastructure solutions, distributing them across multiple project accounts.
Developed architecture blueprints and detailed documentation, including Service Catalog and bills of materials for AWS services (EC2, S3).
Created and promoted cloud security best practice blueprints, ensuring alignment and compliance with the cloud security team.
Built new SharePoint sites and Microsoft Teams to meet diverse customer requirements.
Supported and administered SharePoint (on-premise and online) and O365, including Teams and Office Apps.
Designed and executed comprehensive cloud migration strategies, including delivery architecture, migration plans, orchestration, and runbooks.
Identified and evangelized appropriate AWS and GCP architectural best practices across projects.
Designed and deployed scalable, highly available, and resilient solutions on GCP and AWS.
Mapped business requirements to technical solutions, developing business cases to capture ROI and cost implications.
Created and maintained optimal AWS data pipelines using Python.
Utilized a wide array of cloud big data tools, including AWS Lambda, EMR, Redshift, and SageMaker, for data processing.
Developed and trained machine learning models using AWS SageMaker.
Built infrastructure for optimal ETL from diverse data sources using SQL and AWS big data technologies.
Identified, designed, and implemented internal process improvements, automating manual processes and optimizing data delivery for scalability.
Automated EC2 AMI generation/deletion and provisioned lifecycle policies for S3 and AWS RDS.
Integrated AWS RDS (PostgreSQL) with MS Power BI to build analytics tools for actionable data insights.
Collaborated with market risk managers to test and validate models, facilitating independent review processes for CSDR.
Conducted regulatory analysis on market risk models using Python for CSDR implementation.
Supported risk managers in understanding model behavior and impact on specific portfolios.
Developed models (Excel, VBA, Access, SQL) to estimate P&L impact of T2S and buy-in penalties for CSDR implementation.
Developed and implemented an FRTB risk calculations framework, including models, calculation modules, and data analysis tools (Qlik, Alteryx, Tableau).
Designed and implemented a BluePrism RPA Proof of Concept to automate invoice statement generation, consolidating data from 5 disparate systems and applying logical business rules.
Education
Imperial College London
→
PhD
Electrical and Electronic Engineering
Grade: 4.0/4.0
Courses
Thesis: "Uncertainties and Antenna Array geometry Selection". Industrial Research Collaboration with National Instrument.
Dissertation: Spiral: Uncertainties and array geometry selection (imperial.ac.uk)
University of Leeds
→
MSc
Broadband Wireless and Optical Communications
Grade: 86% (Distinction)
Courses
Ranked Top 1% in the class of 70 students.
Dissertation: "M.Sc. Thesis on: Physical Layer Security Using Artificial Noise and Spatial Beamforming". Available on Semantic Scholars via: https://pdfs.semanticscholar.org/dddd/9fb972a0182e3b50ad81cee6476ac9bbdde4.pdf
Obafemi Awolowo University
→
BSc (hons)
Computer Engineering
Grade: 4.7/5.0 (First Class Honors)
Courses
Best Graduating Student out of a Class of 160.
Dissertation: An Unpublished B.Sc. Thesis on Development of a Local Content ISM Band Helical Antenna for Rural Internet Development".
Publications
Comparative Study of 2D Grid Antenna Array Geometries for Massive Array Systems
Published by
Proceedings IEEE GlobeComm
Summary
T. Gabillard, V. Sridhar, A. Akindoyin, A. Manikas. Comparative Study of 2D Grid Antenna Array Geometries for Massive Array Systems. Proceedings IEEE GlobeComm, Dec. 2015, San Diego USA.
Modelling and Estimation of Carrier Frequency and Phase Uncertainties in Large Aperture Arrays
Published by
Proceedings IEEE International Conference on Communications
Summary
A. Akindoyin. Modelling and Estimation of Carrier Frequency and Phase Uncertainties in Large Aperture Arrays. Proceedings IEEE International Conference on Communications, June 2015, London, UK.
Localization and Array Shape Estimation Using Software Defined Radio Array Test Bed
Published by
Proceedings of the Eight IEEE Sensor Array and Multichannel Signal Processing (SAM)
Summary
A. Akindoyin, M. Willerton and A. Manikas. Localization and Array Shape Estimation Using Software Defined Radio Array Test Bed. Proceedings of the Eight IEEE Sensor Array and Multichannel Signal Processing (SAM), June 2014, Corunda, Spain.
Languages
English
Certificates
NVIDIA Certified AI Infrastructure and Operations
Issued By
NVIDIA
GCP Professional Cloud Security Engineer (re-certified)
Issued By
Google Cloud Platform
AZ-305 Designing Microsoft Azure Infrastructure Solutions
Issued By
Microsoft Azure
HashiCorp Certified Terraform Associate
Issued By
HashiCorp
GCP Certified Machine Learning Engineer
Issued By
Google Cloud Platform
GCP Certified Digital Leader
Issued By
Google Cloud Platform
TOGAF Certified Enterprise Architect
Issued By
TOGAF
GCP Certified Professional Architect
Issued By
Google Cloud Platform
AWS SME in Solution Architecture and Security
Issued By
Amazon Web Services
Certified SAFE 5 Agilist
Issued By
Scaled Agile, Inc.
AWS Certified Machine Learning Specialty
Issued By
Amazon Web Services
AWS Certified Big Data Specialty
Issued By
Amazon Web Services
AWS Certified Solution Architect - Professional
Issued By
Amazon Web Services
AWS Certified Cloud Practitioner
Issued By
Amazon Web Services
Certified Information Systems Security Professional (CISSP)
Issued By
ISC²
Certified Kubernetes Application Developer (CKAD)
Issued By
Cloud Native Computing Foundation
Certified Kubernetes Administrator (CKA)
Issued By
Cloud Native Computing Foundation
AWS Certified Security Specialty
Issued By
Amazon Web Services
GCP Certified Associate Engineer
Issued By
Google Cloud Platform
AWS Certified Developer – Associate
Issued By
Amazon Web Services
AWS Certified SysOps Administrator
Issued By
Amazon Web Services
AWS Certified Solution Architect – Associate
Issued By
Amazon Web Services
Implementing Microsoft Azure Infrastructure Solutions – Microsoft Certified Professional (70-533)
Issued By
Microsoft Azure
ITIL V3 Foundation
Issued By
AXELOS Global Best Practice
Cisco Certified Internetworking Expert Data Center 350-080 (CCIE – written)
Issued By
Cisco
Building Cisco Multilayer Switched Network (BCMSN)
Issued By
Cisco
Microsoft Certified Technologist (MCTS) – SQL Server Database Developer (70-433)
Issued By
Microsoft
Microsoft Certified Technologist Specialist (MCTS) – Window Server 2008 Network Configuring (70-642)
Issued By
Microsoft
Cisco Certified Network Associate (CCNA) 640-802
Issued By
Cisco
Skills
GPU Cloud Infrastructure & HPC
GPU cluster design, Compute, Networking, Storage, Capacity planning, Reliability engineering, HA/DR, Upgrade/patch strategy, RDMA/InfiniBand/Ethernet, High-throughput east/west patterns, Load balancing, Segmentation, NCCL, DCGM, UFM, Mission Control, Base Command Manager.
Cloud & Orchestration
Kubernetes at scale (EKS/AKS/GKE), GPU Operator, Autoscaling, Multi-tenant isolation, SLURM cluster orchestration (partitions/queues, scheduling policies, accounting), K8s batch frameworks, Distributed cloud architectures, Hybrid connectivity, Infrastructure automation.
Programming & Debugging
Strong Python engineering, Performance profiling, Software design, Observability-first troubleshooting, PyTorch/TensorFlow, Distributed training, Data pipelines, Reproducibility, Experiment tracking.
Customer & Partner Leadership
Primary technical driver across lifecycle, Technical discovery, PoCs, Architecture reviews, Debugging sessions, Executive communication, Workshops, Tutorials, Reference architectures, Partner enablement.
Technical Skills
Python (expert), Bash, SQL, Go/C++ (nice-to-have), PyTorch, TensorFlow, Hugging Face ecosystem, Distributed training patterns, CUDA, NCCL, DCGM, Triton, TensorRT/TensorRT-LLM, NeMo, Dynamo, NeMo Retriever, RAPIDS, CI/CD, Model registry, Experiment tracking, Feature stores, Monitoring, Canary/blue-green deployments, Kubernetes (EKS/AKS/GKE), SLURM, Helm, GitOps, AWS, Azure, GCP, VPC/VNet, IAM, Managed storage, Observability, Security controls, Profiling, Load testing, GPU utilization tuning, AI benchmarking (e.g., MLPerf-style methodology).