About
Gagandeep Sidhu
Staff Platform Engineer with 12+ years of experience designing, modernizing, and operating AWS-based infrastructure platforms across fintech and SaaS. Based in Mission, BC — open to US remote opportunities.
I work at the intersection of platform engineering, reliability, and cloud economics. Right now that means leading platform modernization at Nomis Solutions — cutting deployment times from 2 hours to under 4 minutes, reducing cloud spend by ~25%, and shipping AI-powered tooling on AWS Bedrock.
Experience
Staff Platform Engineer
Nomis Solutions — Canada Remote
- Architected and operated AWS platform infrastructure supporting fintech banking products across ECS and Kubernetes environments.
- Led migration of legacy EC2-based deployments to ECS-driven pipelines, cutting release cycles from ~2 hours to under 4 minutes.
- Designed production-grade EKS platform with private cluster patterns, IRSA, managed node groups, and GitOps delivery via ArgoCD and Argo Rollouts.
- Built reusable Terraform modules for ECS, EKS, ECR, networking, IAM, and security controls.
- Redesigned a multi-tenant optimization service using SQS-based async communication — enabling ~70% infrastructure cost savings on optimized workloads.
- Built an AI-powered troubleshooting assistant using Agno to inspect MongoDB, environment variables, and application dependencies.
- Implemented SOC 2-aligned controls including CloudTrail auditing, IAM least-privilege policies, and secrets management hardening.
- Led CenturyLink-to-AWS migration with MongoDB replica-set failover — zero data loss.
- Built observability workflows with CloudWatch, Sumo Logic, Prometheus, and Grafana — contributing to ~25% MTTR reduction.
- Integrated Amazon Bedrock to enable LLM-based platform workflows and early AI features.
- Mentored engineers on Kubernetes, Terraform, and AWS platform engineering best practices.
- Received 6 Bold Awards across 10 quarters for platform modernization, security, and reliability.
Team Lead DevOps
Pepper Content
- Led DevOps and cloud infrastructure strategy for a microservices platform on AWS (ECS, Lambda, SQS).
- Built and improved CI/CD pipelines using GitHub Actions and Jenkins.
- Introduced Terraform-based provisioning to reduce configuration drift.
- Implemented centralized logging with ELK stack for operational visibility.
Senior Site Reliability Engineer
Mindtickle
- Defined SLI/SLO-driven reliability practices for distributed systems in production.
- Built monitoring and alerting with Datadog, Prometheus, and Sumo Logic.
- Led incident response, root cause analysis, and follow-up reliability improvements.
Lead DevOps Engineer
Sentieo India Pvt Ltd
- Designed and managed MongoDB infrastructure for financial analytics workloads.
- Built CI/CD pipelines with Jenkins and monitoring with Datadog, ELK, and Nagios.
- Mentored a team of 4 DevOps engineers on operational practices.
Earlier roles
Safaltek Software · Syniverse Technologies
Certifications
Education
Bachelor of Technology, Computer Science
Lovely Professional University · 2012
Get in touch
Open to Staff / Principal platform engineering roles — especially US remote. Reach me at gagan6011@gmail.com or on LinkedIn.