Premium Minds
SRE & Product Manager
Apr 2012 — Dec 2024 · 12 yrs 9 mos
- Owned the infrastructure while leading product for Spain/Portugal's #1 parking app (Telpark, 4M+ users), driving practices across 7 teams and 40+ engineers
- Eliminated recurring outages caused by undetected database issues and deployment failures, building observability and release processes that delivered 99.95% uptime
- Architected migration from static EC2 instances to Nomad-orchestrated cluster, using IaC Terraform, enabling dynamic scaling and zero-downtime deployments across 30+ services
- Unified CI/CD pipelines across 30+ microservices through standardised Jenkins workflows, reducing maintenance burden and ensuring consistent practices
- Replaced basic CloudWatch monitoring with comprehensive Datadog stack (metrics, APM, alerting), shifting from reactive firefighting to proactive incident detection
Problem Solving
Kubernetes
AWS
Terraform
Nomad
Datadog