Diogo Figueiredo

Staff SRE • Cloud Architect • Product Leader

Lisbon, Portugal

15+ Years Experience
4M+ Users Served
99.95% Uptime Achieved

About

Staff SRE with 15+ years building and scaling infrastructure for products serving millions of users. I specialize in platform transformations: taking systems from frequent outages to 99.95% uptime, reducing cloud costs, and building observability that enables teams to move faster with confidence.

Deep expertise in AWS, Terraform, Kubernetes, Nomad, and Datadog. Experienced in leading cross-functional teams of up to 15 engineers across infrastructure, mobile, and product.

Key Skills

Amazon Web Services (AWS) • Product Development • DevOps • Management • Site Reliability Engineering • Terraform • Kubernetes • Nomad • Datadog • CI/CD

Experience

Staff SRE

Dashlane Jan 2025 – Present · 1 yr 5 mos

Leading SRE and cloud architecture for a password manager/B2B security product serving millions of users globally.

  • Leading company observability initiative, designing unified monitoring architecture and establishing consistent instrumentation
  • Reducing AWS infrastructure costs through architecture audit and rightsizing initiatives
  • Driving automated dependency/update workflows in partnership with the Security team to reduce security risk surface and free engineering time
  • Establishing SLO framework across platform services, defining reliability targets and error budgets
  • Establishing disaster recovery strategy for critical components, reducing RTO with automated failover
Terraform AWS SRE

SRE & Product Manager

Premium Minds Apr 2012 – Dec 2024 · 12 yrs 9 mos

Owned the infrastructure while leading product for Spain/Portugal’s #1 parking app (4M+ users), driving practices across 7 teams and 40+ engineers.

  • Eliminated recurring outages caused by undetected database issues and deployment failures, building observability and release processes that delivered 99.95% uptime
  • Architected migration from static EC2 instances to a Nomad-orchestrated cluster managed as infrastructure as code with Terraform, enabling dynamic scaling and zero-downtime deployments across 30+ services
  • Unified CI/CD pipelines across 30+ microservices through standardized Jenkins workflows, reducing maintenance burden
  • Replaced basic CloudWatch monitoring with comprehensive Datadog stack (metrics, APM, alerting), shifting from reactive firefighting to proactive incident detection

Product Manager: Led product development of the #1 car parking app in Spain and Portugal, serving over 4 million users.

Kubernetes Terraform AWS Product

SRE

Etleap Feb 2021 – May 2024 · 3 yrs 4 mos

Responsible for infrastructure reliability and developer experience at a B2B data pipeline company processing multiple TBs of data.

  • Led infrastructure for B2B data pipeline platform: AWS architecture, CI/CD pipelines, observability, and developer tooling
  • Reduced CI pipeline duration by 50% (from 70 to 35 min) and eliminated flaky test noise through test parallelization, Docker optimizations, and intelligent retry mechanisms
  • Unified fragmented monitoring tools into single Datadog platform, building standardized dashboards that enabled all engineers to self-serve debugging
  • Established Terraform standards and practices that scaled infrastructure ownership from sole SRE to full engineering team
AWS Datadog Terraform CI/CD

CTO

Limetree Jun 2012 – Oct 2014 · 2 yrs 5 mos

Designed architecture and led development of mobile platform for family memory sharing. Built backend, frontend, and mobile apps; managed AWS infrastructure and deployment automation.

Founder & CEO

MyOut Jan 2011 – May 2014 · 3 yrs 5 mos

Built a cultural events aggregator for Portugal. Developed the full stack, including Elasticsearch-powered search, Redis caching, and web crawlers.

Education

Master, Information Systems & Computer Engineering

Instituto Superior Técnico 2010 – 2012

Master, Computer Science (Erasmus)

Utrecht University 2011

Contact

Lisbon, Portugal

diogo@dfigueiredo.cc

linkedin.com/in/diogofigueiredo