Kasada has an exciting opportunity to join our team responsible for defending our customers against fraud and malicious automation. Attackers use automated frameworks, fake devices, and device farms to launch highly costly attacks on our customers. As the leader of the Infrastructure & Operations (IO) team, you will lead a highly capable infrastructure engineering team in building a scalable cloud compute platform and the engineering foundations that help us defeat adversaries. A critical part of this role involves empowering other engineering teams and ensuring our high-traffic workloads are performant, resilient, and cost-efficient.
We are looking for a “Player-Coach” to lead our IO team and to scale by managing complexity through means such as automation, architectural patterns, and golden paths. While your focus may shift from time to time, you are expected to remain technical and hands-on while being the people leader and shaping the engineering foundations roadmap through product thinking.
What you will be doing:
- Culture fostering: Foster a culture of trust, psychological safety, and pragmatic continuous improvement
- Lead from the front: Beyond managing the backlog, you are expected to write code and demonstrate “What Good Looks Like” to the team (for example, in architecture, infrastructure code quality, and reliability). Kasada handles billions of requests every day and protects our customers from adversaries, and the infrastructure team plays a key role in making that happen
- Shape infrastructure direction: Shape the direction of infrastructure and engineering foundations, drive architectural decisions across our AWS and EKS platform, and provide architectural and tooling empowerment to other engineering teams, along with operational management of stateless and stateful systems such as RDS, Redis, Kinesis, Kafka, Kubernetes Operators, and Apache Flink
- Drive operational excellence: Own and evolve our on-call and incident response practices. All engineers at Kasada participate in on-call, and you'll help keep it sustainable and effective as Kasada grows
- Build scalable solutions: Partner with product engineering teams to unblock delivery and improve developer experience with a product mindset and customer empathy
What you'll bring:
- You're energized by collaboration, genuinely curious, and motivated by helping teams (your own and others) deliver with confidence
- You have led cloud infrastructure, DevOps, or SRE teams with empathy and great enjoyment, but you do not shy away from getting your hands dirty, whether it is writing glue code, debugging systems, or acting as the technical escalation point for critical outages
- You have 5+ years of hands-on experience in SRE, infrastructure, or DevOps roles, with demonstrated experience managing high-availability cloud infrastructure in production
- Strong grasp of production management and reliability engineering across AWS, Kubernetes, Linux, and cloud networking to support secure data movement across regions
- Firm believer in developer experience fundamentals such as CI/CD pipelines and Infrastructure as Code
- Product mindset to shape the infrastructure and engineering foundations roadmap
Additional bonus points if:
- You have experience working in “Platform Engineering” or “Engineering Foundation” teams that treat infrastructure as a product
- You have experience working in SOC 2 and PCI-compliant software environments
What you'll be working with:
- AWS, Kubernetes (Operators, CRDs, StatefulSets, etc.)
- Grafana, Prometheus/Thanos, Splunk
- TypeScript, Pulumi
- Buildkite, Argo Rollouts