Principal or Senior DevOps and Site Reliability Engineering

Engineering

Veszprém/Budapest, Hungary

REMOTE

https://www.strivacity.com/hungary/principal-or-senior-devops-and-site-reliability-engineering

We’re looking for an experienced Principal or Senior DevOps and Site Reliability Engineer to scale our infrastructure, reliability, and DevOps initiatives. The level of this role (Senior or Principal) will be determined based on your experience, leadership capabilities, and potential impact.

If you’re a seasoned engineer with a strong track record in automation, infrastructure at scale, and platform reliability—and you’re excited to shape strategy while staying hands-on—we want to hear from you.

What You’ll Do

Mentor and guide a small, high-performing team to foster a collaborative, results-driven culture.
Drive platform reliability by ensuring our multi-region, multi-instance infrastructure is secure, scalable, and fault-tolerant.
Support incident response and postmortems, helping the team understand service behavior, performance, capacity, and security posture.
Drive automation across infrastructure provisioning, deployment pipelines, and routine operational tasks.
Implement and test robust backup and disaster recovery systems that meet RPO and RTO objectives.
Ensure monitoring, alerting, and escalation workflows meet or exceed service level objectives.
Champion GitOps and Infrastructure-as-Code practices to enable safe, repeatable infrastructure changes.
Collaborate across teams including Engineering and Customer Success to ensure the platform supports customer needs.
Improve cost visibility and help optimize operational costs through modeling and analysis.
Support operational tools and services, including outage triage, upgrades, and developer assistance.

What You Bring

7+ years of experience in Software Engineering with a focus on DevOps or Site Reliability Engineering.
Deep experience with Infrastructure-as-Code, configuration management, and orchestration tools like Terraform, Ansible, Kubernetes, and Helm.
Proficiency with cloud platforms like AWS, GCP, or Azure.
Experience with observability and monitoring stacks (e.g., Prometheus, Grafana).
Strong scripting skills (e.g., Bash, Python, or similar).
A solid understanding of security best practices and operational excellence.
Clear, concise communication, backed by strong written, verbal, and presentation skills.
A passion for building scalable, resilient systems—and improving them continuously.

For any questions, please reach out to us directly at hello@strivacity.com
