Site Reliability Engineer (SRE) – Platform Infrastructure Team
Location: Remote (EU time zone)
Start Date: ASAP
About the Company
Our client is a
- growing, fully independent Saa
S product company with a portfolio of 10+ B2C digital products — and more on the way. They build their own products, use them internally, and scale them globally. No external clients, no investors, and no startup risk.
The team includes 90+ professionals working remotely from over 20 countries across Europe and the Middle East. In 2025, the company plans to launch up to 10 new products beyond the MVP stage.
About the Role
Our client is looking for a Site Reliability Engineer (SRE) to join their Platform Infrastructure team.
In this role, you’ll take ownership of reliability, scalability, and observability across their systems. You’ll work closely with Dev
Ops and product engineers to ensure smooth deployments, robust CI/CD pipelines, and a secure, highly available infrastructure that performs under high load.
Responsibilities
Design, implement, and maintain secure, scalable infrastructure on AWS
Improve system observability (metrics, logs, traces, alerts)
Define and implement SLIs/SLOs across services
Enhance and manage Kubernetes clusters and
- environment deploymentsCollaborate with product teams to ensure apps (Next. js, Nest
JS) are
- readySupport and improve CI/CD pipelines (Git
Hub Actions, Infra as Code, automation)Contribute to incident response,
- mortems, and capacity planningBuild automation for
- healing and recovery of systemsPromote and implement SRE best practices across the company
Requirements
3+ years in SRE, Dev
Ops, or Platform Engineering rolesStrong experience with AWS and Kubernetes in production environments
Good understanding of modern web app stacks (Next. js, Nest
JS is a plus)Deep expertise in monitoring, alerting, and observability tools
Experience with SLIs/SLOs definition and management
Hands-on experience with
- load,
- throughput systemsProficiency with Terraform or similar Infrastructure-as-Code tools
Scripting skills to automate manual processes
Solid troubleshooting mindset and strong system thinking
Familiarity with CI/CD tooling (e. g. , Git
Hub Actions)Team-oriented,
- friendly mindset
Nice-to-Have
Experience working in
- paced startup environmentsExposure to
- product Saa
S platforms
Key Performance Indicators
Service Uptime
MTTR / MTBF
Monitoring Coverage
What our client offer
22 paid vacation days + public holidays based on your country
Fully remote role with flexible working hours (7:00–18:00 GMT+2)
Annual performance bonus
Sponsored upskilling and development
Annual company retreats in Europe (fully covered)
High-caliber,
- level teamNo startup chaos — only focused, scalable product work
Ready to engineer systems that power a growing portfolio of B2C products? Apply now!
- Informații detaliate despre oferta de muncă
Firma: OnHires Localiția: Bucureşti
Bucharest, Bucharest, RomaniaAdăugat: 13. 5. 2025
Postul de muncă activ
Fii primul, care se va înregistra la oferta de muncă respectivă!