coverletter.tech
coverletter.tech

Site Reliability Engineer

Bucharest, Romania (hybrid)
Employee
System and Network Administration

The job requires a hybrid working mode: 2 days/week at the office

Office location: Bucharest

We are looking for a Site Reliability Engineer for our client, one of the leading online travel agencies. In this role you will be responsible for the design, prioritization and implementation of complex technical solutions. They can accurately estimate or forecast the effort and impact of the items they work on, and show a high quality of craft in what they deliver. They are expected to lead incident response for issues affecting their team. Senior SRE is expected to coach and mentor less experienced engineers and be a thought leader in their team ensuring best practices are being implemented.

Tasks

Building software applications

  • Is responsible to build software applications by using relevant development languages and applying knowledge of systems, services and tools appropriate for the business area and guide more junior members of the team in this topic.
  • Is responsible to refactor and simplify code by introducing design patterns when necessary and guide more junior members of the team in this topic.
  • Is responsible to ensure the quality of the application by following standard testing techniques and methods that adhere to the test strategy
  • Is responsible to write readable and reusable code by applying standard patterns and using standard libraries

Software Systems Design

  • Is responsible to evaluate possible architecture solutions by taking into account cost, business requirements, technology requirements and emerging technologies
  • Is responsible to describe the implications of changing an existing system or adding a new system to a specific area, by having a broad, high-level understanding of the infrastructure and architecture of our systems
  • Is responsible to help grow the business and/or accelerate software development by applying engineering techniques (e.g. prototyping, spiking and vendor evaluation) and standards

End to End System Ownership

  • Is responsible to own a service end to end by actively monitoring application health and performance, setting and monitoring relevant metrics and act accordingly when violated
  • Is responsible to reduce business continuity risks and bus factor by applying state-of-the-art practices and tools, and writing the appropriate documentation such as runbooks and OpDocs
  • Is responsible to reduce risk and obtain customer feedback by using continuous delivery and experimentation frameworks

Technical Incident Management

  • Is responsible to address and resolve live production issues by mitigating the customer impact within SLA
  • Is responsible to improve the overall reliability of systems by producing long term solutions through root cause analysis

Automation and toil reduction

  • Is responsible to ensure that infrastructure stays current by reducing technical debt, searching for bottlenecks and preparing for scaling
  • Is responsible to reduce cost of operations and maintenance by leveraging new technologies, automation, and partner with vendors to ensure we stay current
  • Is responsible to reduce human labour by writing small software features that address availability, scalability, latency and efficiency

Requirements

● Building Software Applications

● Software System Design

● End to End System Ownership

● Technical Incident Management

● Operations (Automation & Toil)

● Observability (Monitoring & Alerting)

● Critical Thinking

● Continuous Quality & Process Improvement

● Effective Communication

● Architectural Guidance

● Coaching & Mentoring

Benefits

  • Learning budget
  • Travel discounts
  • Discounted Gym
  • Discounted dental services
  • Free optometric consultation
  • Meal tickets
  • 25 days annual vacation
Updated: 2 days ago
Job ID: 11225882
Report issue

coverletter.tech

51-200 employees
Staffing and Recruiting

Next generation matchmaking - fast, accurate and 100% digital.

  1. Site Reliability Engineer