Job Description
Are you an experienced developer or DevOps engineer? Do you want the freedom to work remotely and want to grow in the new field of site reliability at an internationally successful software and education company? Well, than take our reliability to the next level as part of our Site Reliability Engineering team :)
*** Please note ENGLISCH and GERMAN is a MUST on this position. Please do not apply if you do not speak both languages ***
Who is Digistore24?
We are one of the fastest-growing tech companies in Europe.
What drives us? We shape the digital future! Our mission is to empower people with our software and expertise to share their knowledge online, enabling them to fulfill their dream of an own business. As a result, millions of people gain access to information that helps them reach their goals. To keep pace with our growth, we aim to expand our teams sustainably. We emphasize working with experts and strong personalities who share our values – regardless of their location.
Your new dream job
- Automation and Infrastructure as Code (IaC): You automate repetitive tasks, deployments, and system management to reduce human error and improve efficiency. This might involve creating scripts, CI/CD pipelines, or automating infrastructure provisioning.
- Reliability and Performance Optimization: You continuously improve the system uptime by identifying bottlenecks and optimizing system architecture.
- Capacity Planning and Scaling: You assess and predict system resource requirements (CPU, memory, storage) to ensure the infrastructure can scale with increasing demand. Implement auto-scaling solutions to handle load spikes without human intervention, ensuring systems remain performant under various conditions.
- System Monitoring and Incident Response: Continuously monitor system performance, uptime, and reliability using tools like Prometheus, Grafana, or ElasticSearch. The goal is to detect and respond to issues before they impact users. Manage and respond to incidents, outages, and failures quickly, aiming to minimize downtime. This includes managing incident documentation, communication, and post-incident analysis.
Incident Postmortems and Continuous Improvement: Conduct root cause analysis (RCA) after incidents to identify what went wrong and how to prevent similar issues in the future. Implement fixes, improvements, and best practices based on learnings from postmortems to increase system reliability and reduce future incidents.
Your benefits at Digistore24
You will play a crucial role in shaping our cutting-edge projects in our collaborative work environment – while enjoying flexibility in working time and location.
- Work in our partner's coworking spaces or in your home office, as long as you can guarantee uninterrupted internet access
- Regular further education
- The stability of an extremely successful German high-tech company that is funded by its successful product and not by investors
- Outcome focused teams and a culture of direct feedback
- Modern equipment: Thinkpad or MacBook
- International, collaborative team with strong cohesion
- Spectacular team events in various European countries
- Autonomy from day one
- Contribution to the retirement scheme
- Work in your team on a first-name basis, without a dress code, and at eye level
- Flexible working hours from Mondays to Fridays (core working hours from 10AM to 4PM)
Requirements
Your new dream job at Digistore24
Our values
Please take a REALLY close look at the values. Are you ready to live them?