Site Reliability Engineer (SRE) Sunderland - Hybrid
Fancy being our next SRE Superstar? 🚀
Site Reliability Engineer (SRE) | Sunderland (Hybrid) | Full-time
Alright, listen up! Here at Tombola, we're not just about bingo – we're about brilliant tech, seamless experiences, and keeping millions of players happy. And to do that, we need a Site Reliability Engineer who's as excited about rock-solid systems and clever automation as we are about winning lines!
So, what's this ace role all about? You'll be the wizard behind the curtain, ensuring our critical systems are always reliable, available, and performing like a dream. We're talking about implementing smart automation, sharp monitoring, and super-speedy incident response strategies to keep everything running smoothly. You'll be working hand-in-hand with our dev, infra, and security teams, making sure we balance exciting new features with unbeatable stability.
What you'll be getting up to:
- System Reliability & Availability Hero: You'll be the guardian of our uptime, making sure our critical systems are always available and hitting those all-important SLAs. You'll also be leading the charge on incident management, getting to the bottom of any issues and making sure we learn from them.
- Monitoring & Alerting Maestro: Setting up and maintaining top-notch monitoring systems (like Dynatrace) will be your jam. You'll craft alerting systems that give us a heads-up before problems even get a chance to impact our players, and you'll define key metrics to measure system health.
- Incident Response Ace: When things get a bit wobbly, you'll be on the front lines, resolving incidents fast to minimise downtime. After the dust settles, you'll lead the root cause analysis to prevent similar issues from popping up again.
- Automation Whizz: Got a repetitive task? You'll be the one automating it away! From environment setup to configuration, you'll be using tools like Terraform, Git, and TeamCity to streamline everything and build slick CI/CD pipelines.
- Capacity Planning Pro: You'll ensure our systems can effortlessly scale to meet demand, optimising resource usage so we're always efficient and ready for anything. You'll be forecasting future needs to keep things performing perfectly.
- Performance Optimiser: You'll be constantly poking and prodding our systems, tuning databases, improving response times, and making sure everything runs at peak performance. Plus, you'll be running load and stress tests to ensure we can handle even the busiest periods.
- Infrastructure Guru: You'll be bossing our AWS cloud resources, making sure they're properly scaled, cost-effective, and resilient. And yes, you'll be crafting disaster recovery plans so we're ready for any curveballs!
- Collaboration King/Queen: You'll be working hand-in-hand with our awesome development teams, making sure new features are built with reliability in mind. You'll champion service ownership and provide valuable feedback to keep improving our operational success.
- Security & Compliance Captain: Keeping things safe is a big deal here. You'll be weaving security best practices into our infrastructure and making sure we're always playing by the rules and protecting our production environments.
- Documentation Dynamo: If you build it, you'll document it! Clear, concise docs for all our infrastructure, procedures, and runbooks are key.
- Continuous Improvement Enthusiast: You'll always be on the lookout for new tech and better ways of doing things, constantly pushing us to improve system reliability, performance, and efficiency.
Sound like a bit of you?
If you're an experienced SRE with a passion for building reliable, scalable, and efficient systems, and you love working in a fun, collaborative environment, then we want to hear from you!
Ready to join the Tombola family and help us build even more amazing things? Apply now!
- Department: Technology
- Locations: Sunderland, UK
- Remote status: Hybrid
