DevOps/SRE Engineer

Paris /
Engineering – Infrastructure /
/ Hybrid
About you

We offer you to join our Ops team in order to cover regular operations and architecture, at the heart of the Diabolocom service. Ops team participates in the deployment and operational maintenance of our SaaS platforms, on a Hybrid Cloud basis.

The Diabolocom Hybrid Cloud is composed of on-premise infrastructure distributed amongst 5 dedicated colocations around Paris and Germany and public clouds (AWS / GCP / OVH).
Offering solutions with a high level of security and availability, and certified environments, notably PCI-DSS and HDS (health data), ops team is responsible for making sure everything runs smoothly, in a scalable and secure way, 24/24, 7/7.

In a context of strong development, Diabolocom is working on the evolution of its infrastructure, from proxmox-based infrastructure using VMs deployed via Ansible, to a Kubernetes oriented architecture deploying applications with ArgoCD. We are looking for people able to maintain legacy infra while it’s being migrated, but also able to architect and build the future with new tools and state-of-the-art technologies and methodologies.

Our legacy infrastructure still has some interesting features, including full IaC management. Among other things, our servers and VM are booted over PXE on self-generated, fully IaC images generated with our Ansible playbooks.

At Diabolocom, you will:

Co-architect new infrastructure to contribute to its evolution and migration to kubernetes
Develop PoCs to test and validate new decisions
Be in charge of the infrastructure, both legacy and new one, and its global vision
Be interested in the evolution of standards to advise and propose upgrade plans
Maintain, upgrade and secure existing and already deployed infrastructure
Be using modern tools and good practices to deploy a scalable and reproducible infrastructure without SPoFs

What we are looking for:

You are curious, always a source of proposals
You are on the lookout for new technologies and are always ready to learn new things
You master Ansible, docker and other standard tools used everyday by our ops team
You are experienced with bash and you know your way around a linux-based OS (debian)
You are familiar with some of the tools or technologies listed below (we don’t expect anyone to master everything of course)
You have an appetite to optimize and automate systems
You have the security of the infrastructure in mind
You are fluent in English, both written and spoken
You have excellent interpersonal skills and are not afraid to solve problems
You are autonomous and able to tackle problems by yourself

What’s in it for you:
- You’ll have a three-week onboarding to get to know our product, our teams and our culture!
- You’ll have the chance to work in a multicultural environment with colleagues coming from 5 different countries and 10 different nationalities 🇨🇵 🇩🇪 🇪🇦 🇮🇹 🇬🇧
- You’ll have first choice IT equipment 
- Our office is based right in the heart of Paris, 100 meters from the Opera Garnier!
- Remote friendly: up to 2/3 days per week
- Lunch vouchers: Swile card
- Team building events : Athens, Meribel, Ibiza, Porquerolles… what’s next? 😎
- Be part of a company at a key moment of its growth, with lots of opportunities 🚀
- You’ll be lucky to join a team that is big enough to thrive but also small enough to actually have an impact on the success of the organisation!
- A context to work in where your ideas are listened to and valued, and in which you can easily contribute and make a difference
- We offer opportunities to learn and grow

The technical scope of the Ops teams:
IaC scripting and automation under Ansible, bash and Python
IaC scripting for the modern kubernetes environment with ArgoCD and helm
Monitoring and observability of deployed infrastructure with Netdata, Prometheus and Grafana
Interact with dev and QA teams to provide tools and infrastructure evolution to fit their needs
Advise them in CI/CD and Kubernetes usage
Daily day 2 operation on existing infrastructure
Incident detection and resolution with thorough post-mortem analysis to prevent the class of incidents in the future

Our technical stack:

Kubernetes with ArgoCD and Helm
nginx ingress controller, HAProxy
Ansible (heavily used), terraform
Proxmox cluster, debian-based VM booting with PXE
Netdata, Prometheus, Victoriametrics, Grafana
docker and docker-compose
Gitlab and Gitlab-CI
Various deployed services, including: sentry, harbor, cert-manager, renovate, openLDAP, external-DNS, powerdDNS, bind9, dhcpd, loki, rsyslog, mysql, authentik, ISC DHCPD, RabbitMQ, Hashicorp Vault, passbolt, openreplay, weblate, …
Bare metal infrastructure (90% of the load) and cloud-based (AWS)
Slack, G Suite
Small python and bash scripts

Recruitment process:

Test challenge
Intro call with a recruiter
Tech interview with the DevOps/SRE team
Tech interview with Sergey, our CTO
Interview with David, our COO

At Diabolocom, diversity and inclusion are in our DNA. All qualified applicants will receive equal consideration for employment without regard to color, language, religion, sex, sexual orientation, gender identity, national or social origin, opinion disability, age.