Machine Learning Platform / Backend Engineer
Belgrade / Timișoara
R&D – Engineering - Central ML /
Full Time Permanent /
Hybrid
Everseen: A leader in vision AI solutions for the world’s leading retailers.
The Role
We are seeking a Machine Learning Platform/Backend Engineer to design, build, and maintain scalable infrastructure that empowers our data scientists and machine learning engineers to develop, train, benchmark, and monitor machine learning models efficiently. You will be instrumental in shaping our internal Machine Learning Platform and driving automation, reproducibility, and performance across the machine learning lifecycle.
As part of this role, you will own the design and implementation of the internal ML platform, enabling end-to-end workflow orchestration, resource management, and automation using cloud-native technologies (GCP/Azure).
You will also design and manage Kubernetes-based infrastructure for multi-tenant GPU and CPU workloads with strong isolation, quota control, and monitoring along with integrating and extending orchestration tools (Airflow, Kubeflow, Ray, Vertex AI, Azure ML or custom schedulers) to automate data processing, training, and deployment pipelines.
You will work to develop shared services for model behavior/performance tracking, data/datasets versioning, and artifact management (MLflow, DVC, or custom registries) and have a clear focus on building out docuemtnation in relation to architecture, policies and operations runbooks.
What you'll do
- Teaching and Sharing Culture:
- Share skills, knowledge, and expertise with members of the data engineering team.
- Foster a culture of collaboration and continuous learning by organizing training sessions, workshops, and knowledge-sharing sessions.
- Design and Development:
- Collaborate and drive progress with cross-functional teams to design and develop new features and functionalities.
- Ensure that the developed solutions meet project objectives and enhance user experience.
- Influence and Decision-Making:
- Have influence over the technology stack and internal technical improvements, contributing to strategic decision-making.
- Coding:
- Based on requirements and a longer-term product and feature strategy, design and implement reusable, testable, efficient, and elegant code.
- Ensure adherence to coding standards and best practices.
- Testing:
- Create, maintain, and run unit tests for new and existing applications and services.
- Aim to deliver defect-free and well-tested solutions.
- Data Analysis:
- Analyze and collect data from various sources such as log files, application stack traces, and thread dumps.
- Utilize data analysis to identify trends, patterns, and potential areas for improvement. Based on this, begin to implement changes.
- Continuous Integration and Continuous Deployment (CI/CD):
- Create and maintain CI/CD integration using various tools.
- Automate the build, test, and deployment processes to ensure efficiency and reliability.
- Integration of Third-Party Solutions:
- Research and propose third-party software solutions to optimize system performance.
- Expand product capabilities by integrating compatible third-party solutions.
- Monitor update and tracking of third-party solutions' compatibility with Everseen stack according to internal development guidelines
- Monitoring and Troubleshooting:
- Monitor production logs to identify and troubleshoot issues promptly.
- Ensure seamless operation and timely resolution of any anomalies to maintain system reliability.
- Documentation:
- Responsible for creating, reviewing, and maintaining high-quality technical documentation to ensure clarity, consistency, and knowledge sharing within the development team.
Collaborate with
- AI/ML Engineering team
- Data Engineering team
- Software Development Engineers
- DevOps team
- Product Managers
- Security & Compliance Teams
Profile and Skills
- 4-5+ years of work experience in either ML infrastructure, MLOps, or Platform Engineering
- Bachelors degree or equivalent focusing on the computer science field is preferred
- Excellent communication and collaboration skills.
- Technical Skills:
- Expert knowledge of Python
- Experience with CI/CD tools (e.g., GitLab, Jenkins). Hands-on experience with Kubernetes, Docker, and cloud services.
- Understanding of ML training pipelines, data lifecycle, and model serving concepts
- Familiarity with workflow orchestration tools (e.g., Airflow, Kubeflow, Ray, Vertex AI, Azure ML).
- A demonstrated understanding of the ML lifecycle, model versioning, and monitoring.
- Experience with:
- - ML frameworks (e.g., TensorFlow, PyTorch)
- - GPU orchestration (e.g., NVIDIA GPU Operator, MIG),
- - Infrastructure as Code (e.g., Terraform).
- - Data engineering tools (e.g., Snowflake, Databricks, BigQuery, Airbyte, Kafka)
- - Familiarity with feature stores and model registries. Exposure to large-scale distributed systems and performance optimisation.
- Ability to work with Linux systems, including troubleshooting skills such as log investigations, performance testing, and connectivity investigation.
- Possesses a deep understanding of technical concepts and terminology relevant to Everseen's products and services.
- Expert knowledge of advanced concepts like microservices and distributed systems, indicating an understanding of modern software development architectures.
- In-depth knowledge of Azure Kubernetes Services for container orchestration, Azure Blob Storage for data storage, and ElasticSearch for search and analytics.
- Ability to leverage cloud computing technologies and services for testing and validation purposes.
- In-depth knowledge of cloud security, scalability, and performance optimization principles.
- Excellent understanding of cloud computing technologies and services, including infrastructure as a service (IaaS), platform as a service (PaaS), and software as a service (SaaS).
- Broad understanding of the software engineering and architecture space, including knowledge of various programming languages, frameworks, techniques, and industry trends in AI.
Additional Skills
- Interest in Learning and Growth Mindset:
- Demonstrated interest in learning and a strong desire to expand knowledge in their respective field.
- Curiosity to explore new technologies, methodologies, and best practices to enhance skills and capabilities.
- Results-oriented attitude, with a drive to achieve objectives efficiently.
- Analytical and Problem-Solving Skills:
- Possesses strong analytical and problem-solving abilities, leveraging data to inform product decisions. This skill is essential for identifying market opportunities, optimising product features, and addressing challenges effectively.
About Everseen
Everseen is a leader in vision AI. We are transforming business operations for global retailers, driving measurable business value and improving the customer experience.
We are a dedicated team of inventors, research scientists, engineers, AI experts and retail industry veterans. Our mission is to protect people, process, products and profitability within the retail sector and beyond.
We are trusted by major food, drug, mass, and specialty retailers around the world— including Kroger, Meijer, and Woolworths—we also partner with leading hardware, AI, and cloud computing providers such as Google, NVIDIA, NCR, and Dell.
We are operationalizing vision AI at an unprecedented scale with the largest global footprint of edge AI powered Computer Vision in Retail.
Founded in 2007 and headquartered in Cork, Ireland, Everseen has over 900 employees globally, with a European headquarters in Cork, Ireland, a U.S. headquarters in Miami, and hubs in Romania, Serbia, India, Australia, and Spain.
Key Numbers
Top 11
Trusted by 11 of the top 20 global grocery retailers
120,000+
Edge AI Endpoints Worldwide
+3x ROI
Delivering Market's best ROI
Our Commitment
Everseen is committed to creating an environment where everyone can succeed. Our employees should feel a sense of belonging, have an opportunity to grow their careers, and feel free to be their most authentic selves. Everseen takes great pride in the diversity of its global workforce, and insists upon a safe, inclusive workplace where our differences are our collective strength. We treat each other with dignity, and respect, and require all employees, officers, and directors to seek to understand the importance and value to Everseen of diversity, and inclusion.
Everseen is committed to creating a safe environment for all employees and has a zero tolerance policy for bias and discrimination of any kind. Our work environment is one without offensive, hostile, or intimidating conduct, whether verbal, written or physical, in nature. Everseen will not tolerate prejudice or discrimination of any kind including without limitation, where based on aspects such as, race, colour, sex, gender, religion, age, family status, disability of any kind, sexual orientation.
