Site Reliability Engineer


Location
Hybrid working (in the UK), with occasional in-person working in London
Salary Range
£70-90k per annum
Deadline


Newton’s Tree is searching for a highly skilled Site Reliability Engineer (SRE) to join our early-stage startup, which has proven product-market fit and exceptional growth prospects. This is an opportunity to shape your role as we grow. We are looking for people as adventurous as we are, who flourish in a dynamic environment.

Your role will be to ensure the reliability, scalability, and performance of the Newton’s Tree AI platform at our hospital sites and in the cloud. Reporting to the VP of Engineering, your primary focus will be creating a robust, scalable, and secure platform that enables healthcare organisations to leverage AI technologies effectively. This role combines software engineering and systems administration to automate and improve our infrastructure and services.

Typical responsibilities and duties

System Integration: work with hospitals to install the hardware that runs the Kubernetes clusters hosting our platform and our clients' AI applications and ETL pipelines.

Observability: implement monitoring, alerting, and incident response processes.

Force Multiplication: collaborate with development teams to ensure new features are deployed swiftly, confidently, and reliably.

Continuous Learning: perform root cause analysis of production issues and learn from them, so that our processes and tooling improve in response to our challenges and failures.

Testing and Quality Assurance: implement testing strategies, uptime monitoring, and alerting to ensure the platform's reliability, accuracy, and functionality. This also involves troubleshooting and debugging issues reported by users and ensuring timely resolution.

Documentation and Knowledge Sharing: create comprehensive technical documentation, including incident response processes, infrastructure diagrams, and troubleshooting playbooks.

Qualifications and skills

Education/experience: A BSc or MA in computer science, engineering, or a related field, or equivalent experience, is essential. A Ph.D. focused on AI, machine learning, or healthcare informatics, or extensive experience in these fields, is desirable.

Tech stack: Demonstrated experience managing Linux and cloud platforms. Experience with GitOps and with infrastructure-as-code tools such as Terraform and Ansible. Knowledge of containerisation and orchestration technologies (Docker, Kubernetes). Familiarity with monitoring tools (Prometheus, Grafana, Datadog) and log management (ELK stack, Splunk). Understanding of networking concepts (DNS, load balancing, TCP/IP).

Data & ML/AI: Experience working with data standards in healthcare (DICOM, HL7, FHIR) would be highly desirable, as we will be integrating with PACS and EHR systems in hospitals and will need to solve infrastructure problems in this space.

Security: Working experience of security best practices and of implementing security measures such as encryption, authentication, and authorisation mechanisms is essential, as is knowledge of data privacy regulations and compliance frameworks such as HIPAA, SOC 2, and GDPR.

Agile/ways of working: Demonstrated skills in cross-functional planning, continuous delivery, testing and automation are essential.

Examples of our current tools: FluxCD, Argo Workflows, Cilium CNI, Talos Linux, Teleport, Dell iDRAC, HP iLO

Compensation & benefits

  • £70-90k per annum, depending on experience and skills
  • Hybrid working (in the UK), with occasional in-person working in London
  • Participation in shared company bonus scheme
  • Health and wellbeing package
  • Professional development opportunities
  • 28 days holiday (+ 8 statutory bank holidays)

Our interview process

We aim to complete the interview process in three sessions:

  • Technical depth, focused on ways of working, system design/architecture, planning & delivery/testing, and other facets of day-to-day Reliability Engineering
  • Product and domain, focused on how products get built and digging into the problem domain, how you understand the product, and working in the healthcare technology sector
  • Culture and leadership, focused on expectations of the role, personal qualities, handling difficult situations, and how you work with others

About Newton’s Tree

Newton’s Tree is a health tech startup whose enterprise AI platform enables healthcare providers to select, test, deploy, and monitor third-party AI applications as part of routine care pathways.

We believe that delivering healthcare sustainably requires radically reimagining how it is delivered, through the large-scale adoption of safe and effective AI technologies.

The company is led by a founding team with globally unique experience at the nexus of healthcare, AI technology, and cutting-edge research. We work with leading healthcare organisations that are bending the adoption curve for AI. We partner with AI developers and innovators across the globe throughout the product lifecycle to enable the development and deployment of the very best technology.

How to apply

Send your CV and a short note telling us why you’d be excited to work with Newton’s Tree to info@newtonstree.com.

No recruiters or agencies, please.
