Site Reliability Engineer

Permanent employee, Full-time · Remote

Your mission
  • Monitoring: contribute to the improvement of the monitoring and measurement systems that support our operational scale and continuous delivery. This goes from setting up and maintaining the right tools, to help the different engineering teams on the correct instrumentation of their code;

  • Availability: work to measure and increase the mean-time-between-failures and decrease the mean-time-to-repair of public-facing systems;

  • Operations: help the engineering team to operate their systems;

  • Performance, Efficiency & Latency: contribute to the measurement techniques that assist in the performance tuning of the applications stack by recommending and implementing performance improvements, also leveraging monitoring systems to maintain application performance at acceptable levels;

  • Security & Risk: participate in the ongoing process to identify and mitigate risk on our systems, ensuring compliance requirements standards are met;

  • Capacity Planning: use our monitoring suite to advise on capacity requirements;

  • Engineering Tools: create and maintain tools that help engineering teams improve their day to day work.

Your profile
  • 2+ years of professional experience in Devops/SRE

  • Experience in containerization technologies (Docker, Containerd, Podman…);

  • Application performance monitoring (Grafana, Prometheus, Cloudwatch, Datadog…);

  • Application development experience with at least one programming language (Scala, Java, Go, Python...);

  • Experience managing systems with daily deployments that handle millions of requests;

  • Understanding that managing systems at scale requires end to end infrastructure tools and automation;

  • Broad knowledge of system administration, networking, databases, security, storage and performance, having expertise in at least one of these disciplines;

  • Experience aligning with the DevOps movement goals, in the sense that teams own the full cycle of the development process, from design to operation;

  • Has provided a positive contribution to both operations-focused and development-focused work;

  • Built and maintained cloud-based applications and infrastructure (AWS preferred);

  • Knowledge with security certifications, such as SOC2;

  • Worked with tools and frameworks for infrastructure automation;

  • Passion for and experience in best practices in systems operations tools and techniques.

Why us?
  • Competitive Salary. Check our salary calculator at https://www.codacy.com/careers

  • Comprehensive health insurance;

  • Generous learning and development budget;

  • Flexible holidays;

  • Flexible working hours;

  • Remote first work policy (work from anywhere!)

About us

Codacy’s vision is to enable everyone to craft software with confidence while focusing on impacting the world at the speed of thought.

Our DevOps Intelligence Platform includes four solutions that enable software development teams to achieve their full potential and give management teams visibility on their investment through Codacy Quality, Codacy SecurityCodacy Coverage and Codacy Pulse.

We're curious, funny, radically honest, yet kind, and thrive on collaboration and transparency. We're a team of highly dedicated and ambitious domain experts brought together by the mission to help development teams reach their full potential and are driven by having a worldwide impact on software development.  

We are looking forward to hearing from you!
Thank you for your interest in Codacy. Please fill out the following short form. 
Uploading document. Please wait.
Please add all mandatory information with a * to send your application.