Remote Observability Engineer (Cloud & Monitoring Tools)

  • Location: Remote
  • Type: Perm (Contingency)
  • Job #47029
  • Salary: $100,000

Job Overview – Remote Observability Engineer (Cloud & Monitoring Tools):
Compensation: $100,000 – $140,000/year + bonus
Schedule: Monday to Friday (Remote)

Atlantic Group is hiring a Remote Observability Engineer (Cloud & Monitoring Tools) for our client. In this fully remote role, you’ll design and implement observability frameworks that enhance visibility, performance, and scalability across cloud environments. You’ll collaborate with clients to assess needs, translate requirements, and deliver results using tools like Datadog, Kubernetes, and AWS. Ideal for a hands-on engineer with strong technical and client-facing skills in cloud monitoring and infrastructure optimization.

Responsibilities as the Remote Observability Engineer:

  • Client Collaboration: Partner with customers to identify operational needs and deliver tailored observability solutions.
  • System Design: Develop observability frameworks using RED/Golden metrics to improve system performance and reliability.
  • Data Analysis: Validate and interpret metrics, logs, and traces to ensure accurate monitoring and actionable insights.
  • Automation: Use tools such as Ansible, Puppet, or Chef to automate observability processes and optimize infrastructure.
  • Tool Integration: Deploy and manage platforms like Datadog, integrating with ELK, Splunk, or AppDynamics for end-to-end visibility.

Qualifications for the Remote Observability Engineer:

  • Education: Bachelor’s degree in Computer Science, Engineering, or a related technical field.
  • Experience: 5+ years in cloud infrastructure, DevOps, or observability engineering with proven expertise in monitoring tools and delivering client-focused solutions.
  • Technical Skills: Proficient in Python, Go, or Bash scripting with hands-on experience in Docker, Kubernetes, and CI/CD pipelines, as well as automation tools such as Ansible, Puppet, and Chef.
  • Cloud & Observability: Skilled in AWS, Azure, and GCP environments with expertise in Datadog and familiarity with ELK Stack, Splunk, and AppDynamics.
  • Industry Knowledge: Strong understanding of SDLC, application instrumentation, and metrics-based performance monitoring.
  • Skills & Attributes: Analytical, solutions-driven professional with strong communication and problem-solving skills, adept at managing complex projects in a remote environment.

Application Notice: Qualified candidates will be contacted within 2 business days of application. If an applicant does not meet the above criteria, Atlantic Group will keep your resume on file for future opportunities and may contact you for further discussion.

Attach a resume file. Accepted file types are DOC, DOCX, PDF, HTML, and TXT.

We are uploading your application. It may take a few moments to read your resume. Please wait!

Job ID: 47029