Job Overview – Remote Senior Observability Engineer (Monitoring & Cloud Infrastructure):
Compensation: $120,000 – $160,000/year
Location: Remote Role (U.S. Based)
Schedule: Monday to Friday (Full-Time)
Join our client as a Remote Senior Observability Engineer (Monitoring & Cloud Infrastructure), leading the design and implementation of scalable observability solutions across modern cloud-native environments. In this fully remote role, you’ll collaborate directly with clients to build RED/Golden metric frameworks, enhance infrastructure visibility, and deliver insights using tools like Datadog, Splunk, and ELK. This is an ideal opportunity for engineers with strong scripting, cloud, and automation skills who thrive in dynamic, client-facing environments.
Responsibilities as the Remote Senior Observability Engineer:
- Client Engagement & Solution Design: Collaborate with clients to define observability needs and deliver customized monitoring solutions that support performance and reliability.
- Observability Frameworks: Design and implement systems using RED/Golden metrics to monitor distributed applications and infrastructure.
- Scripting & Automation: Create scripts in Python, Go, or Bash for automation, instrumentation, and data validation.
- Cloud & Container Platforms: Support observability across Kubernetes, AWS, Azure, GCP, and hybrid environments.
- Monitoring Tools & Integration: Deploy and manage tools like Datadog, ELK, or Splunk while integrating observability into CI/CD pipelines.
- Insights & Optimization: Use statistical methods to validate data accuracy and fine-tune alerting for proactive system management.
Qualifications for the Remote Senior Observability Engineer:
- Education: Bachelor’s degree in Computer Science, Information Systems, or a related technical field required.
- Experience: 5+ years of hands-on experience in observability, DevOps, SRE, or infrastructure engineering, with a strong background in client-facing collaboration.
- Technical Skills: Skilled in scripting with Python, Go, or Bash, with deep experience in Kubernetes, Docker, and automation tools including Ansible, Chef, Puppet, and Salt.
- Industry Knowledge: Proficient in cloud infrastructure including AWS, Azure, or GCP, with strong experience using observability tools such as Datadog, NewRelic, AppDynamics, ELK, Splunk, or Dynatrace.
- Skills & Attributes: Strong communication and problem-solving skills, with the ability to translate abstract technical needs into actionable deliverables in collaborative environments.
Application Notice: Qualified candidates will be contacted within 2 business days of application. If an applicant does not meet the above criteria, Atlantic Group will keep your resume on file for future opportunities and may contact you for further discussion.