US
0 suggestions are available, use up and down arrow to navigate them
What job do you want?

Apply to this job.

Think you're the perfect candidate?
Banner of Prestige Staffing company

Senior Observability Engineer – NS2JP00000386

Prestige Staffing Oak Hill, VA (Onsite) Full-Time
JobID: 51180

Senior Observability Engineer

Location: Remote

Job Summary:
We are seeking a skilled and experienced Senior Observability Engineer to join the Observability team. The ideal candidate will be responsible for improving our monitoring and alerting posture for Cloud Infrastructure. The role requires a strong understanding of observability tools and practices, with a focus on Prometheus, Grafana, Gardener Kubernetes, and Splunk. Experience with Dynatrace is a plus.

Key Responsibilities:
- Implement, manage, and improve monitoring solutions that use Prometheus, ensuring high availability and accurate alerting for our systems.
- Contribute to the development of observability strategies to improve our Cloud monitoring posture.
- Collaborate with development teams to integrate observability into the CI/CD pipeline and throughout the application lifecycle.
- Respond to and investigate incidents, providing thorough post-mortem analyses and implementing preventive measures.
- Stay current with the latest trends and best practices in site reliability and observability.
- Work with cross-functional teams to ensure system reliability, scalability, and performance.

Qualifications:
- Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent experience.
- Proven experience with observability tools such as Prometheus, Grafana, and Splunk.
- Hands-on experience with Kubernetes and container orchestration, preferably with Gardener Kubernetes.
- Familiarity with logging, monitoring, and application performance management (APM) tools; experience with Dynatrace is a plus.
- Strong understanding of cloud infrastructure, networking, and distributed systems.
- Excellent problem-solving and analytical skills, with the ability to work independently and as part of a team.
- Strong communication skills and the ability to work effectively with both technical and non-technical stakeholders.
- Experience with scripting and automation tools. (Python, Terraform, Ansible, etc.)

#ZR-PRO

Get job alerts by email. Join Our Talent Network!

Job Snapshot

Employee Type

Full-Time

Location

Oak Hill, VA (Onsite)

Experience

Not Specified

Date Posted

08/16/2025

Apply to this job.

Think you're the perfect candidate?