Zelis

SRE Golden Signals Lead

Posted 3 Days Ago
Remote
5 Locations
Senior level
Lead the observability strategy at Zelis, focusing on defining monitoring practices, enhancing system reliability, and collaborating with engineering teams to improve operational efficiency.
The summary above was generated by AI

At Zelis, we Get Stuff Done. So, let’s get to it!

A Little About Us
Zelis is modernizing the healthcare financial experience for all by providing a connected platform that bridges the gaps and aligns interests across payers, providers, and healthcare consumers. This platform serves more than 750 payers, including the top 5 national health plans, BCBS insurers, regional health plans, TPAs and self-insured employers, and millions of healthcare providers and consumers. Zelis sees across the system to identify, optimize, and solve problems holistically with technology built by healthcare experts—driving real, measurable results for clients.

A Little About You
You bring a unique blend of personality and professional expertise to your work, inspiring others with your passion and dedication. Your career is a testament to your diverse experiences, community involvement, and the valuable lessons you've learned along the way. You are more than just your resume; you are a reflection of your achievements, the knowledge you've gained, and the personal interests that shape who you are.
 

Position Overview

Zelis is seeking a strategic and results-driven SRE Golden Signals Lead to define and drive the observability roadmap across all platforms. This role is responsible for establishing a consistent approach to monitoring and alerting, leveraging golden signals to enhance system reliability and operational efficiency. The SRE Golden Signals Lead will work closely with the Enterprise SRE team, engineering leads, and India-based resources to build a unified observability strategy and ensure alignment with organizational goals.

What You’ll Do

Observability Roadmap Development:

  • Define a unified vision for observability across all platforms, focusing on golden signals as the foundation for monitoring and alerting.

  • Develop and maintain a comprehensive roadmap to improve observability, reduce tool redundancy, and align practices across platforms.

  • Establish key performance indicators (KPIs) to measure progress and ensure accountability for roadmap milestones.

Collaboration and Alignment:

  • Partner with the Enterprise SRE team and engineering leads to break down silos and establish consistent observability practices.

  • Drive cross-platform collaboration to reduce operational inconsistencies and define a "north star" approach for observability.

  • Facilitate knowledge sharing to ensure teams are aligned on current and future observability initiatives.

Monitoring and Alerting:

  • Standardize the implementation of golden signals across all applications to improve system reliability and incident detection.

  • Optimize alerting tools and reduce the number of redundant or ineffective dashboards and consoles ("panes of glass").

  • Lead efforts to enhance observability while minimizing the operational burden on platform teams.

Operational Support and Improvement:

  • Identify and address gaps in current observability practices, prioritizing long-term scalability and reliability.

  • Collaborate with India-based resources to execute the observability build-out, ensuring efficiency and quality.

  • Reduce the number of issues raised by clients, providers, and print facilities through proactive monitoring improvements.

Reporting and Continuous Improvement:

  • Track and maintain service levels across environments.

  • Measure and report on observability success metrics, including the number of actionable alerts and reduced issue escalations.

  • Continuously evaluate and refine observability strategies based on feedback and evolving organizational needs.

What You’ll Bring to Zelis

  • 5+ years of experience in Site Reliability Engineering, DevOps, Production Support, or a similar role with a focus on observability.

  • Experience designing and implementing monitoring and alerting solutions across complex IT environments.

  • Experience with and understanding of SRE principles and golden signals for system monitoring.

  • Experience with observability tools such as Splunk, New Relic, or LogicMonitor.

  • Familiarity with cloud platforms (AWS, Azure) and containerization technologies (Docker, Kubernetes).

  • Strong leadership and collaboration skills, with the ability to align diverse teams toward common goals.

  • Excellent analytical and problem-solving abilities, with a focus on proactive solutions.

  • Clear and effective communication skills to convey technical concepts to stakeholders at all levels.

Preferred Skills:

  • Bachelor’s degree.

  • Experience building observability roadmaps and scaling solutions in enterprise environments.

  • Certifications in cloud or DevOps-related disciplines (e.g., AWS Certified DevOps Engineer, Kubernetes Administrator).

Location and Workplace Flexibility
We have offices in Atlanta GA, Boston MA, Morristown NJ, Plano TX, St. Louis MO, St. Petersburg FL, and Hyderabad, India. We foster a hybrid and remote-friendly culture, and all of our employees' work locations are based on the needs of the position and determined by the Leadership team. In-office work and activities, if applicable, vary based on the work and team objectives in accordance with Company policies.

Commitment to Diversity, Equity, Inclusion, and Belonging 
At Zelis, we champion diversity, equity, inclusion, and belonging in all aspects of our operations. We embrace the power of diversity and create an environment where people can bring their authentic and best selves to work. We know that a sense of belonging is key not only to your success at Zelis, but also to your ability to bring your best each day.

Equal Employment Opportunity  
Zelis is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. 

We encourage members of traditionally underrepresented communities, including women, LGBTQIA people, people of color, and people with disabilities, to apply, even if you do not believe you fit 100% of the qualifications of the position.

Accessibility Support 

We are dedicated to ensuring our application process is accessible to all candidates. If you are a qualified individual with a disability or a disabled veteran and require a reasonable accommodation for any part of the application and/or interview process, please email TalentAcquisition@zelis.com.

Top Skills

AWS
Azure
DevOps
Docker
Kubernetes
LogicMonitor
New Relic
Site Reliability Engineering
Splunk
