Rill Data

Site Reliability Engineer - Remote (North America)

Posted 8 Days Ago
Remote
Senior level
The Site Reliability Engineer will ensure high availability in cloud-hosted environments, automating deployment processes, setting up monitoring systems, and implementing cloud architectures across AWS, GCP, and Azure, while managing service-oriented architectures and maintaining high-scale distributed systems.
The summary above was generated by AI

As a Site Reliability Engineer, you will be responsible for maintaining high availability of production and non-production work environments. You will also automate the manual tasks involved in developing and deploying code and data, implementing continuous integration and continuous deployment frameworks.

Responsibilities

  • Our product is a cloud-hosted, workspace application for business intelligence, enabling users and organizations to create, edit, and share data tables and dashboards powered by our Druid database engine.

  • As such, your responsibilities would include:

  • Creating state-of-the-art technical architectures with automation to make complex product delivery easy (or at least easier) 
  • Setting up monitoring systems with the “three pillars of observability”: metrics, tracing, and logs
  • Implementing cloud architecture in GCP, AWS, and Azure 
  • Maintaining and managing the whole deployment stack

Qualifications

  • Maintaining a streaming analytics pipeline service requires an impossible breadth of knowledge and experience. No individual will have all of it, so you will be expected to learn some of it on the job.

  • Recognizing this, here are some of the key qualifications we seek in a successful candidate for this role:


  • 5+ years of experience, ideally for an Enterprise SaaS company in the infrastructure, analytics, and/or data space
  • The ability to think through requirements to determine high-impact solutions to problems 
  • Knowledge of cloud infrastructure (AWS, GCP, Azure) and cloud data warehouses (BigQuery and Snowflake), in order to deliver end-to-end cloud infrastructure engagements that include assessment, design, deployment, and migration
  • Experience building and maintaining high-scale distributed systems in a service-oriented architecture, ideally with tools such as GCP, Pulumi, Kubernetes, Docker, etc. 
  • Experience working with infrastructure technologies such as Kubernetes, Terraform, Helm. 
  • Experience with continuous integration/continuous delivery systems using tools such as GitHub Actions, Travis, Argo, or Jenkins 
  • Experience with one or more general purpose programming languages like Python, Golang, etc. 
  • Experience setting up observability and monitoring tools like Datadog, Prometheus, Grafana, etc.
  • Hands-on experience in hardening infrastructure for security, performance, compliance, and regulatory requirements

About Rill

  • Rill makes it easy to create and consume metrics by combining a SQL-based data modeler, real-time database, and metrics dashboard into a single product: a simple alternative to complex BI stacks. Our thousands of users love Rill for the "magical" experience of our real-time, interactive (and easy-to-use!) dashboards. Founded at the start of Covid, Rill is a remote-first company that values human connection. Our team is truly global, with co-founders in the Bay Area and India and the rest of the team spread across the US, Europe, and Asia.

We believe that having a team of diverse backgrounds and voices working together will enable us to create innovative products that improve the way people live and communicate. We are proud to be an equal opportunity employer, and committed to providing employment opportunities regardless of race, religious creed, color, national origin, ancestry, physical disability, mental disability, medical condition, genetic information, marital status, sex, gender, gender identity, gender expression, pregnancy, childbirth and breastfeeding, age, sexual orientation, military or veteran status, or any other protected classification, in accordance with applicable federal, state, and local laws. If you have a disability or special need that requires accommodation, please let us know.

Top Skills

Go
Python
