Sr. Site Reliability Engineer Job at Global Soft Systems, Remote

  • Global Soft Systems
  • Remote

Job Description

Sr. Site Reliability Engineer

100% Remote

Long Term

Primary Purpose of Position

The Cloud Platform Engineer will architect, develop, and maintain Optum Serve's cloud environment in both the commercial and government clouds. The role will work closely with software engineers, architects, and DevOps engineers to design and maintain a secure, resilient, and high-performance cloud infrastructure.

Essential Functions Include:

Build, maintain, and operate IaaS and PaaS infrastructure in Azure commercial and government clouds

Work closely with dev teams to identify and measure SLOs, SLAs and SLIs

Act as a strong contributor to the development of platform services, including architecture, provisioning, configuration, deployment, and support

Perform integrations with central logging, metrics dashboards, instrumentation, incident monitoring and management

Build/integrate/administer systems and tools that enable engineering teams to observe their applications in production with autonomy (Dashboards, APMs).

Support software and/or cloud infrastructure on an on-call rotation basis

Assist with identifying and remediating technical problems at the root cause by continuously implementing automation, self-healing, and real-time monitoring for production systems

Maintain and improve operational tooling and frameworks

Build frameworks that test the performance and resiliency of our platform services/tools

Automate alerts for metrics on performance, cost, vulnerabilities, risk, and compliance violations

Improve processes and champion automation of manual support tasks.

Job Qualifications:

Requirements:

4+ years of experience working in a cloud engineer/SRE role

Expert knowledge of a cloud service provider

Expert knowledge of and hands-on production experience with Kubernetes (bare metal or managed) cluster setup and management required.

Experience with infrastructure as code (IaC) tools such as Terraform or Pulumi.

Experience with Kubernetes deployment tools such as Helm, ArgoCD, or Flux

Strong awareness of networking and internet protocols.

Understanding of identity and access management (IAM)

Experience supporting infrastructure in production cloud environments.

Knowledge of encryption and Public Key Infrastructure (PKI); understanding of OWASP

Experience working with RESTful services

Some experience with monitoring tools (Azure Monitor, Splunk, Dynatrace, Grafana, Prometheus).

Familiarity with IDEs and source control tools such as Visual Studio Code and Git.

Preferences:

Bachelor's Degree in Computer Science, Information Technology, Software Engineering, Math, or Physics

Master's Degree with coursework focused on advanced algorithms, mathematics in computing, data structures or related field

Expert knowledge of Azure

Demonstrated passion for infrastructure automation

Ability to prioritize work in a fast-paced environment.

Job Tags

Full time, Remote job
