Overview

Job title:- Site Reliability Engineer
Location:-Pittsburgh PA.

Service reliability senior engineer is the lead operations role interfacing with existing teams comprised
of multiple skill sets (Support, Development, Systems Operations) focused on the operations of a specific
agile release train (ART). The SRE is the change agent who provides direction and focus to the teams
through an understanding of the individual ART blended with a broad skillset in software reliability
engineering. SRE is reliant on leveraging appropriate data metrics about applications in order to focus
on the appropriate priorities that deliver the most business impact. The main goals are to create ultra-
scalable and highly reliable software systems.
The Service Reliability engineer will focus on enabling both the operations teams and the development
teams through the application of reliability and resiliency patterns and best practices as appropriate to
the Agile Release train. The Service Reliability engineer will also collaborate with peers in other Agile
release trains and the organization level Reliability Engineering and DevSecOps Enablement groups. This
is a technical lead role with an emphasis on strong coaching, mentoring, and communication skills
utilizing multiple forms of media to drive continuous improvement across the value stream.
Successful candidates will be required to develop a strong understanding of how their IT value stream
supports the business, including the ability to help their business partners articulate and document
system expectations and impacts.
Deep understanding and strong experience in more than one of the following
Distributed Systems Administration (Windows/Linux)
Technical Architecture
Networking and network based Load Balancing
Shared infrastructure resources – SAN/NAS, Message Bus, Backup and Recovery, Data Replication
Application Architecture/Platform Development
Cloud Platforms
Cloud Native Architecture
Monitoring/Instrumentation Patterns (AppDynamics, Splunk, HP Openview)
DevSecOps
Reliability Engineering/Site Reliability Engineering concepts and Patterns
Resiliency Patterns
IT Operational support and Crisis Management
Database administration both Physical and Logical/Data Architecture
Conceptual understanding and some experience in more than one of the following
Java
Agile
API/Service Orientation
High Availability and Disaster Recovery Patterns
Virtualization/Containerization
Application Instrumentation/Telemetry and Operational Data Visualization
Cyber Security Concepts
Shared infrastructure Resources
This particular position will be with our Financial Enablement value stream. They are going to be
focused on working with the team that will be migrating our Rating and Pricing systems over to a more
modern enterprise solution. Anyone with modern architecture/cloud based experience and financial
systems background will be a plus
DONATO TECHNOLOGIES, INC
12100 Ford Rd, #306, Dallas, TX 75234
Email: [email protected]

More jobs: