
Site Reliability Engineer - P206725_S1

Riverwoods, IL 60015
  • Job Code
At Discover, be part of a culture where diversity, teamwork and collaboration reign. Join a company that is just as focused on its employees as it is on its customers and is consistently awarded for both. We're all about people, and our employees are why Discover is a great place to work. Be the reason we help millions of consumers build a brighter financial future and achieve yours along the way with a rewarding career.

As a Site Reliability Engineer, this person is responsible for the availability, performance, monitoring, and incident response, among other things, of the CI/CD Platform.

Responsibilities:
  • Build a Playbook for maintaining the Availability, Security, Reliability and Quality of Service (QoS) of the CI/CD Platform and ensure the SLA goals are met.
  • Define Service Level Objectives (SLO), Service Level Agreements (SLA) and Service Level Indicators (SLI) for the CI/CD Platform.
  • Implement fault tolerance, performance enhancement and configuration management of applications using DevOps tools and automation capabilities.
  • Engage with developers, infrastructure and operations engineers to integrate software development and delivery.
  • Perform system administration activities, primarily on Linux systems or Kubernetes clusters such as OpenShift.
  • Experience with DevOps tool sets such as Git, GitHub, Jenkins, SonarQube, Nexus, Chef, Ansible, Terraform, Docker, Kubernetes, and OCP (OpenShift).
  • Experience with Linux system administration (Windows experience is a plus).
  • Using REST APIs to get tools to talk to each other (especially when they weren't designed to).
  • Experience with coding against REST APIs.
  • Operational Performance & Stability: Works with other members of their assigned Value Stream to ensure that the in-scope applications/platforms are meeting performance and stability requirements. This includes managing Major Incidents to Mitigation/Resolution.
  • Problem Management: Performs Post-Incident Reviews of all Major Incidents and determines the Action Items required to avoid similar issues and minimize downtime in future Incidents.
  • Monitors and Metrics: Works with Application Development to ensure that assigned applications/platforms have the appropriate monitoring and metrics in place to measure performance and stability.
  • Identify Functional and Non-Functional Improvements: Acts as the Operations representative in Value Stream planning and prioritization sessions to ensure that the Operational needs of assigned applications/platforms are addressed. Holds quarterly Operational Performance Reviews with Value Stream management.
  • Release Planning & Coordination: Works with other members of their assigned Value Stream to ensure that the Production releases for their in-scope applications/platforms are properly planned and coordinated. This includes holding Change/Release implementation reviews to ensure thorough and appropriate implementation plans.
  • Provides review and sign-off/approval of change tickets for the assigned Value Stream. Represents the Value Stream in Change Advisory Board Meetings.
  • Participates in Program Increment Planning Sessions as a liaison for Operations and Infrastructure support. Provides information regarding upcoming critical changes to the Value Stream.
  • Operational Readiness: Ensures that applications/platforms in the Value Stream are Operationally ready for Production. This includes Annual Review of all SOPs/Knowledge Articles.
  • Monitoring review for any new Feature launch or other significant change that may impact monitoring. SOP/Knowledge Article review for any new Feature launch or other significant change that may impact support documentation.
  • Training of Command Center and Application 1st level Support on new SOPs, Knowledge Articles, and any other support-related needs.
  • Performs Monthly Capacity Analysis of applications/platforms within the Value Stream. Creates and maintains Operationally focused ELK Dashboards for the Value Stream.
  • Responsible for the Operational Stability and Performance of one or more Critical Business Services used by Discover Customers and Employees.

Minimum Qualifications
At a minimum, here's what we need from you:
  • Bachelor's Degree in Business, Computer Information Systems, Computer Science, MIS, Engineering, Science, or related field
  • 2+ years of experience in Information Technology, or related field
  • In lieu of a degree, 4+ years of experience in Information Technology, or related field

Preferred Qualifications
If we had our say, we'd also look for:
  • 5+ years of hands-on knowledge of at least one programming language such as Java or Python and one scripting language such as Bash or Groovy
  • 3+ years of experience as a Site Reliability Engineer with a passion for debugging: troubleshooting and debugging monitoring alerts in the Product
  • Knowledge of, or at least familiarity with, the Linux shell, system internals, networking, Java applications, and RDBMS and NoSQL databases
  • Ability to read and understand server/systems logs and produce meaningful issue analyses
  • Familiarity with Splunk, Python, Apache, rsync and monitoring/alerting tools such as Nagios and xMatters is a plus
  • Experience with container and on-premise/public cloud technologies
  • 4+ years of experience in Technology, or related field

The same way we treat our employees is how we treat all applicants - with respect. Discover Financial Services is an equal opportunity employer (EEO is the law). We thrive on diversity & inclusion. You will be treated fairly throughout our recruiting process and without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status in consideration for a career at Discover.



  • Banking / Finance
Posted: 2019-11-02 Expires: 2019-12-12

Welcome to Discover
We strive to be the leading direct bank and payments services company. Our mission is to help people spend smarter, manage debt better, and save more to achieve a brighter financial future.

Why Work with Us?
You can make an impact. Whether it’s developing corporate strategy, innovating new services or supporting IT needs, every employee has the opportunity to be a vital part of our business and make a real difference in people’s lives. It’s the heart of what we do.

