Site Reliability Engineer (Windows, SQL)

Employee | Tech | Professional | Poland | Katowice | 2024-02-19 | REQ-10065559

Apply

We are looking for you, if you:

  • have experience in operating system administration (Windows),
  • know key cloud proconcepts you can describe cloud-native
  • understand and have knowledge about other stack layers – Network, Virtualization, Middleware, Databases (MS SQL),
  • have good understanding of programming (preferred languages: Python, PoweShell, Golang),
  • know how to use IaC/orchestration/automation tooling like Azure Pipelines, Ansible, Terraform,
  • can identify and automate infrastructural management tasks using best infra-as-code practice,
  • know key reliability engineering framework practices, consumer engineering idea and acronyms like SLI, MTTR and BCM are not just a couple random letters glued together.

You'll get extra points:

  • you value your time and don’t log in to host to run commands – Infra as a Code is your creed.
  • you do not like solving Incidents you prevent them from happening.
  • you like to always be step ahead and use new technologies.

English level: B2

    As the Site Reliability Engineering Department, we focus on four key topics:

    • Run & Change
    • Enablement
    • Rapid Response
    • Education

    Your responsibilities:

    • Implementation of reliability across global platforms & services, global supporting tooling and entities:
      • Operating in strong cooperation with involved Enterprise Architects, other SREs & DevOps engineers,
      • Implementing observability measures via respective tooling of our critical business services,
      • Identifying service level objectives with associated indicators
      • Look for and elimination of manual and repetitive task (commonly known as toil
      • Planning and evaluating new releases of features within infrastructure environment (release trains)
    • Later on, focus will also be on other practices e.g.
      • Mature major incident management process (major incident mgt, problem mgt, post-mortem & root-cause analysis)
      • Mature capacity planning & forecasting practice
      • Mature reliability reporting
      • Introduction of Error budgeting
      • Knowledge management about spreading “reliability by design” concept and execution of all required reliability practices

    Information about the squad:

    We are a Team of Infra admins who got tired of manual work and decided to move to Infra as a Code approach. We want to prevent, not repair and make our system Reliable. Taking best approach from Google and Microsoft we want to create Culture of SRE Engineering with focus on Design, Run Enable, Rapid Response, Educate and Review. Are you up for the challenge?

    Apply

    Questions about this opportunity?

    Feel free to contact Team, Recruiter. e-mail: Career.INGHubsPoland@ing.com

    Back to top

    Please be aware that the recruitment procedures, (labour) regulations and labour agreements of Poland apply.

    Yes No
    Listen