Ops Engineer
Employee | Tech | Professional | Philippines | Makati | 2024-04-22 | REQ-10067841
Title: Site Reliability Engineer
Overview:
Working as an SRE in this team, you will have the responsibility to impact ING globally. You will challenge and contribute to ING’s well-architected framework and underlying reliability patterns. You will be responsible in
If you enjoy:
- monitoring and troubleshooting
- performance monitoring
- proactive identifying problem areas of a system, i.e. software bugs, misconfigurations, bottlenecks
- trend analysis
- availability and reliability
- increasing availability and reliability of our production systems
- chaos testing
- capacity planning
- technical risk and health assessment
- keeping systems compliant with technical state health assessments
- service level management
- proactive monitoring and adherence to SLAs
- holding IT Engineering, Security and Architecture accountable for remediation of any SLA degradation
- IT Key Controls
- ensuring that IT is ‘in CONTROL’ by holding IT groups accountable for adherence
- collating and providing necessary evidence to Auditors for these controls
- automation
- architecting, creating and automatically managing an army of ‘runners or bots’ that fully automate tasks across infrastructure and applications
- building and integrating tools that will assist in improving system availability, reliability, and performance
- incident and problem management
- coordinating incident management and service restoration.
- SREs are part of the on-call team of engineers that support production systems
- Work with BizDevOps squads on post mortems & assist in identifying and fixing reliability issues
- disaster recovery (DR) & business continuity planning (BCP)
- production reporting
- service request management
If you are:
- a good leader and technology evangelist
- an innovator
- accountable and takes ownership of problems
- an eager investigator
- cool under pressure
If you can:
- work comfortably with Linux, Unix, and Windows
- work with virtualization technologies such as VMWare
- develop and run applications in the cloud (Azure, AWS, GCP)
- containerize and orchestrate applications (Docker, Openshift, Kubernetes, microservices)
- continuously integrate and deploy (CI/CD, git, jenkins, ansible)
- comfortably write automation scripts (bash, go, python)
- administer and query SQL
- work and analyze aggregated logs (ELK)
- configure networks (load balancing, firewalls)
- work with observability tools (Prometheus, Grafana)