Director of Cloud Operations & Engineering - Platform Organization
Requisition ID: 78288
Join the Global Community of Scotiabankers to help customers become better off.
As Scotiabank's engine of modernization, the PLATO platform enables technology teams to build software quickly and securely using modern practices. PLATO is an integrated set of technical capabilities, services and processes that encapsulate critical enterprise functions through standardization, re-use and automation.
The PLATO team is comprised of engineers, problem solvers, agilists and creatives in roles such as Enterprise Platform Engineering and Architecture, Enterprise Data Services, Cloud Infrastructure and Architecture, Product Engineering, and Product Management. Together, the team provides the platform that enables the Bank to deliver transformative experiences that help our 24 million customers become better off.
Interested in joining an agile team that's impacting change for our customers around the world? Watch our video
Scotiabank is looking for a Director of Cloud Technology Operations & Engineering to build a new team focused on delivering our container solution using Kubernetes and Docker to the organization. The ideal candidate will be highly motivated, articulate, experienced building an operations & fulfillment capability and intimately familiar with Linux operating systems, Google & Azure cloud services delivering Kubernetes container solutions, creating tools, automated deployments, upgrades, onboarding and validation.
Build an Technology Operations & Fulfillment team to deliver container based solutions and services and manage and operate the core container based infrastructure for the enterprise that includes:
A. Service Provisioning: Tech OPS will create a service catalog & provision multiple services across PLATO to enable a seamless user experience. (Ex. environments, service accounts, metadata, templates, manifests, API, API registration, event subscriptions, etc.)
B. Business Support: Provide deployment and post deployment support (Ex. warranty support, command center services, process, review release content and coordinate with clients.
C. Automation: Create & maintain deployment automation scripts. Create automated recovery scripts. Build dashboards and provide service analytics.
D. Operations: Maintain and troubleshoot production & non-production environments including certificate management, patch management. Create and maintain alerts and dashboards, access control management and perform system admin activities. Create deployment and operations guides. Perform NFT testing on the ecosystem to ensure resilience & peak performance.
E. L2 Support Services: Restore services within agreed upon SLA. Respond to Major Incidents impacting technical components, seek opportunities for improvement.
F. Capacity Management: Ensure the capacity of each service is able to meet SLAs in a cost effective manner. (Ex. monitor environment capacity, manage & optimize available space and resources.) Define the container service fulfillment and operations model including Docker Enterprise Edition and Kubernetes.
Additional Services include:
Designing and implementing solutions to leverage a Kubernetes cluster.
Deploying cloud-native applications and monitoring and supporting a Kubernetes environment.
Develop, test, and deploy code that will configure and manage enterprise container service environments (Docker and Kubernetes). Configuring hardware, peripherals, and services, managing settings and storage.
Deploy and upgrade Docker and Kubernetes software as well as stackdriver, maestro & impeller.
Support the deployment of applications into the container service.
Troubleshoot and resolve issues in Docker and Kubernetes environments.
Support deployment orchestration for business applications and services into the container service.
Deliver training, coaching, and support for engineering teams to write code that will manage infrastructure and application configurations.
Provide product vision, standards, and best practices for using configuration management software across the organization.
Define strategies on implementing cloud agnostics K8 solution.
Develop, test, and deploy code that will configure and manage our client's enterprise container service environments (Docker and Kubernetes).
Define the container service architecture, including Docker Enterprise Edition and Kubernetes.
Assess & debug issues with the container service and applications.
Participate in team code reviews and design reviews.
The main and routine tasks of this position are to:
• Develop and maintain an advanced platform to manage Kubernetes clusters lifecycle
• Develop tools for automated build, test, deployment and management of the platform
• Improve continuous integration and delivery systems
• Monitor system events to ensure health, maximum system availability and service quality
• Perform system & application patching
• Maintain documentation regarding configurations, operations and troubleshooting procedures
• Participate in the definition of standards, guidelines and best practices
• Assist in evaluating new requirements, technical design and standards
• Answer user's query and service requests
• Participate in R&D effort bringing new ideas and new technologies
Bachelor's Degree in Computer Science or relevant field.
7-10 years' work experience with DevOps solutioning or infrastructure engineering & operations roles.
Minimum of 2 years of experience with container software (Docker Container and Kubernetes). Implementation, setup, configuration, troubleshooting, tuning and infrastructure.
Minimum of 2 years of experience with AWS, Azure or Google cloud IaaS, strong experience with Windows, Linux, or Unix/OpenShift, as well as solid skills with orchestration platforms, such as OpenShift, Kubernetes.
Experience with Linux, scripting, automation, computing in the cloud, and application monitoring technologies.
Strong understanding of container lifecycle and infrastructure code concepts.
Experience working with Google Cloud and/or Microsoft Azure.
Experience with the usage of GitHub (or Git version control systems).
Experience working in an agile development environment.
Detail oriented, team player with an ability to work with minimal direction.
• Bachelor's Degree in Computer Science, Information Systems or equivelan
Location(s): Canada : Ontario : Toronto
As Canada's International Bank, we are a diverse and global team. We speak more than 100 languages with backgrounds from more than 120 countries. Our employees are committed to a superior customer experience and use the Bank's six guiding sales practice principles to ensure they act with honesty and integrity.
At Scotiabank, we value the unique skills and experiences each individual brings to the Bank, and are committed to creating and maintaining an inclusive and accessible environment for everyone. If you require accommodation (including, but not limited to, an accessible interview site, alternate format documents, ASL Interpreter, or Assistive Technology) during the recruitment and selection process, please let our Recruitment team know. If you require technical assistance, please click here . Candidates must apply directly online to be considered for this role. We thank all applicants for their interest in a career at Scotiabank; however, only those candidates who are selected for an interview will be contacted. Job Segment:
Operations Manager, Manager, Cloud, Banking, Operations, Technology, Management, Finance