Senior DevOps Engineer

TD Bank Group
in Toronto, ON, Canada
Permanent, Full time
Company Overview

Tell us your story. Don't go unnoticed. Explain why you're a winning candidate. Think "TD" if you crave meaningful work and embrace change like we do. We are a trusted North American leader that cares about people and inspires them to grow and move forward.

Stay current and competitive. Carve out a career for yourself. Grow with us. Here's our story:

Department Overview

Layer 6 is a leading Canadian applied machine learning research company and a wholly owned subsidiary of TD Bank Group. Layer 6 develops advanced machine learning and deep learning systems that have the power to uplift large populations while advancing the field of artificial intelligence. Our research is supported by access to massive datasets, close collaboration with world-renowned academic faculty, and a uniquely scalable machine learning platform.

Our technical capabilities have been publicly recognized through wins in international machine learning competitions, including the prestigious ACM RecSys Challenge (the only repeat winner in 2017 and 2018 and runner-up in 2019), Google's Landmark Retrieval Challenge (2nd place in 2018, 3rd place in 2019), the Stanford Question Answering Dataset (2nd place in 2019), the 3rd YouTube-8M Video Understanding Challenge (winner in 2019), and Open Images 2019 - Visual Relationship (winner in 2019).

Job Description

About This Role

You will join the team in solving the challenges of developing high-performance, robust, and scalable machine learning solutions and delivering them to production.

We are looking for world-class DevOps engineers and problem solvers. You will work with machine learning scientists and engineers to develop and automate machine learning pipelines that work. In particular, you will participate in the development, delivery, and automation of scalable systems for data ingestion, processing, validation, model training, large-scale computation, monitoring, serving results, and handling upgrades.

Major responsibilities include but are not limited to:
Design and implement our next-generation AI platform in AWS and Azure, including CI/CD, IAM, networking, security controls, storage, and automation of AI-specific tasks

Maintain and upgrade the on-premises AI cluster

Design and implement automation for the model delivery system


What can you bring to Layer 6? Share your credentials, but your relevant experience and knowledge can be just as likely to get our attention. It helps if you have:

Required Skills

5+ years of experience building sophisticated and automated infrastructure

Prior success in automating a real-world production environment

Experience with seamless/automated build scripts used for release management across all environments

Experience with managing CI/CD tools and pipelines

Experience with Docker and container orchestration

Understanding of and experience with deploying microservices

Solid Azure and AWS experience

Strong scripting skills, e.g., Bash and Python

Knowledge of IP networking, VPNs, DNS, load balancing, and firewalls

Strong practical Linux systems administration skills in a cloud environment (Red Hat and Ubuntu)

Strong verbal and written communication skills, with the ability to work effectively across teams

Experience with Git, Jenkins, Elasticsearch

Experience with automated testing tools

Familiarity with monitoring tools

Experience with troubleshooting and tuning system performance

Knowledge of Java and C++ build tools

BA/BS degree or equivalent experience; Computer Science or Math background preferred

Nice to have

Java, Scala, C++, or Python experience

Big Data experience

GPU experience

Additional Information


Entrepreneurial and learning culture

Excellent health coverage and pension plan

Four weeks paid vacation

Catered lunches twice a week over machine learning talks




At TD, we are committed to fostering an inclusive, accessible environment, where all employees and customers feel valued, respected and supported. We are dedicated to building a workforce that reflects the diversity of our customers and communities in which we live and serve. If you require an accommodation for the recruitment/interview process (including alternate formats of materials, or accessible meeting rooms or other accommodation), please let us know and we will work with you to meet your needs.