DevOps Tools Engineer

Ameriprise Financial, Inc.
in Gurgaon, Haryana, India
Permanent, Full time
Competitive
Job Description

POSTING DESCRIPTION - RESPONSIBILITIES:

As an operations analyst at Ameriprise, you will play a key role in the day-to-day activities of the tools and monitoring team. You will be part of a team responsible for the support and maintenance of the enterprise monitoring platform and service management tools. In this role you will often collaborate with partners in other lines of business to ensure monitors are properly implemented and maintained when new applications are introduced or changes are made to existing applications. You will help drive the optimization of technology operations through system and service performance monitoring, data, and analysis. You will collect and analyze data and report on performance, usage, and trends to identify opportunities for improvement, and you will escalate and document issues and events according to established procedures and processes.

Proactive Monitoring & Preventative Maintenance
- Ensure the uptime and response time SLAs/OLAs for services are met or exceeded.
- Proactively monitor the stability and performance of various technologies and take appropriate corrective action before an
incident or problem occurs.
- Create action plans to address issues and gaps in monitoring deliverables/services to ensure that proper monitoring exists to
proactively find issues and leading indicators.
- Aid in the configuration and integration of modern technologies.

Troubleshooting & Incident Management
- Actively collaborate with fellow members of the team and contractors/vendors on bridge calls to prevent
or quickly resolve incidents/problems.
- Perform moderately difficult and independent assignments in the troubleshooting, problem diagnosis,
problem resolution and ongoing production support for one or more technologies within the technical
area of expertise.

Data Collection, Tracking & Analysis
- Use a variety of data collection techniques and systems to collect technology operations performance data.
- Analyze the collected data to draw sound conclusions regarding performance, trends and issues (current and/or potential).
- Monitor compliance with defined SLAs/OLAs.
- Monitor consumption/usage metrics to understand trends and assist in the effective management of vendor partners (as
applicable).
- Perform trend analysis to find the cause of performance and/or usage issues.

Alert & Document
- Alert the appropriate team (per incident and event management processes) to provide warning/notification that a threshold has
been reached, something has changed, or a failure has occurred.
- Document concerns and findings, collect all pertinent data (including a comparison of exception data and normal data) and
ensure incident/event tracking tools are updated according to established guidelines and procedures.

Continuous Improvement
- Work with application teams to determine the impact of application changes on the monitors configured for an application
and determine if any changes or additions are needed.
- Aid teams in identifying monitoring requirements and implementing the appropriate monitors to achieve the desired
results.
- Use experience, expertise and data analysis to collaborate with the manager and team members in identifying corrective
actions to increase efficiency, improve performance and meet or exceed targets.

Required Qualifications

- Bachelor's degree in Computer Science, IT, MIS, Math or related field; or equivalent work experience.
- 3-5 years of experience in a technology operations organization.
- 1+ years of experience pulling and analyzing data in support of a technology operations organization. Must have prior hands-on experience with server and application monitoring tools.
- In-depth knowledge of the capabilities of IT monitoring tools, preferably server and application monitoring tools.
- Ability to work occasional evenings and weekends to aid the team in providing 24x7 support/weekend support.
- Proficiency with the entire Microsoft Office suite.
- Python/Shell/Java scripting skills are a must.

Preferred Qualifications

- ITIL Foundation certification.
- Proven ability to clearly and persuasively communicate ideas, issues and recommendations.
- Strong analytical skills, with a proven ability to synthesize data into meaningful and digestible data points and actions.
- Strong attention to detail.
- Experience with tools such as Dynatrace APM, ScienceLogic, ServiceNow Event Management, Dynatrace Synthetic, and Sumo Logic (log monitoring).
- Experience with integrations based on probes, gateways, and web services, and with tools like Microsoft SCOM.
- Understanding of DevOps and CI/CD concepts.