Enterprise System Management - L3 Operations

  • Competitive
  • Tokyo, Tokyo-to, Japan
  • Permanent, Full time
  • Morgan Stanley
  • 20 Sep 17

See job description for details


The L3 Operations role is part of Enterprise System Management (ESM Ops) in Morgan Stanley's Enterprise Infrastructure division. ESM Ops is a Level 3 team of subject matter experts focused on the firm’s strategic platforms and components that deliver telemetry, metrics, and analytics across Morgan Stanley Technology. These platforms underpin the very fabric of technology telemetry for operations and security at the firm. The team is expanding to meet growing demand for our solutions by internal customers. This position is reporting directly to the Asia Ops Manager based in Tokyo.

Responsibilities include:

  • Deploying and administering Apache Kafka and internal components related to our Logging as a Service platform
  • Deploying and administering a large Splunk Enterprise environment
  • Deploying and administering a distributed Extrahop environment
  • Deploying and administering other selective application monitoring and alerting platforms, both in-house and vendor sourced, including AppDynamics
  • Working with clients to navigate and make best use of the environment
  • Working with engineering teams to improve the design and supportability of the environment
  • Troubleshooting issues that arise
  • Problem and incident management
  • Driving process improvements through automation and scripting
  • Writing technical documentation
  • Enabling the transfer of work to L2 as well as accepting their escalations
  • Sharing an on-call rotation with the rest of the large global team (with a time-off in lieu system)




Qualifications:


Skills Required:

  • Experience supporting, diagnosing, and solving problems in a distributed systems Unix/Linux operating environment
  • Working knowledge of Python and/or Perl
  • Problem solving skills and the ability to multi-task
  • Good interpersonal and communication skills / Strong verbal & written skills required to interact with customers and global teams
  • Can successfully operate as a team member as well as independently
  • Can successfully handle high pressure situations such as system outages


Skills Desired:

  • Knowledge of analytics platforms such as Apache Kafka, Splunk, Extrahop or other Application Performance Management solutions
  • TCP/IP networking
  • Experience using git or other source control system