Private Cloud SRE

Company : Morgan Stanley
Salary : Details not provided
Location : Karnataka

Full Description


POSTING DATE: May 10, 2022
PRIMARY LOCATION: Non-Japan Asia-India-Karnataka-Bengaluru
EDUCATION LEVEL: Associate's Degree
JOB: Production Management and Operational Support
JOB LEVEL: Associate


About Us
Company Profile
Morgan Stanley is a leading global financial services firm providing a wide range of investment banking, securities, wealth management and investment management services. With offices in more than 41 countries, the Firm's employees serve clients worldwide including corporations, governments, institutions and individuals. For further information about Morgan Stanley, please visit
Technology/ Role/ Department at Morgan Stanley
Enterprise Technology & Services (ETS) delivers shared technology services for Morgan Stanley supporting all business applications and end users. ETS provides capabilities for all stages of Morgan Stanley’s software development lifecycle, enabling productive coding, functional and integration testing, application releases, and ongoing monitoring and support for over 3,000 production applications. ETS also delivers all workplace technologies (desktop, mobile, voice, video, productivity, intranet/internet) in integrated configurations that boost the personal productivity of employees. Application and end user functions are delivered on a scalable, secure, and reliable infrastructure composed of seamlessly integrated datacenter, network, compute, cloud, storage, and database functions.

Job Profile
The private cloud team is responsible for enabling the on-premises cloud to become a preferred platform across Morgan Stanley IT. This is a global, multi-discipline team responsible for supporting the virtualization plant, and is the highest escalation level of operational support. The infrastructure we support houses a large number of production servers and trading applications, in line with efforts to host applications on a virtualized platform rather than on traditional hardware.

Our team supports a leading virtualization technology (VMware) as well as a range of in-house tools that provide manageability services such as infrastructure and virtual machine provisioning and maintenance, monitoring, performance monitoring, capacity management, CMDB, platform automation and runtime tools. We are presently looking for a Site Reliability Engineer to accelerate the culture of resilience and the transformation toward observability and SRE adoption. The candidate will be involved in multiple SRE initiatives, including scripting, automation, observability, error budgets, SLIs, SLOs and SLAs. The candidate will work alongside the Operations team and application development teams to optimize areas such as monitoring, incident response, post-mortem/root cause analysis, capacity planning and data integrity.

Primary Responsibilities:

Support for a production VMware environment
Provide L3 front-line response to incidents and outages for Morgan Stanley’s private cloud, including on-call rotation on both weekdays and weekends for project work and incidents
Work closely with the internal engineering team and provide input on testing of new component releases and infrastructure upgrades, as well as on performance, capacity and monitoring
Assist with the implementation of a full observability stack for alerting, performance, configuration validation and proactive analysis
Assist with the adoption of SRE practices, mindset and cultural change across the global team
Assist with implementing feature stories based on toil reduction, establishing SLOs/SLIs, production readiness, monitoring and observability
Implement SRE frameworks to support global multi-cloud environments, and ensure the highest level of SLA through operational excellence


Required Qualifications / Skills

Good Linux experience
Task automation experience in any programming language (preferably Python)
Experience with at least one pillar of observability (metrics, logs or traces)
Experience with Agile and DevOps/SRE concepts
Ability to communicate effectively with various user groups, such as developers and engineers, as well as remote team members
Good knowledge of server infrastructure, virtualization and cloud computing
ITIL or equivalent knowledge

Desired Skills

Python development for task automation
Experience with site reliability engineering practices, such as service level objectives (SLOs), error budgets, blameless post-mortems and toil reduction
Knowledge of system monitoring in cloud environments, including products and tools such as Splunk, Grafana, PagerDuty and Prometheus
Experience with data science/ML tools using statistical programming languages (R, Python, SQL, etc.) to manipulate data and draw insights from large datasets

Morgan Stanley is an equal opportunities employer. We work to provide a supportive and inclusive environment where all individuals can maximise their full potential. Our skilled and creative workforce is comprised of individuals drawn from a broad cross section of the global communities in which we operate and who reflect a variety of backgrounds, talents, perspectives and experiences. Our strong commitment to a culture of inclusion is evident through our constant focus on recruiting, developing and advancing individuals based on their skills and talents.