Description and Requirements
Position Summary
Design, implement, and manage the organization's IT infrastructure, including network systems, servers, storage, virtualization, and cloud services. Develop and maintain infrastructure architecture standards, policies, and procedures.
Job Responsibilities
Job Responsibilities:
- Manage BMC product portfolio like TCO, TOM, BCO & BPA
- Fundamental understanding of the functionality of BMC True Sight Capacity Optimization solutions and architecture along with different Integration methods
- Functional understanding of software integrations and software systems experience with Web services and API
- Support senior resources in the successful design and implementation in capacity Optimization of Servers, network and Storage architecture
- Proven ability to configure BCO per client requirements and adhere to best practices for the following criteria
- Performance and response time and Capacity and growth
- High availability and failover and Disaster Recovery
- Must be able to read and understand a technical architecture that spans multiple platforms and Data Centers and deploy software products as dictated by the plan with little or no supervision
- Strong technical, analytical, and problem-solving skills
- Knowledge on BMCs IT business software portfolio
- Engineer solutions and establish standards for Server Monitoring and deployments, including optimizations and tunings per requirements
- Devise an authentication and authorization model for monitoring within standard Client Infrastructure framework. Develop customizations as needed
- Architect a highly available and scalable infrastructure with appropriate monitoring and alerting mechanisms.
- Seek opportunities for automated deployments so as to reduce operational tasks.
- Seek opportunities for integration of Server with other monitoring tools in the Client portfolio.
- Test and implement Monitoring Extensions appropriate for Client's technology stack (e.g. Webservers; Messaging Tiers).
- Test and implement Database Monitoring
- Design, configure, and implement Server Monitoring Tool and Integrate solutions based on business requirements.
- Collaborate with architects and development teams to ensure effective integration of applications.
- Ensure environments are set up for optimal performance, scalability, and reliability.
- Design and implement message models for efficient communication.
- Troubleshoot and resolve issues related to Alert queues, Alerts flows, and other components
- Monitoring Requirements and Request fulfillment Support
- Review monitoring ServiceNow Requests, follow up with stakeholder where more clarity is needed
- and perform any necessary testing of custom monitoring
- Perform custom and database monitoring deployment, removal and modification
- Investigate reports of missed alerts
- Perform cell remediation in Impact manager for alerting issues
- Configuring auto ticketing and make updates to routing table for all monitoring platforms and third party tools
- Set blackout windows on monitoring in any scenarios not covered by automated blackout requests
- Add, modify and remove URL/keepalive monitors
- Add modify and remove synthetic transections monitoring as needed
- troubleshoot monitoring data collection and configurations issues
- Provide first level response to questions about monitoring from stakeholders
- Server Build
- Perform additional and removal of new servers in TSCO console
- Troubleshoot incident related to TSCO data collection issues and TSCO agent
- Application Performance Monitoring Support
- Create alerting policies assign health rules for auto ticketing and create blackout schedule for APM
- perform updates to the config template for addition and removal of container monitoring
- provide first level response to questions from stakeholder about APM configuration
- Troubleshoot APM alerting and container monitoring configuration issues and provide RCA Perform routine administrative tasks, including monitoring, logging, and health checks
- Gather requirements and business process knowledge in order to transform the data in a way thats geared towards the requirement
- Maintain and improve already existing processes
Education, Technical Skills & Other Critical Requirement
Education
Bachelor's degree in computer science, Information Systems, or another related field with 4+ years of IT and Infrastructure engineering work experience.
Experience
(In Years)
4Years+
Technical Skills
- Experience on monitoring tools
- Experience in monitoring solution development and deployment by understanding client requirement
- Strong technical skills with emphasis in the areas of IT infrastructure including Windows, VMware, Storage, Linux and Azure
- Prior knowledge on program management and business tools
- Experience in preparation and execution of project plans, schedules, cost estimates, baseline change management, risk mitigation, technical design reviews, management reviews, customer coordination meetings, cost/schedule/status reporting. Implementing corrective actions as necessary to achieve commitments.
- Ability to grasp technical concepts rapidly. History of understanding avionics and core platform life cycle: requirements, development, test, flight test, certification.
- Strong communication skills, able to understand the project risks well ahead, take necessary actions.
About MetLife
Recognized on Fortune magazine's list of the 2024 "World's Most Admired Companies" and Fortune World's 25 Best Workplaces™ for 2024, MetLife , through its subsidiaries and affiliates, is one of the world's leading financial services companies; providing insurance, annuities, employee benefits and asset management to individual and institutional customers. With operations in more than 40 markets, we hold leading positions in the United States, Latin America, Asia, Europe, and the Middle East.
Our purpose is simple - to help our colleagues, customers, communities, and the world at large create a more confident future. United by purpose and guided by empathy, we're inspired to transform the next century in financial services.
At MetLife, it's #AllTogetherPossible . Join us!
Top Skills
What We Do
Named one of Fortune’s “World’s Most Admired Companies,” MetLife is leading the global transformation of an industry we’ve defined for more than 150 years. At MetLife, every innovation and line of code is a lifeline for our customers and their families—from victims of natural disasters to people living with disabilities and beyond. With operations in more than 40 markets and leading positions across the globe, MetLife’s building a workforce of diverse and empowered voices that all belong. Join our remarkable journey—one in which you help write the next century of innovation in financial services—because with MetLife, making the world a better place is All Together Possible.
Why Work With Us
At MetLife, you’ll be working for a company whose purpose is to help customers throughout their life’s journey, and often in their most critical time of need. You’ll be a part of developing leading-edge platforms that will have a lasting impact on the lives and well-being of tens of millions of customers.
Gallery
MetLife Teams
MetLife Offices
Hybrid Workspace
Employees engage in a combination of remote and on-site work.
MetLife's current workplace policies classify roles as Office, Hybrid or Virtual based on the nature of work, encouraging new ways of working together