SRE Manager (India)

Posted 21 Hours Ago
Be an Early Applicant
Bangalore, Bengaluru, Karnataka
Artificial Intelligence • Software • Generative AI
The Role
Seeking a skilled and motivated Site Reliability Engineer (SRE) to ensure the reliability, availability, and performance of cloud-based services and applications. Responsibilities include high availability implementation, incident management, automation development, performance optimization, security, and monitoring.
Summary Generated by Built In

ABOUT GLEAN:

We’re on a mission to bring people the knowledge they need to make a difference in the world. 

Glean was founded by a seasoned team of former Google search and Facebook engineers, who wondered why we don’t have an easier way of finding what we need at work. In our personal lives, we have tools to help us find pretty much whatever we need. Why don’t we have that at work? And that was the beginning of Glean.

Glean searches across all your company’s apps to help you find exactly what you need and discover the things you should know. We’re a diverse team of curious and creative people who want to help each other get big things done—so we can help other teams do the same. 

We're backed by some of the Valley's leading venture capitalists—including Sequoia, Kleiner Perkins, Lightspeed, and General Catalyst—and have assembled a world-class team with senior leadership experience at Google, Slack, Facebook, Dropbox, Rubrik, Uber, Intercom, Pinterest, Palantir, and others.
ABOUT THE ROLE:
As an SRE Manager at Glean, you will lead a team of software and system engineers in ensuring the reliability, availability, and performance of our cloud-based services and applications. You will collaborate closely with our engineering teams to design, build, and maintain robust, scalable, and highly available cloud infrastructure. Your expertise in team leadership, coding, algorithms, problem-solving, and SRE practices will be crucial in managing the complex challenges of scale and fast growth unique to Glean.
What you will do and achieve:
Team Leadership and Development: Lead, mentor, and support a team of software and systems engineers, fostering their growth and success. Set team priorities and drive the execution of team OKRs with input from engineering leadership and cross-functional partners. Establish technical credibility and influence the technical direction and high-quality delivery from the team.

  • Ensure High Availability: Implement and maintain resilient cloud architectures, monitor system performance, and proactively identify and resolve potential bottlenecks or points of failure.
  • Incident Management: Participate in primary oncall rotation; cultivate technical curiosity and growth mindset, and a blameless postmortem culture within the team. Continuously optimize the on-call process for sustainability and efficiency.
  • Automation and Tooling: Develop and maintain automation scripts, tools, and processes to streamline system deployment, monitoring, and management tasks. Your contributions will be vital in efficiently scaling cloud operations.
  • Performance Optimization: Optimize cloud infrastructure and applications for performance, scalability, and cost-effectiveness.
  • Security and Compliance: Collaborate with security engineers to implement best practices and ensure compliance with security standards and policies.
  • Monitoring and Alerting: Design and configure advanced monitoring systems to gain insights into system behavior, set up alerts, and respond proactively to potential issues.
  • Create and maintain comprehensive dashboards and playbooks for production on-call.
  • Software Development Consultation: Engage actively in the entire software development lifecycle. Participate in system design reviews and provide valuable SRE insights during launch reviews, influencing and enhancing system architecture.

Who you are:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 8+ years of experience in a senior-level role within Site Reliability Engineering or similar role, particularly in managing cloud-based services and infrastructure.
  • 5+ years of experience with software development in one or more programming languages.
  • 2+ years of experience managing people or teams, leading projects, and designing, analyzing, and troubleshooting distributed systems running in Cloud.
  • Strong knowledge of cloud platforms such as Google Cloud Platform, AWS, or Azure.
  • Practical experience with containerization technologies, including Docker and Kubernetes. Familiarity with infrastructure as code tools like Terraform is essential.
  • Solid understanding of networking, security principles, and best SRE and security practices.
  • Proficiency in using monitoring and alerting tools to detect and respond to potential issues effectively

We are a diverse bunch of people and we want to continue to attract and retain a diverse range of people into our organization. We're committed to an inclusive and diverse company. We do not discriminate based on gender, ethnicity, sexual orientation, religion, civil or family status, age, disability, or race.

Top Skills

Python
The Company
HQ: Palo Alto, CA
224 Employees
On-site Workplace
Year Founded: 2019

What We Do

Glean searches across all your company’s apps to help you find exactly what you need and discover the things you should know.

🔍 AI-powered workplace search.
💡 Personalized results and knowledge discovery.
⚡ Easy to use, ready to go— right out of the box.

Similar Jobs

Bangalore, Bengaluru, Karnataka, IND
60 Employees

Atlassian Logo Atlassian

Principal Machine Learning Engineer - AI / ML

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Bengaluru, Karnataka, IND
11000 Employees

Capital One Logo Capital One

Principal Associate - Full Stack Software Engineer

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
Bengaluru, Karnataka, IND
55000 Employees

Exabeam Logo Exabeam

Senior Software Engineer - Java, Kafka/PubSub, NoSQL

Artificial Intelligence • Information Technology • Machine Learning • Security • Software • Cybersecurity • Generative AI
Hybrid
Bangalore, Bengaluru, Karnataka, IND
850 Employees

Similar Companies Hiring

TrainingPeaks (A Peaksware Company) Thumbnail
Software • Fitness
Louisville, CO
69 Employees
bet365 Thumbnail
Software • Gaming • eSports • Digital Media • Automation
Denver, Colorado
6100 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account