Senior Site Reliability Engineer - GetFeedback Digital Team

Posted Yesterday
Be an Early Applicant
Bengaluru, Karnataka
Senior level
Digital Media • Software • Analytics
The Role
The Senior Site Reliability Engineer will design, engineer and maintain AWS environments, automate infrastructure provisioning, and support observability systems. They will mentor other engineers and collaborate on building scalable cloud infrastructure, focusing on best practices in deployment and system performance.
Summary Generated by Built In

SurveyMonkey is the world’s most popular platform for surveys and forms, built for business—loved by users. We combine powerful capabilities with intuitive design, effectively serving every use case, from customer experience to employee engagement, market research to payment and registration forms. With built-in research expertise and AI-assisted technology, it’s like having a team of expert researchers right at your fingertips.
Trusted by millions—from startups to Fortune 500 companies—SurveyMonkey helps teams gather insights and information that inspire better decisions, create experiences people love, and drive business growth. Discover how at surveymonkey.com.

What we’re looking for

As a member of the Infrastructure team at Survey Monkey, you will have a direct impact in designing, engineering and maintaining our Cloud, Messaging and Observability Platform. Solutioning with best practices, deployment processes, architecture, and support the ongoing operation of our multi-tenant AWS environments. This role presents a prime opportunity for building world-class infrastructure, solving complex problems at scale, learning new technologies and offering mentorship to other engineers.

What you’ll be working on

  • Architect, build, and operate AWS environments at scale with well-established industry best practices
  • Automating infrastructure provisioning, DevOps, and/or continuous integration/delivery
  • Support and maintain AWS services, such as EKS
  • Write libraries and APIs that provide a simple, unified interface to other developers when they use our monitoring, logging, and event-processing systems
  • Support and partner with other teams on improving our observability systems to monitor site stability and performance
  • Work closely with developers in supporting new features and services.
  • Work in a highly collaborative team environment
  • Serving as a technical mentor
  • Participate in on-call rotation

We’d love to hear from people with

Experience and Background:

  • 5-8+ years of professional experience in relevant fields, with solid engineering expertise.
  • Over 5 years of hands-on experience in production environments using AWS services, including EKS, EC2, IAM, Kafka, Redis, Memcache, and CloudWatch.

Cloud Infrastructure and Tools:

  • Experience with cloud platforms such as AWS, Azure, or Google Cloud.
  • Familiarity with observability tools like Splunk, OpenTelemetry, New Relic, Datadog, and Grafana/Prometheus.

Configuration Management and Automation:

  • Proficient in configuration management and orchestration tools such as Ansible, Terraform, and CloudFormation.
  • Strong expertise in Docker and Kubernetes for containerization and orchestration.
  • Knowledge of GitOps practices and tools like ArgoCD or FluxCD.

Scripting and Programming:

  • Proficient in scripting languages (Bash, Python, YAML) for system automation.
  • Experience instrumenting Python and Node.js applications to send metrics, traces, and logs to third-party observability tools.

Data Management and Analysis:

  • Familiarity with databases and caching technologies, including PostgreSQL, MongoDB, Memcached, Redis, and Kafka.
  • Experience with metrics and logging libraries, data aggregation, and visualization tools, particularly Splunk and OpenTelemetry.

Collaboration and Communication:

  • Excellent communication skills, capable of collaborating effectively with both co-located and remote teams.
  • Ability to listen and partner to understand requirements, troubleshoot problems, and promote platform adoption.

Agile and Development Practices:

  • Experience working in agile environments and utilizing JIRA.
  • Familiarity with GitHub and GitHub Actions in software engineering or DevOps contexts.
  • Preferably experienced with secrets management, particularly HashiCorp Vault.

­­Additional Skills:

  • Strong troubleshooting skills across systems, networks, and application code.
  • Genuine interest in the instrumentation and optimization of Kubernetes clusters.
  • Understanding of scalable and high-performance application architecture.

This opportunity is hybrid and requires you to work from the SurveyMonkey office in Bengaluru 3 days per week.
#LI-Hybrid

Why SurveyMonkey? We’re glad you asked 

SurveyMonkey is a place where the curious come to grow.  We’re building an inclusive workplace where people of every background can excel no matter their time zone. At SurveyMonkey, we weave employee feedback and our core values into everything we do to create forward-looking benefits policies, employee programs, and an award-winning culture, including our annual holiday refresh, our annual week of service, learning and development opportunities like Curiosity Week, and our C.H.O.I.C.E Fund. 

Our commitment to an inclusive workplace

SurveyMonkey is an equal opportunity employer committed to providing a workplace free from harassment and discrimination. We celebrate the unique differences of our employees because that is what drives curiosity, innovation, and the success of our business. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate. Accommodations are available for applicants with disabilities.

Top Skills

Ansible
AWS
Azure
Bash
CloudFormation
Cloudwatch
Datadog
Docker
Ec2
Eks
GCP
Grafana
Iam
Kafka
Kubernetes
Memcache
MongoDB
New Relic
Opentelemetry
Postgres
Prometheus
Python
Redis
Splunk
Terraform
Yaml
The Company
HQ: San Mateo, CA
1,767 Employees
On-site Workplace
Year Founded: 1999

What We Do

We deliver intuitive, people-centric solutions that help industry leaders quickly and confidently make important decisions, take action, and achieve tangible results. Our AI-powered platform is built with a purposeful balance of humanity and technology, weaving together over 20 years of experience with data derived from billions of real questions and responses. Today, we offer enterprise solutions for agile experience management and insights by our three product brands: Momentive, GetFeedback, and SurveyMonkey.

Similar Jobs

Take-Two Interactive Software Logo Take-Two Interactive Software

SRE I

Gaming • Information Technology • Mobile • Software
Bengaluru, Karnataka, IND
6500 Employees

Exabeam Logo Exabeam

Senior Site Reliability Engineer

Artificial Intelligence • Information Technology • Machine Learning • Security • Software • Cybersecurity • Generative AI
Hybrid
Bangalore, Bengaluru, Karnataka, IND
850 Employees
Hybrid
Bengaluru, Karnataka, IND
289097 Employees

Spotnana Logo Spotnana

Staff Site Reliability Engineer

Big Data • Cloud • Information Technology • Software • Travel
Easy Apply
Bengaluru, Bengaluru Urban, Karnataka, IND
356 Employees

Similar Companies Hiring

InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees
RunPod Thumbnail
Software • Infrastructure as a Service (IaaS) • Cloud • Artificial Intelligence
Charlotte, North Carolina
53 Employees
Hedra Thumbnail
Software • News + Entertainment • Marketing Tech • Generative AI • Enterprise Web • Digital Media • Consumer Web
San Francisco, CA
14 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account