Staff Site Reliability Engineer

Posted 9 Days Ago
Palo Alto, CA
176K-229K Annually
Senior level
HR Tech • Information Technology • Software
The Role
The Staff Site Reliability Engineer will lead cloud infrastructure automation, enhance monitoring capabilities, and ensure system reliability while collaborating with DevOps and engineering teams.
Summary Generated by Built In

Velocity Global offers the most unified, tech-enabled, and customer service-driven global workforce management, ensuring smooth, reliable operations across countries, roles, and workforce types so businesses can navigate complexity with confidence, deliver strong results, and stay ahead. We help you expand your business into new markets without the complexity of setting up entities. We hire, pay, and manage your workforce across 185+ countries with our AI-powered global Workforce Technology platform.

Velocity Global seeks a Staff Site Reliability Engineer (Staff SRE) with extensive cloud engineering experience. In this role, you will lead, design, and help build the automation and support efforts for our cloud infrastructure; identify and execute strategies to improve our full-stack telemetry, monitoring, and alerting capabilities; and improve our overall SLAs.

We obsess over performance, scalability, privacy and security. SREs work cross-functionally with DevOps and Engineering teams, combining operations work with software engineering principles to enable high availability of production systems. You will serve as a partner to our Engineering organization to help make their services more performant, scalable, observable, and reliable. Every engineering team at Velocity Global should be responsible for the software they build. SREs are critical in providing the tools, practices, and expertise to make that happen.

You will be based in Palo Alto, California, and in-office collaboration is required for at least three days per week.

You Will:

  • Automate observability and alerting across an ever-changing landscape of microservices
  • Build automated Service Reliability Scorecards and Production Readiness Standards
  • Run Chaos Engineering and Game Day simulations to discover and test fixes for weak spots that would otherwise go unnoticed until a real production incident
  • Propose and drive software engineering projects that remove operational bottlenecks and increase velocity in ways we've never considered before
  • Expand and improve our observability and monitoring footprint
  • Collaborate with Engineering and DevOps teams to create architectural plans, define project requirements, and establish technical standards
  • Address common operational challenges by building tools and automation scripts
  • Serve on the Incident Response Team to help debug and drive resolution of production reliability issues, contribute to the postmortem, and work to prevent recurrence
  • Participate in design and production reviews for new features, products, or infrastructure
  • Audit and tune the configuration of systems owned by other engineering teams
  • Plan for the growth of Velocity Global's infrastructure and its reliability and resiliency
  • Design and implement the High Availability architecture underlying Velocity Global's platform
  • Create Disaster Recovery solutions, including backups, redundant systems, and emergency response processes
  • Collaborate with Architects and Engineering leaders on hiring, training, and mentoring talent

You Have:

  • Outstanding analytical skills with the ability to solve complex systems challenges and performance bottlenecks
  • Proficient knowledge of public cloud infrastructure, networking, architecture, and Linux as well as orchestration, monitoring, automation, and configuration management solutions
  • Practical knowledge of distributed service design and performance, including messaging protocols, caching, data residency, and observability
  • Passion for designing and evolving complex systems while also being able to support day-to-day infrastructure operations. We want someone who does not prefer one tool, but rather looks for the right tool for the job
  • A dedication to learning new techniques and technologies and sharing ideas with your fellow engineers, with a mastery of breaking down, discussing, and communicating technical concepts

Nice to Have:

  • 5-8 years of software engineering experience (depending on open role), preferably in Infrastructure Engineering.
  • 5-8 years of experience in highly scalable cloud architectures including service-oriented architectures (AWS and/or GCP experience preferred)
  • Ability to collaborate well and come up with maintainable, reliable solutions. Experience building scalable, high-performing systems.
  • Strong analytical and problem-solving skills.
  • Ability to provide both architectural guidance and detailed technical directions.
  • Excellent communication, collaboration and leadership skills



Our job titles may span more than one career level. The base pay depends upon many factors, such as training, transferable skills, work experience, business needs, and market demands. The base pay range is subject to change and may be modified. 

Annualized Pay Range

$176,000-$229,000 USD

We are dedicated to fostering diversity and inclusion across our organization, embracing the rich tapestry of cultures, backgrounds, and perspectives that our global team brings together in offices around the world. Velocity Global is an Equal Opportunity Employer committed to empowering individuals from all walks of life to achieve their professional goals with us, regardless of race, religion, gender, gender identity, pregnancy, disability, sexual orientation, age, national origin, citizenship status, or genetic information. We actively seek and encourage applications from diverse candidates, including those with disabilities, and offer accommodations throughout the selection process upon request.

Velocity Global offers a range of benefits tailored to the location and type of role. A general benefits overview is below:

  • Flexible Time Off + Parental Leave
  • Health and Dental Insurance (where applicable)
  • Retirement Savings + Employee Incentive Plan
  • WFH Stipend

Please visit our career page for more information.

Top Skills

Automation
AWS
Cloud Infrastructure
Configuration Management
GCP
Linux
Monitoring
Orchestration

The Company
HQ: Denver, CO
784 Employees
On-site Workplace
Year Founded: 2014

What We Do

Velocity Global accelerates the future of work for anyone, anywhere, anyhow. Its Global Work Platform™ simplifies the employer and talent experience through its proprietary cloud-based talent management technology, backed by personalized expertise and unmatched global scale. The platform offers a full suite of talent solutions, including global Employer of Record and Contractor Management, to help companies onboard, manage, and pay talent in more than 185 countries and all 50 United States.

Founded in 2014, the company has hundreds of employees across six continents, with plans to double in size in 2022.

As a work anywhere company, Velocity Global employees can create a work experience to fit their lifestyle, giving them the freedom to pursue their passion without the hassle of long commutes, rigid schedules and overlapping work-life demands. We empower people to live life fully, supported by a dynamic culture, growth opportunities and unique benefits. In the words of our founder, Ben Wright: “Professional experiences of a lifetime start here.”
