Senior Site Reliability Engineer

Reposted 18 Days Ago
San Jose, CA
Senior level
Artificial Intelligence • Cloud • Fintech • Healthtech • Biotech
The Role
The Senior Site Reliability Engineer will design, develop, and manage cloud infrastructure and enhance system reliability through automation and monitoring frameworks.
Summary Generated by Built In

Description

About the company

The company is the leading destination for short-form mobile video and the largest unicorn startup. It is now the leader in short-form video hosting. Its app has surpassed 1.3 billion mobile downloads in the United States and 2 billion worldwide. With 1.5 billion monthly active users worldwide, it ranks as one of the most popular social entertainment apps.

About the team

Our data infrastructure Site Reliability Engineering (SRE) team is a pioneer in innovation. We merge software development and infrastructure operations to design, build, and manage large-scale, highly distributed systems, and we take pride in overseeing one of the industry's most extensive cloud infrastructures. As software development evolves, building systems from a mix of components has become the new standard, and SRE takes a central role in this era. The role demands the ability to design, develop, and operate these components, transforming them into cloud-managed, scalable, and reliable elements. Our engineers act as connectors, ensuring these diverse components integrate seamlessly to deliver high-performing systems. SRE is about actively shaping the future of technology, not just keeping pace with it, and we contribute significantly to the next chapter of data infrastructure. We are currently building teams around the world. Join us today and embark on this transformative journey!

Responsibilities:

- Participate in and enhance the complete service lifecycle, from inception and design, through development, capacity planning, launch reviews, deployment, operation, and refinement.

- Design and implement software platforms and monitoring frameworks to govern service-oriented architecture (SOA) efficiently, automatically, and intelligently.

- Develop and manage components of cloud-managed data infrastructure, encompassing technologies such as Kubernetes, Redis, MySQL, Flink, and more.

- Establish sustainable mechanisms for scaling systems, such as automation, to drive enhancements in reliability, efficiency, and velocity.

- Provide sustainable user support, manage incident responses, and conduct blameless postmortems as part of our ongoing efforts to improve our systems.

Requirements

• Bachelor's degree in Computer Science or a related technical field with 5+ years of experience

• Experience programming in one of the following languages: C, C++, Java, Python, Go, or Rust

• Familiarity with Unix/Linux system internals, networking, and distributed systems

• [Preferred] Experience in MySQL, Redis, Nginx, Kubernetes, Docker, OpenStack, Hadoop, Spark, Flink, etc.

• [Preferred] Experience in designing and analyzing large-scale distributed systems

• [Preferred] Strong skills in problem solving and communication

• [Preferred] Bilingual in Mandarin and English

Benefits

Our company benefits are designed to convey company culture and values, to create an efficient and inspiring work environment, and to support our employees in giving their best in both work and life. We offer the following benefits to eligible employees:

We cover 100% of the premium for employee medical insurance and approximately 75% of the premium for dependents, and we offer a Health Savings Account (HSA) with a company match. We also offer Dental, Vision, Short/Long Term Disability, Basic Life, Voluntary Life, and AD&D insurance plans, as well as Flexible Spending Account (FSA) options such as Health Care, Limited Purpose, and Dependent Care.

Our time off and leave plans are: 10 paid holidays per year plus 17 days of Paid Personal Time Off (PPTO) (prorated upon hire and increased by tenure), 10 paid sick days per year, 12 weeks of paid parental leave, and 8 weeks of paid Supplemental Disability.

We also provide generous benefits such as mental and emotional health benefits through our EAP and Lyra, a 401K company match, and gym and cellphone service reimbursements. The Company reserves the right to modify or change these benefits programs at any time, with or without notice.

Top Skills

C
C++
Docker
Flink
Go
Hadoop
Java
Kubernetes
MySQL
Nginx
OpenStack
Python
Redis
Rust
Spark

The Company
HQ: San Jose, CA
55 Employees
Hybrid Workplace
Year Founded: 2015

What We Do

We are HireIO, the workforce solutions provider that tomorrow’s tech giants count on to connect with today’s tech geniuses. We help create an impact on the tech community by partnering with teams and professionals who specialize in FinTech, Cloud/SaaS, healthcare, biotech, A.I., and other emerging technologies, helping them grow through new opportunities while supporting equal opportunity.

