Site Reliability Engineer - Data Infrastructure, AD/ADAS

Posted 21 Days Ago
Be an Early Applicant
Tokyo
Hybrid
Senior level
Automotive • Software • Automation
The Role
The Site Reliability Engineer will design, build, and maintain large-scale multi-cloud infrastructure for a data platform focused on autonomous driving technology. Their responsibilities include ensuring system reliability, managing incidents, collaborating with other engineers, and mentoring junior staff, while contributing to long-term system strategies.
Summary Generated by Built In

About Woven by Toyota

Woven by Toyota, a part of the Toyota Group, is challenging the current state of mobility through human-centric innovation and empowering mobility transformation. Through our AD/ADAS technology, our automotive software development platform Arene OS, our mobility test course Toyota Woven City, and Toyota’s growth fund, Woven Capital, we are pioneering the movement of people, goods, information, and energy, weaving a future of enhanced safety, connectivity and well-being for all.


=========================================================================


TEAM

Our data platform team is working on accelerating autonomous driving by providing access to petabytes of data collected by our fleet of autonomous and non-autonomous vehicles. Efficient, fast and cost-effective access to data at large scale is key to tackle the hardest problems in AD/ADAS, from developing the Machine Learning (ML) models for perception and prediction of human driving patterns, to increasing the sophistication of our validation and simulation by identifying rare and interesting real-world driving situations. The data ecosystem developed by the Data Infrastructure team is a key building block for developing and testing modern AD/ADAS products that will impact millions of customers. 


Our ML and Data pipelines are built on-top of the open-source Flyte orchestration framework and are deployed to AWS. Pipeline code is written in Python. We leverage AWS S3, GCP BigQuery and ElasticSearch for data storage and search. We schedule our workloads on AWS EKS. Our infrastructure is spread across multiple regions and multiple cloud providers. We believe strongly in automation and testing to ensure delivery of robust and correct systems. We are a distributed team, working in Japan, the UK and the US.


WHO ARE WE LOOKING FOR?

The Data Infrastructure team is looking for engineers who are passionate about and enable the next generation of automotive software development. The right candidate will have excellent communication skills, solid coding skills, expertise in building scalable, reliable, highly available and fault-tolerant systems, broad knowledge of software engineering and site reliability engineering in areas such as Large-Scale Data and Compute Infrastructure, Stream Processing, Kubernetes, High-Performance Networking, Observability and Infrastructure Automation.

RESPONSIBILITIES

  • Design, build, maintain, optimize and support large scale, multi-region, multi-cloud compute and storage infrastructure powering our data platform and mission critical services
  • Work with fellow Data Infrastructure engineers and Site Reliability engineers to ensure our systems are scalable, reliable, fault-tolerant, highly available, highly performant, and observable
  • Manage incidents, triage product or system issues and debug/track/resolve by analyzing the root cause of these issues and the impact on users & operations
  • Work closely with other Data Infrastructure engineers, Site Reliability engineers, ML Platform engineers, Computer Vision and ML engineers on high-impact projects to create innovative solutions to problems in the self-drive space
  • Mentor junior engineers in their day to day work and drive best practices across the organization
  • Contribute to the long term strategy for several of our systems and products

MINIMUM QUALIFICATIONS

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience
  • 5+ years of experience with data structures/algorithms and professional software engineering in one or more programming languages (e.g., Python, Go, Java, C, C++)
  • 3+ years of experience as a Site Reliability Engineer, working with Terraform, Docker, cloud-native technologies, networking and Kubernetes in production
  • Experience designing, deploying, monitoring and maintaining large-scale, fault-tolerant multi-region and/or multi-cloud distributed systems
  • Ability to debug & optimize code, to troubleshoot distributed systems and to automate routine tasks
  • Business-level proficiency in English speaking, reading and writing (e.g., technical documents, software documentation)

NICE TO HAVES

  • Master’s degree in Computer Science
  • Experience working as a Software Engineer on data-intensive applications, data platforms, data pipelines, workflow orchestration, batch processing, and/or distributed databases
  • Experience working with RPC protocols and their formats, e.g., gRPC/protobuf, Apache Avro, etc.
  • Experience with cloud-based (e.g. AWS, GCP, Azure) microservice architecture, event-driven, distributed architectures
  • Experience working in a fast-paced environment, collaborating across teams and disciplines
  • Experience with data governance, data privacy and security
  • Business-level proficiency in Japanese

=========================================================================

Important Points

・All interviews will be arranged via Google Meet, unless otherwise stated.

・The same job descriptions are available in both English and Japanese; therefore, we kindly ask that you apply to only one version.

・We kindly request that you submit your resume in English, if possible. However, Japanese resumes are also acceptable. Please note that, depending on the English proficiency requirements of the role, we may request an English version of your resume later in the process.


WHAT WE OFFER

・Competitive Salary - Based on experience

・Work Hours - Flexible working time

・Paid Holiday - 20 days per year (prorated)

・Sick Leave - 6 days per year (prorated)

・Holiday - Sat & Sun, Japanese National Holidays, and other days defined by our company

・Japanese Social Insurance - Health Insurance, Pension, Workers’ Comp, and Unemployment Insurance, Long-term care insurance

・Housing Allowance

・Retirement Benefits

・Rental Cars Support

・In-house Training Program (software study/language study)


Our Commitment

・We are an equal opportunity employer and value diversity.

・Any information we receive from you will be used only in the hiring and onboarding process. Please see our privacy notice for more details.

Top Skills

C
C++
Go
Java
Python
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Palo Alto, , California
1,679 Employees
On-site Workplace
Year Founded: 2023

What We Do

Woven by Toyota will help to deliver the safest, most intelligent mobility experiences and lifestyle for Toyota customers everywhere. At Woven by Toyota, we envision a human-centered future, where world-class technology expands global access to mobility, enhances the capabilities of drivers, and empowers people to thrive. We achieve this with a unique global culture that weaves modern Silicon Valley innovation with Japanese quality craftsmanship. As society, technology and customer needs evolve, we continuously pursue new ways to create a more personal, seamless experience for customers.

Similar Jobs

Qualtrics Logo Qualtrics

Solution Engineer - Tokyo

Artificial Intelligence • Information Technology • Natural Language Processing • Software • Business Intelligence • Generative AI
Tokyo, JPN
5000 Employees

Cloudflare Logo Cloudflare

Senior Technical Account Manager

Cloud • Information Technology • Security • Software • Cybersecurity
Hybrid
Tokyo, JPN
3900 Employees

UL Solutions Logo UL Solutions

Automotive Cybersecurity Assessor / Engineer

Automotive • Professional Services • Software • Consulting • Energy • Chemical • Renewable Energy
Hybrid
Tokyo, JPN
15000 Employees

UL Solutions Logo UL Solutions

Functional Safety Project Engineer

Automotive • Professional Services • Software • Consulting • Energy • Chemical • Renewable Energy
Hybrid
Tokyo, JPN
15000 Employees

Similar Companies Hiring

HERE Technologies Thumbnail
Software • Logistics • Internet of Things • Information Technology • Computer Vision • Automotive • Artificial Intelligence
Amsterdam, NL
6000 Employees
True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
52 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account