About Onehouse
Onehouse is a mission-driven company dedicated to freeing data from data platform lock-in. We deliver the industry’s most interoperable data lakehouse through a cloud-native managed service built on Apache Hudi. Onehouse enables organizations to ingest data at scale with minute-level freshness, store it centrally, and make it available to any downstream query engine and use case (from traditional analytics to real-time AI / ML).
We are a team of self-driven, inspired, and seasoned builders who have created large-scale data systems and globally distributed platforms that sit at the heart of some of the largest enterprises, including Uber, Snowflake, AWS, LinkedIn, Confluent and many more. Riding off a fresh $35M Series B backed by Craft, Greylock and Addition Ventures, we're now at $68M total funding and looking for rising talent to grow with us and become future leaders of the team. Come help us build the world's best fully managed and self-optimizing data lake platform!
The Community You Will Join
When you join Onehouse, you're joining a team of passionate professionals tackling the deeply technical challenges of building a two-sided engineering product. Our engineering team serves as the bridge between the worlds of open source and enterprise: contributing directly to and growing Apache Hudi (already used at scale by global enterprises like Uber, Amazon, ByteDance, etc.) and concurrently defining a new industry category: the transactional data lake. The Data Infrastructure team is the grounding heartbeat of all of this. We live and breathe databases, building cornerstone infrastructure by working under Hudi's hood to solve incredibly complex optimization and systems problems.
The Impact You Will Drive:
- As a software engineer at Onehouse, you will contribute directly to Apache Hudi and the surrounding open source ecosystem, while deploying and operating these technologies at massive scale for our customers.
- Accelerate our open source enterprise flywheel by working on the guts of Apache Hudi's transactional engine and optimizing it for diverse Onehouse customer workloads.
- Act as an SME to deepen our teams' expertise on database internals, query engines, storage and/or stream processing.
A Typical Day:
- Build systems that enable users to manage petabytes of data with a fully managed cloud service.
- Build functionality that enables data systems to be cloud native (self managed), scalable (auto scaling) and secure (different levels of access control).
- Build scalable job management on Kubernetes to ingest, store, manage and optimize petabytes of data on cloud storage.
- Design systems that help scale and streamline metadata and data access from different query/compute engines.
- Exhibit full ownership of product features, including design and implementation, from concept to completion.
- Be passionate about designing for future scale and high availability, while possessing a deep understanding of common failure patterns and their remediations.
- Uphold a high engineering bar around the code, monitoring, operations, automated testing, and release management of the platform.
What You Bring to the Table:
- 3+ years of experience as a software engineer with experience developing distributed systems.
- Strong, object-oriented design and coding skills with Java.
- Experience with inner workings of distributed (multi-tiered) systems, algorithms, and relational databases.
- Deal well with ambiguous/undefined problems; ability to think abstractly; articulate technical challenges and solutions.
- Speed and hustle: ability to prioritize across feature development and tech debt.
- Ability to solve complex programming/optimization problems.
- Ability to quickly prototype optimization solutions and analyze large/complex data.
- Clear communication skills.
Nice to haves (but not required):
- Experience working on database systems, Query Engines or Spark codebases.
- Experience working on cloud based (data focused) services.
- Deep understanding of Spark, Flink, Presto, Hive, Parquet internals.
- Hands-on experience with open source projects like Hadoop, Hive, Delta Lake, Hudi, Nifi, Drill, Pulsar, Druid, Pinot, etc.
How We'll Take Care of You
- Competitive Compensation; the estimated base salary range for this role is $200,000 - $220,000
- Equity Compensation; our success is your success with eligible participation in our company equity plan
- Health & Well-being; we'll invest in your physical and mental well-being with up to 90% health coverage (50% for spouses/dependents) including comprehensive medical, dental & vision benefits
- Financial Future; we'll invest in your financial well-being by making this role eligible to contribute to our company 401(k) or Roth 401(k) retirement plan
- Location; we are a remote-friendly company (internationally distributed across N. America + India), though some roles will be subject to in-person requirements in alignment with the needs of the business
- Generous Time Off; unlimited PTO (mandatory 1 week/year minimum), uncapped sick days and 11 paid company holidays
- Company Camaraderie; Annual company offsites and Quarterly team onsites @Sunnyvale HQ
- Food & Meal Allowance; weekly lunch stipend, in-office snacks/drinks
- Equipment; we'll provide you with the equipment you need to be successful and a one-time $500 stipend for your initial desk setup
- Child Bonding!; 8 weeks off for parents (birthing, non-birthing, adoptive, foster, child placement, new guardianship) - fully paid so you can focus your energy on your newest addition
House Values
One Team
Optimize for the company, your team, self - in that order. We may fight long and hard in the trenches, but we take care of our co-workers with empathy. We give more than we take to build the one house that everyone dreams of being part of.
Tough & Persevering
We are building our company in a very large, fast-growing but highly competitive space. Life will get tough sometimes. We take hardships in stride, stay positive, focus all energy on the path forward and develop a champion's mindset to overcome the odds. Always day one!
Keep Making It Better Always
Rome was not built in a day; if we can get 1% better each day for one year, we'll end up thirty-seven times better. This means being organized, communicating promptly, taking even small tasks seriously, tracking all small ideas, and paying it forward.
Think Big, Act Fast
We have tremendous scope for innovation, but we will still be judged by impact over time. Big, bold ideas still need to be strategized against priorities, broken down, set in rapid motion, measured, refined, and repeated. Great execution is what separates promising companies from proven unicorns.
Be Customer Obsessed
Everyone has the responsibility to drive towards the best experience for the customer, be it an OSS user or a paid customer. If something is broken, own it, say something, do something; never ignore it. Be the change that you want to see in the company.
Pay Range Transparency
Onehouse is committed to fair and equitable compensation practices. Our job titles may span more than one career level. The pay range for this role is listed above and represents the base salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are dependent upon several factors that are unique to each candidate, including but not limited to: job-related skills, depth of transferable experience, relevant certifications and training, business needs, market demands and specific work location. Based on the factors above, Onehouse utilizes the full width of the range; the base pay range is subject to change and may be modified in the future. The total compensation package for this position will also include eligibility for equity options and the benefits listed above.
What We Do
Onehouse delivers a universal data lakehouse through a cloud-native managed lakehouse service built on Apache Hudi, which was created by the founding team while they were at Uber. Onehouse makes it possible to blend the ease of use of a warehouse with the scale of a data lake, by offering a seamless experience for engineers to get their data lakes up and running. Onehouse offers the widest interoperability for your data in the market across table formats, multiple compute engines and multiple cloud providers.
We have a stellar team of inspired, seasoned professionals including data, distributed systems, and platform engineers from Uber, LinkedIn, Confluent, and Amazon. Our product team has helped build enterprise data products at major enterprises including Azure Databricks.