[Remote] Principal Software Engineer - Spark
Note: This is a remote position open to candidates in the USA. Cloudera is a leading data partner for top companies across various industries, focusing on transforming complex data into actionable insights. The company is seeking a Principal Software Engineer to lead the technical roadmap and architectural vision for Cloudera Data Engineering, working with big data technologies such as Apache Spark and Airflow while collaborating with cross-functional teams.
Responsibilities
- Drive the multi-year technical roadmap and architectural vision for Cloudera Data Engineering
- Gain deep technical knowledge across the data services technical stack, with a focus on Spark, Airflow, and Iceberg, and apply this expertise in your daily work
- Foster engineering excellence through technical mentorship, design reviews, and architectural guidance
- Collaborate with product, engineering, and cross-functional partners, leading the delivery of several large, critical features in Cloudera’s data engineering experience
- Work on large-scale distributed systems, ranging from hundreds to thousands of nodes in production clusters
- Bring passion for programming, clean coding practices, attention to detail, and a strong focus on quality
Skills
- Relevant studies / BS or MS in Computer Science or related field
- 10+ years of experience as a Software Engineer in the data infrastructure space
- Strong understanding of at least one of the following languages: Java, Scala, C++, Python, Go, and an interest in learning the languages we use
- Passionate about programming, clean coding habits, attention to detail, and focus on quality
- Deep expertise in distributed data processing systems and cloud-native architectures
- Excellent communication and collaboration skills
- Experience with containerization (Kubernetes, Docker)
- Experience using or developing Apache Spark, Airflow, or related technologies
- Experience with public cloud (AWS/Azure/GCP) and/or private cloud (OpenShift/Rancher)
- (Most importantly) An open-minded attitude, desire to learn new things and build great products
- Contributed to open-source projects
- Strong understanding of modern Lakehouse architectures, open table formats, and metadata/catalog services
- Experience with large-scale, distributed systems design and development with an understanding of scaling, performance, and scheduling
- Solid experience with at least one cloud service (AWS, Azure, GCP, OpenShift)
Benefits
- Generous PTO Policy
- Support for work-life balance with Unplugged Days
- Flexible WFH Policy
- Mental & Physical Wellness programs
- Phone and Internet Reimbursement program
- Access to Continued Career Development
- Comprehensive Benefits and Competitive Packages
- Paid Volunteer Time
- Employee Resource Groups
Company Overview