[Remote] Staff Data Engineer
Note: The job is a remote job and is open to candidates in USA. Shift5 is building the data platform for onboard operational technology (OT). We are seeking a Staff Data Engineer to design, build, and maintain data processing, storage, and integration pipelines while collaborating with various teams to create impactful products.
Responsibilities
- Design, build, and maintain robust, scalable batch and streaming data processing, storage, and integration pipelines using technologies such as Apache Airflow or Benthos
- Interpret requirements and design specifications, taking full ownership of building features from the ground up
- Write clean, well-documented, scalable, extensible, and testable code to ensure application quality and maintainability
- Partner with data scientists and engineers to create semantic data models representing complex vehicular systems, and integrate applications cleanly across other Shift5 componentry
- Build scalable data products for data scientists, transportation engineers, and executives to drive insights and decision-making
- Create efficient, reliable, cost-effective, dynamically scalable, and observable solutions utilizing AWS cloud services
- Analyze complex data sets to create data ontologies, verify data quality/integrity, and ensure data accuracy throughout pipelines
- Support the design process and occasionally travel to customer sites (estimated a few times per year) to collaborate with Field Engineers on data integration and deployment
Skills
- Engineering Experience: 6+ years of software/data engineering experience with a deep understanding of software engineering practices and concepts
- Core Languages: 6+ years of experience with a major programming language (GoLang, Java, or Python)
- Databases & Big Data: 6+ years of relational database experience (PostgreSQL, MySQL, Oracle, etc.) alongside 6+ years of experience with Big Data (Hadoop, Spark) and Data Modeling
- Cloud & Containers: 4+ years of experience with containerization and cloud services (Docker, Kubernetes) and cloud monitoring tools
- Data Pipelines & Stack: Expertise with batch and streaming data pipelines (Apache Airflow, Benthos) and familiarity with modern data stack components (ingestion, transformation, orchestration)
- U.S. citizenship required and ability to obtain a security clearance
- Education: MS in Computer Science, Cybersecurity, Cyber Intelligence, or equivalent
- Data Architecture: Experience and understanding of data lakes/warehouses (e.g., Snowflake, Databricks, Redshift)
- DevOps Practices: Proficiency with CI/CD, source control, design reviews, and integrating observable practices
- Advanced Languages & Tools: Experience with Rust, SDK design, and implementation (including code gen of type-specific bindings to Python, Go, TypeScript)
- Travel Flexibility: Willingness to travel occasionally to customer and partner sites to support field integration and deployment efforts
Benefits
- Bonus program and equity in a fast-growing startup
- Competitive medical, dental, and vision coverage for employees and their families
- Health Savings Account with annual employer contributions
- Employer-paid Life and Disability Insurance
- Uncapped paid time off policy
- Flexible work & remote work policy
- Tax-deferred public transit benefits with Metro SmartBenefits (DC/MD/VA)
Company Overview