[Remote] Data Engineer
Note: The job is a remote job and is open to candidates in USA. Cutsforth is seeking a skilled and innovative Data Engineer to join their team and help shape the future of data usage to drive operational outcomes. The role involves designing and maintaining scalable data pipelines, preparing high-quality datasets, and collaborating with the data science team to support AI/ML workflows.
Responsibilities
- Regularly design, develop, and maintain data pipelines that support operational, analytical, and machine learning workloads
- Write clean, efficient, and well-documented Python code for data processing and automation tasks
- Support the integration and monitoring of AI/ML models in production by delivering reliable, well-structured data pipelines
- Communicate technical findings and recommendations clearly to both technical and non-technical stakeholders across teams
- Participate in cross-functional meetings and planning sessions to align data engineering efforts with broader business goals
- Stay current with advancements in data engineering tools, techniques, and industry-specific applications, particularly within power, oil, and gas environments
- Document processes, data models, and systems in a way that supports knowledge sharing and team continuity
Skills
- 2-5 years of experience in data engineering, data science, or a related technical role
- Experience with AI/ML workflows and the data needs of model development, including supporting model deployment in production environments
- 1-4 years of hands-on experience with Databricks for building and managing data pipelines
- 1-2 years of hands-on experience with Apache Spark for distributed data processing
- 1-3 years of experience with Microsoft Azure; equivalent experience with AWS or GCP may substitute
- Strong proficiency in Python for data engineering pipelines
- Strong proficiency in SQL for querying, transforming, and validating large datasets
- Experience with Git and standard version control workflows, including branching, pull requests, and code review
- Experience designing and managing end-to-end data pipelines (data ingestion to transformation to validation to serving)
- Strong data system design skills, including data modeling and architecting scalable, reliable, and maintainable data systems
- Demonstrated ability to collaborate effectively across cross-functional teams
- Bachelor's degree in Computer Science, Data Science, Engineering, Mathematics, or a related field
- Master's degree in a relevant technical discipline
- Experience working with machine health datasets and/or condition monitoring applications
- Background in the power, oil, or gas industry - understanding of operational data, equipment telemetry, or industrial IoT environments is a strong plus
- Experience with workflow orchestration tools
Benefits
- Paid Time Off
- Medical, Vision, Dental Insurance
- Health Savings Account with Employer contributions
- 401(k) with Employer match
- Short-term & Long-term Disability Coverage
- Accidental Death & Dismemberment Coverage
- Life Insurance Coverage
- Eight paid holidays per year
- All other benefits required by applicable law
Company Overview