[Remote] AI Infrastructure Consultant
Note: The job is a remote job and is open to candidates in USA. Erie Data Partners LLC is a boutique Technology Talent Solutions firm specializing in Data & AI, Cloud Infrastructure, and more. They are seeking an experienced AI Infrastructure Consultant to support an enterprise client in building and scaling modern AI platforms, primarily focusing on cloud infrastructure and AI applications.
Responsibilities
- Design and implement scalable AI infrastructure in cloud environments primarily with AWS and Azure
- Build and optimize GPU-enabled compute environments for AI and machine learning workloads
- Architect containerized solutions using Kubernetes and Docker
- Deploy and manage AI platforms utilizing services such as Azure AI Foundry, Azure OpenAI, AWS Bedrock or similar technologies
- Develop Infrastructure-as-Code (IaC) solutions using Terraform, Bicep, ARM Templates, or CloudFormation
- Partner with Data Engineers, Machine Learning Engineers, and Software Development teams to support production AI solutions
- Implement CI/CD pipelines supporting AI and MLOps workflows
- Monitor infrastructure performance, scalability, security, and cost optimization
- Establish best practices for AI platform governance, security, and operational excellence
- Troubleshoot complex cloud infrastructure and distributed systems
Skills
- 5+ years of experience designing cloud infrastructure in enterprise environments
- Hands-on experience with Microsoft Azure or AWS
- Experience supporting AI or machine learning infrastructure in production
- Strong knowledge of Kubernetes, Docker, and container orchestration
- Experience with Infrastructure as Code (Terraform preferred)
- Experience with Linux administration and cloud networking
- Strong understanding of cloud security, identity management, and networking principles
- Excellent communication and client-facing consulting experience
- Azure AI Foundry
- Azure OpenAI Service
- AWS Bedrock
- Google Vertex AI
- NVIDIA GPU infrastructure
- MLflow
- Kubeflow
- Azure Machine Learning
- Databricks
- Snowflake
- Apache Spark
- Python
- GitHub Actions or Azure DevOps
- MLOps frameworks and model deployment pipelines
- Retrieval-Augmented Generation (RAG)
- Vector databases (Pinecone, Weaviate, ChromaDB, Azure AI Search)
Benefits
- Mainly Remote
- Flexible Engagement Length
- Opportunity to support cutting-edge AI initiatives across enterprise organizations
- May require some travel to client site
Company Overview