[Remote] Platform Engineer
Note: This is a remote position open to candidates in the USA. Prosum is seeking a Principal Linux Platform Engineer who is passionate about Linux, virtualization, automation, and enterprise infrastructure. This role is responsible for engineering and operating highly scalable Linux environments that support mission-critical healthcare applications across the enterprise.
Responsibilities
- Administer and maintain a large-scale Red Hat Enterprise Linux environment deployed on our High-Performance Compute infrastructure
- Design, deploy, and support KVM-based virtualization platforms
- Build and maintain automation solutions using Ansible, Python, and shell scripting
- Configure and support IBM Spectrum Scale (GPFS) storage environments
- Implement system security, hardening, encryption, and compliance controls
- Manage Linux networking, including:
  - Bonding
  - Bridging
  - VLANs
  - macvtap
- Configure and administer Linux storage technologies, including:
  - LVM
  - dm-crypt
  - LUKS2
- Troubleshoot complex operating system, virtualization, networking, and storage issues
- Collaborate with Kubernetes, network, security, and application teams to deliver reliable infrastructure services
- Drive platform modernization through automation and standardization initiatives
Skills
- Bachelor's degree or equivalent experience
- 10+ years of Linux systems administration and engineering experience
- Deep expertise with Red Hat Enterprise Linux
- Strong experience deploying and supporting KVM virtualization environments
- Expert-level knowledge of:
  - Linux operating systems
  - SELinux
  - Linux firewalls
  - Package management
  - System performance tuning
- Experience with Ansible automation and Infrastructure-as-Code methodologies
- Proficiency in shell scripting and at least one programming language such as Python, Go, Rust, Java, or C
- Strong knowledge of storage, encryption, and Linux filesystem technologies
- Excellent troubleshooting and analytical skills
- Strong verbal and written communication skills
- Experience implementing and supporting enterprise monitoring, logging, and alerting solutions using Grafana, Prometheus, Loki, Alertmanager, and Thanos
- Strong experience automating infrastructure deployment, configuration, and operational processes using Ansible, scripting, and Infrastructure-as-Code practices
- Demonstrated security-first mindset with expertise in platform hardening, identity and access management, vulnerability remediation, encryption, and regulatory compliance
- Experience administering Red Hat Identity Management (IdM), including LDAP, Kerberos, SSSD, certificate management, and enterprise authentication services
- Experience with cross-datacenter high availability, failover, and load balancing (F5 and HAProxy) across multiple datacenters and Kubernetes clusters
- Red Hat certifications or equivalent practical experience
- Experience with large-scale KVM virtualization deployments
- Knowledge of Kubernetes and OpenShift environments
- Experience with disaster recovery, resiliency, and high-availability solutions
- Familiarity with distributed storage platforms such as IBM Spectrum Scale (GPFS)
- Experience supporting enterprise-scale Linux environments in regulated industries
- Candidates with experience in any of the following will stand out:
  - IBM LinuxONE and IBM Z environments
  - s390x Linux architecture
  - Logical Partitions (LPARs)
  - OSA, FCP, RoCE, and Crypto Express adapters
  - HMC administration and DPM mode
  - SAN architectures and Brocade zoning
Company Overview