[Remote] AI/ML Engineer, Senior - WFH1659
Note: The job is a remote job and is open to candidates in USA. Global InfoTek, Inc. is seeking a Senior AI/ML Engineer to support an R&D program focused on passive RF emitter identification and network analysis from real-time sensor data streams. The role involves designing, building, and validating machine learning models, conducting exploratory data analysis, and implementing ML data pipelines while collaborating with research scientists and software engineers.
Responsibilities
- Design, build, and validate machine learning models for RF emitter identification including feature engineering from sensor data, training pipeline development, model evaluation, and iterative refinement based on results
- Conduct hands-on exploratory data analysis on RF sensor datasets using Python and Jupyter notebooks writing and running analytical code, characterizing feature distributions, identifying data quality issues, and producing documented findings
- Implement and maintain ML data pipelines ingesting NDF sensor streams, applying rollup and preprocessing logic, constructing training datasets, and ensuring pipeline correctness on constrained edge hardware with no cloud dependency
- Collaborate with the technical lead and Principal AI/ML Engineer to investigate RF sensor data quality, attribution reliability, and feature behavior under contention writing code to characterize error sources, validate assumptions, and reproduce findings
- Produce clear technical documentation of experiments, model configurations, and results maintaining reproducibility through disciplined versioning, and contributing to monthly status reports and team knowledge sharing
Skills
- 5+ years of hands-on applied experience in machine learning, data science, or RF signal processing
- Demonstrated proficiency in Python for ML and data science work PyTorch or TensorFlow for model development, Pandas/NumPy for data manipulation, and scikit-learn or similar for evaluation and baseline modeling
- Hands-on experience designing, training, and evaluating deep learning models particularly metric learning, Siamese networks, or other similarity-learning architectures on real-world, noisy, imbalanced datasets
- Practical experience handling real-world data quality problems missing values, label noise, class imbalance, systematic bias, and sensor artifacts and the ability to diagnose and address them without discarding valid data
- Ability to develop and run ML pipelines on Linux-based systems without cloud infrastructure or GPU acceleration optimizing for CPU-only inference and multi-threaded data processing on resource-constrained x86 hardware
- Familiarity with RF signal characteristics, passive receiver phenomenology, and sensor data interpretation including awareness of processing artifacts, attribution ambiguities, and measurement limits common in signals intelligence datasets
- Hands-on experience applying machine learning particularly metric learning, deep learning networks, or similarity-learning architectures to RF or time-series signal data, including feature engineering, training pipeline development, and model validation
- Exposure to TDMA network protocols or military datalink systems, and interest in learning the signal processing challenges of dense, contested electromagnetic environments
- Familiarity with direction-finding, time-difference-of-arrival (TDOA), or related passive geolocation concepts understanding of their mathematical foundations and common failure modes is more important than operational experience
- Experience with binary serialization formats (FlatBuffers, Protocol Buffers) and high-throughput sensor data pipelines operating in near-real-time on resource-constrained hardware
- Background in statistical signal processing error ellipses, bearing estimation uncertainty, feature reliability under noise with the ability to distinguish statistically significant findings from artifacts of small sample size or improper normalization
Company Overview