Back to Job Listings

Lead Research Software Engineer, Portable AI Performance Engineering

SpringCube

Full-time - Senior Engineer

Education & Training

United States, Boston - Massachusetts

Published 3 weeks ago

Salary: Disclosed upon interview

Contact Employer
  • Share:
Send Feedback
Report This Job

Job Description

The SpringCube team curated the following job opportunity to help you in your job search. Explore the position below to find your next career move.

Company Overview
A leading academic and research institution is seeking a Lead Research Software Engineer to join its High Performance Computing Center. The organization provides an inclusive, multicultural, and collaborative environment where researchers and engineers work on cutting-edge AI and computational workloads. Employees benefit from unique and generous programs that support health, well-being, and work/life balance while contributing to world-class research in science and technology.

Summary
The Lead Research Software Engineer will serve as a hands-on expert in applied performance engineering for AI workloads. The role involves evaluating, adapting, and optimizing AI research workloads on state-of-the-art hardware, with a particular focus on GPU performance and portability. The position collaborates closely with academic research groups and industry partners to enhance the efficiency and portability of complex AI models and scientific code.

Responsibilities

  • Analyze and profile AI workloads to identify performance bottlenecks and portability challenges
  • Optimize existing NVIDIA GPU-based workloads for AMD GPUs such as MI355X and beyond
  • Port and optimize complex AI models using ROCm, HIP, and related translation tools
  • Collaborate with research teams and industry partners to ensure high-performance computing solutions meet project requirements
  • Apply performance profiling, benchmarking, and optimization techniques on Linux-based HPC systems
  • Support the translation and adaptation of AI workloads for scalable, efficient deployment across heterogeneous hardware

Qualifications

Required:

  • Bachelor’s degree or equivalent with a minimum of five years of technical or computational research experience
  • Proficiency in Python and C++ and deep familiarity with AI/ML frameworks such as PyTorch, TensorFlow, or JAX
  • Hands-on experience with GPU programming models (CUDA, HIP, OpenCL)
  • Experience with performance profiling and benchmarking tools on Linux HPC systems
  • Excellent communication skills and ability to collaborate effectively with academic and industry teams
  • Self-motivated and able to work independently in remote or hybrid environments

Preferred:

  • Direct experience with AMD ROCm software stack and translating CUDA code to HIP
  • Familiarity with AI agentic tools and Large Language Models (LLMs) for code generation and refactoring
  • Experience supporting large-scale, domain-specific scientific research on institutional clusters
  • Knowledge of open-source schedulers, provisioners, and Linux container technologies (LXC, apptainer, systemd-nspawn)
  • Advanced degree in a relevant technical field

Disclaimer
SpringCube curates tech job listings from various company websites to support tech professionals in globally.

1. No Endorsement: Job ads on SpringCube do not imply endorsement of their authenticity or quality.
2. No Client Relationship: This company is not a client of SpringCube unless stated.
3. To Apply: Click the “Apply” button to be redirected to the hiring company’s application page for this job.
4. No Liability: SpringCube is not liable for inaccuracies.