Join our AI Infrastructure Team as an AI Infrastructure / ML Engineer to enhance CapaCloud for AI workloads with a focus on MLOps, scalable GPU utilization, and model deployment.
Experience
0–3 yrs
Location
Remote
United States
Experience
0–3 yrs
Location
Remote
United States
The Brief
TITLE
AI Infrastructure / ML Engineer
TEAM
AI Infrastructure Team
TYPE
Full-time
POSTED
Jun 3, 2026
JOB ID
019e8da3
TITLE
AI Infrastructure / ML Engineer
TEAM
AI Infrastructure Team
TYPE
Full-time
POSTED
Jun 3, 2026
JOB ID
019e8da3
We are hiring an AI Infrastructure / ML Engineer to help optimize CapaCloud for AI workloads, model deployment, training, inference, and scalable GPU utilization.
You will work closely with infrastructure engineers to ensure the platform supports modern AI workflows for startups, researchers, and enterprise users.
This role is ideal for someone passionate about AI systems, MLOps, and large-scale GPU computing.
Build and optimize AI deployment pipelines
Improve GPU workload efficiency for AI applications
Support AI training and inference infrastructure
Optimize performance for PyTorch, TensorFlow, and LLM workloads
Build scalable APIs and inference systems
Develop benchmarking and performance testing tools
Collaborate with infrastructure teams on orchestration systems
Support model deployment and containerized AI workloads
Improve developer experience for AI users
Monitor and optimize AI compute performance
Experience with AI/ML infrastructure and MLOps
Strong Python programming skills
Experience with PyTorch, TensorFlow, or JAX
Experience with GPU computing and CUDA environments
Familiarity with containerized deployment systems
Experience deploying AI models in production
Understanding of inference optimization techniques
Experience with APIs and backend systems
Strong debugging and analytical skills
Experience with LLM infrastructure
Familiarity with Hugging Face ecosystem
Experience with distributed training systems
Knowledge of Kubernetes and orchestration systems
Experience with AI inference optimization tools
Open-source AI contributions
High-performance AI deployment infrastructure
Optimized GPU utilization for AI workloads
Smooth onboarding for AI developers
Reliable inference and training systems
Strong benchmark performance across workloads
Full-time
Remote
About the company
CapaCloud
capa.cloud