Pitch N Hire
AI Infrastructure Team Full-time Remote

AI Infrastructure / ML Engineer.

Join our AI Infrastructure Team as an AI Infrastructure / ML Engineer to enhance CapaCloud for AI workloads with a focus on MLOps, scalable GPU utilization, and model deployment.

Experience

0–3 yrs

Location

Remote

United States


The Brief

TITLE

AI Infrastructure / ML Engineer

TEAM

AI Infrastructure Team

TYPE

Full-time

POSTED

Jun 3, 2026

JOB ID

019e8da3

Mlops (machine Learning Operations) With Production Ai Model Deployment.

We are hiring an AI Infrastructure / ML Engineer to help optimize CapaCloud for AI workloads, model deployment, training, inference, and scalable GPU utilization.

You will work closely with infrastructure engineers to ensure the platform supports modern AI workflows for startups, researchers, and enterprise users.

This role is ideal for someone passionate about AI systems, MLOps, and large-scale GPU computing.

Key Responsibilities

  • Build and optimize AI deployment pipelines

  • Improve GPU workload efficiency for AI applications

  • Support AI training and inference infrastructure

  • Optimize performance for PyTorch, TensorFlow, and LLM workloads

  • Build scalable APIs and inference systems

  • Develop benchmarking and performance testing tools

  • Collaborate with infrastructure teams on orchestration systems

  • Support model deployment and containerized AI workloads

  • Improve developer experience for AI users

  • Monitor and optimize AI compute performance

Required Skills & Experience

  • Experience with AI/ML infrastructure and MLOps

  • Strong Python programming skills

  • Experience with PyTorch, TensorFlow, or JAX

  • Experience with GPU computing and CUDA environments

  • Familiarity with containerized deployment systems

  • Experience deploying AI models in production

  • Understanding of inference optimization techniques

  • Experience with APIs and backend systems

  • Strong debugging and analytical skills

Nice To Have

  • Experience with LLM infrastructure

  • Familiarity with Hugging Face ecosystem

  • Experience with distributed training systems

  • Knowledge of Kubernetes and orchestration systems

  • Experience with AI inference optimization tools

  • Open-source AI contributions

What Success Looks Like

  • High-performance AI deployment infrastructure

  • Optimized GPU utilization for AI workloads

  • Smooth onboarding for AI developers

  • Reliable inference and training systems

  • Strong benchmark performance across workloads

Employment Type

  • Full-time

  • Remote

About the company

C

CapaCloud

capa.cloud