AI DevOps Engineer Jobs

Discover the latest remote and onsite AI DevOps Engineer roles across top active AI companies. Updated hourly.

Join our AI community Interested in Hiring?

Hiring by

Check out 932 new AI DevOps Engineer opportunities posted on AI Chopping Block

View detail

Technical Account Manager (TAM), AI Factory

New

Top rated

Together AI

–

Full-time

–

Posted

May 1, 2026 0:27

Participate in on-call rotation to respond to production incidents, build and run infrastructure with Ansible, Terraform, and Kubernetes to enable scaling to a massive number of concurrent users, build monitoring systems to ensure the highest quality service for customers, design and implement operational processes such as deployments and upgrades, debug production issues across all services and levels of the stack, identify improvements for product architecture from reliability, performance, and availability perspectives, and plan the growth of Together AI's infrastructure.

$190,000 – $270,000

Undisclosed

YEAR

(USD)

San Francisco

Maybe global

Onsite

View detail

Software Engineer, Compute Infrastructure

New

Top rated

OpenAI

–

Full-time

–

Posted

Apr 28, 2026 1:01

In this role, you will spin up and scale large Kubernetes clusters, including automating provisioning, bootstrapping, and cluster lifecycle management; build software abstractions that unify multiple clusters and provide a seamless interface to training workloads; own node bring-up from bare metal through firmware upgrades ensuring fast and repeatable deployment at massive scale; improve operational metrics such as reducing cluster restart times and accelerating firmware or OS upgrade cycles; integrate networking and hardware health systems to deliver end-to-end reliability across servers, switches, and data center infrastructure; develop monitoring and observability systems to detect issues early and maintain cluster stability under extreme load; solve real-time operational challenges, diagnose and fix issues quickly, and continuously improve automation, resilience, performance, and uptime across the systems powering frontier AI model training.

$230,000 – $405,000

Undisclosed

YEAR

(USD)

San Francisco, United States

Maybe global

Remote

View detail

DevOps Engineer

New

Top rated

Observe

–

Full-time

–

Posted

Apr 20, 2026 14:23

Build and deploy AI agents including prompt design, workflow configuration, integrations, telephony setup, and evaluation frameworks. Act as the primary technical partner for customers by leading demos, communicating progress, gathering feedback, and guiding solutions from concept to production. Configure and connect systems via APIs, handling authentication, data mapping, error handling, and integrations with CRMs, knowledge bases, and other enterprise tools. Set up telephony integration including SIP/CCaaS/PSTN routing, metadata passing, fallback configurations, and troubleshooting call quality. Write and refine prompts for LLM-driven agents, monitor performance, conduct iterative testing, and ensure agents meet automation and containment targets. Translate customer requirements into actionable solutions and work consultatively to resolve challenges related to security, connectivity, or knowledge ingestion. Collaborate with product and engineering teams to address platform gaps, resolve technical issues, and lead client implementations independently.

$108,000 – $170,000

Undisclosed

YEAR

(USD)

Bengaluru or Redwood City, United States

Maybe global

Hybrid

View detail

Senior DevOps Engineer, APJ

New

Top rated

Arize AI

–

Full-time

–

Posted

Apr 15, 2026 5:25

Debug and fix issues in the platform and ship pull requests with your fixes. Build internal tools and copilots powered by generative AI to enhance the team. Rapidly prototype proof-of-concepts for customer use cases. Work across Engineering, Product, and Solutions teams to unblock customers and advance AI adoption.

Undisclosed

()

Singapore

Maybe global

Remote

View detail

Senior DevOps Engineer, APJ

New

Top rated

Arize AI

–

Full-time

–

Posted

Apr 15, 2026 5:17

Debug and fix issues in the platform and ship pull requests with fixes. Build internal tools and copilots powered by generative AI to enhance the team. Rapidly prototype proof-of-concepts for customer use cases. Collaborate across Engineering, Product, and Solutions teams to unblock customers and advance AI adoption.

Undisclosed

()

Malaysia

Maybe global

Remote

View detail

DevOps Engineer (Argentina)

New

Top rated

Arize AI

–

Full-time

–

Posted

Apr 14, 2026 4:07

Debug and fix issues in the platform and ship pull requests with fixes. Build internal tools and copilots powered by generative AI to enhance the team. Rapidly prototype proof-of-concept solutions for customer use cases. Collaborate across Engineering, Product, and Solutions teams to unblock customers and push the boundaries of AI adoption.

Undisclosed

()

Buenos Aires, Argentina

Maybe global

Remote

View detail

Senior Platform/DevOps Engineer (Kubernetes-Linux)

New

Top rated

Armada

–

Full-time

–

Posted

Apr 10, 2026 4:45

Translate business requirements into requirements for AI/ML models; prepare data to train and evaluate AI/ML/DL models; build AI/ML/DL models by applying state-of-the-art algorithms, especially transformers; leverage existing algorithms from academic or industrial research when applicable; test, evaluate, and benchmark AI/ML/DL models and publish the models, data sets, and evaluations; deploy models in production by containerizing them; work with customers and internal employees to refine model quality; establish continuous learning pipelines for models using online or transfer learning; build and deploy containerized applications on cloud or on-premise environments.

$154,560 – $193,200

Undisclosed

YEAR

(USD)

Bellevue, United States

Maybe global

Onsite

View detail

Senior Infrastructure Engineer

New

Top rated

Bland

–

Full-time

–

Posted

Mar 25, 2026 7:36

As a Senior Infrastructure Engineer at Bland, responsibilities include contributing to the design of scalable architecture by building distributed systems using Kubernetes that handle high-volume, real-time voice processing with strict latency and reliability requirements; building and supporting machine learning infrastructure including training pipelines and real-time inference serving across multiple regions; maintaining robust integrations with enterprise telephony systems, SIP trunks, and VoIP infrastructure; identifying architectural flaws and solving them; ensuring platform reliability through monitoring, alerting, and incident response systems to maintain enterprise-grade uptime; anticipating and solving scaling challenges related to exponential call volume growth; and implementing security best practices and compliance requirements for enterprise customers in regulated industries.

$120,000 – $200,000

Undisclosed

YEAR

(USD)

San Francisco, United States

Maybe global

Onsite

View detail

Lead DevOps Engineer

New

Top rated

Observe

–

Full-time

–

Posted

Mar 23, 2026 18:16

Lead the design, building, deployment, and optimization of enterprise-grade AI agents including voice, chat, and AI copilots. Own the full lifecycle of AI agent development including prompt engineering, workflow creation, API integration, telephony setup, and evaluation forms. Engage with clients through weekly demos, progress updates, feedback gathering, and act as the primary technical contact for deployed solutions. Configure system integrations involving APIs, data maps, authentication, and connectivity to CRM, databases, and knowledge systems. Set up telephony routing (SIP/CCaaS/PSTN), manage metadata, configure fallbacks, and troubleshoot call quality issues. Monitor agent performance and iteratively refine prompts to meet automation and containment goals. Work strategically to translate customer requirements into technical solutions, addressing challenges related to security, connectivity, and knowledge ingestion. Collaborate with product and engineering teams to support deep technical fixes and platform development while independently leading client delivery and support.

$108,000 – $170,000

Undisclosed

YEAR

(USD)

Redwood City, United States

Maybe global

Hybrid

View detail

Senior Pathologist

New

Top rated

PathAI

–

Full-time

–

Posted

Mar 11, 2026 1:54

Lead the team responsible for the infrastructure supporting AI/ML Stack, focusing on scalability and efficiency of the Machine Learning Operations platform. Develop and execute the long-term vision and roadmap for the MLOps team to support ML development and deployment across business units, balancing short-term tactical deliveries with long-term architectural transformation. Manage and mentor a team of 6-7+ engineers, allocating resources strategically to support existing services and execute key strategic initiatives. Collaborate cross-functionally with leaders in machine learning, data science, product engineering, and infrastructure to identify pain points, remove bottlenecks, and facilitate new solution deployment. Architect compute and storage pipelines for ML Engineers to manage large datasets and artifacts efficiently. Modernize the AI product inference stack for significant growth in global deployments. Work with Site Reliability Engineering to establish comprehensive system observability metrics. Conduct assessments for technology refresh and benchmark proprietary tools against commercial and open-source alternatives to meet future needs.

$181,500 – $278,300

Undisclosed

YEAR

(USD)

Boston or Memphis, United States

Maybe global

Hybrid

Want to see more AI DevOps Engineer jobs?

View all jobs

Access all 4,256 remote & onsite AI jobs.

Join our private AI community to unlock full job access, and connect with founders, hiring managers, and top AI professionals.

Join our community

(Yes, it’s still free—your best contributions are the price of admission.)

Frequently Asked Questions

Have questions about roles, locations, or requirements for AI DevOps Engineer jobs?

Question text goes here

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.

[{"question":"What does an AI DevOps Engineer do?","answer":"AI DevOps Engineers build and maintain ML pipelines in cloud environments, implementing CI/CD workflows specifically for AI applications. They create monitoring solutions that track not just system health but also data quality and model performance. Their daily work includes developing cloud infrastructure code using tools like Terraform and Ansible, ensuring AI applications scale effectively. They collaborate with data scientists to deploy models, troubleshoot production issues, and implement security protocols. Unlike traditional developers, they bridge the gap between data science and operations, ensuring ML models transition smoothly from development to production environments."},{"question":"What skills are required for AI DevOps Engineer jobs?","answer":"AI DevOps Engineers need strong cloud platform expertise, particularly in AWS, Azure, or GCP. Proficiency with infrastructure-as-code tools like Terraform and Ansible is essential. Container orchestration skills using Docker and Kubernetes help manage AI workloads. Experience with CI/CD pipelines through Jenkins or GitLab CI enables automated model deployment. Python scripting ability supports both automation and ML pipeline integration. Monitoring skills using Prometheus and Grafana help track model performance. Beyond technical abilities, these roles require collaboration skills to work effectively with data scientists and developers, plus problem-solving aptitude to troubleshoot complex AI system issues."},{"question":"What qualifications are needed for AI DevOps Engineer jobs?","answer":"Most AI DevOps Engineer positions require a minimum of 3 years of software development experience and 2+ years of cloud deployment experience, with Azure often preferred. A computer science or related degree is typically expected, though equivalent experience may substitute. Employers look for candidates with hands-on experience using development and deployment tools like GitLab and Atlassian suite products. While not always mandatory, certifications in cloud platforms (AWS Solutions Architect, Azure DevOps Engineer) and container orchestration (CKA) strengthen applications. Experience building CI/CD pipelines specifically for ML workflows gives candidates a significant advantage in the hiring process."},{"question":"What is the salary range for AI DevOps Engineer jobs?","answer":"AI DevOps Engineer salaries vary based on several key factors. Geographic location significantly impacts compensation, with tech hubs like San Francisco and New York offering higher wages. Experience level creates substantial differences, with senior engineers earning considerably more. Specialized expertise in high-demand tools like Kubernetes or specific cloud platforms (AWS/Azure/GCP) can boost earnings. Industry sector also matters—financial services and healthcare organizations often pay premium rates for AI infrastructure expertise. Company size influences packages too, with large enterprises typically offering better benefits but startups potentially providing equity. Security clearances for sensitive projects may command additional compensation."},{"question":"How long does it take to get hired as an AI DevOps Engineer?","answer":"The hiring timeline for AI DevOps Engineers typically ranges from 4-8 weeks. The process usually begins with a screening call, followed by technical assessments testing cloud infrastructure skills and coding abilities. Candidates often face 2-3 rounds of interviews, including sessions with engineering managers and team members. Many employers include practical challenges related to containerization, CI/CD pipeline setup, or infrastructure-as-code implementations. Companies hiring for specialized AI infrastructure may extend the process with additional technical evaluations. Candidates with demonstrated experience in both DevOps and machine learning environments generally move through the pipeline faster than those from only traditional DevOps backgrounds."},{"question":"Are AI DevOps Engineer jobs in demand?","answer":"AI DevOps Engineer roles show strong demand as organizations integrate machine learning into their product offerings. Major companies like Boeing actively recruit for these positions to support AI applications in secure cloud environments. The specialized skillset—combining traditional DevOps practices with ML pipeline expertise—creates a smaller talent pool than for general DevOps roles. Organizations increasingly recognize that successful AI deployment requires specialized infrastructure and monitoring beyond conventional applications. This demand spans industries from technology and finance to manufacturing and healthcare, as each sector adopts AI capabilities requiring robust deployment pipelines, monitoring solutions, and infrastructure that traditional DevOps approaches don't fully address."},{"question":"What is the difference between AI DevOps Engineer and Traditional DevOps Engineer?","answer":"Traditional DevOps Engineers focus on application delivery pipelines, infrastructure automation, and system monitoring for conventional software. AI DevOps Engineers extend these skills to handle machine learning workflows, requiring specialized knowledge of model deployment, training pipelines, and experiment tracking. While both roles use similar tools (Docker, Kubernetes, CI/CD platforms), AI DevOps Engineers must understand data quality monitoring and model performance metrics that don't exist in traditional applications. They work more closely with data scientists and ML engineers, bridging the gap between data science and operations. AI DevOps requires additional considerations around computational resources, GPU scheduling, and optimizing infrastructure for machine learning workloads."}]