AI Platform Engineer Jobs

Discover the latest remote and onsite AI Platform Engineer roles across top active AI companies. Updated hourly.

Join our AI community Interested in Hiring?

Hiring by

Check out 43 new AI Platform Engineer opportunities posted on AI Chopping Block

View detail

Staff Software Engineer, Core Infrastructure

New

Top rated

Harvey

–

Full-time

–

Posted

Apr 28, 2026 3:18

As a Staff Software Engineer on the Core Infrastructure team at Harvey, your responsibilities include designing and building scalable, fault-tolerant infrastructure systems that power Harvey's AI platform across multiple cloud regions. You will own and evolve the multi-cloud infrastructure (Azure, GCP), including Kubernetes orchestration, networking, and container management. You will lead technical initiatives focused on observability, incident response, and operational excellence, building systems for rapid detection and resolution of issues. Architecting and optimizing distributed systems for reliability, including load balancing, quota management, and failover mechanisms, will be part of your role. You will partner with Product Engineering and Security teams to ensure infrastructure accelerates product development, drive infrastructure-as-code practices using tools like Terraform and Pulumi for reproducible deployments, and mentor engineers through code reviews, design reviews, and technical leadership. Representative projects include designing model proxy architecture for handling inference requests, building distributed rate limiting and quota management systems, architecting multi-region deployment strategies for data residency compliance, developing observability infrastructure with SLA monitoring and cost tracking, and leading CI/CD pipeline evolution to improve velocity and stability.

$236,000 – $290,000

Undisclosed

YEAR

(USD)

San Francisco, United States

Maybe global

Onsite

View detail

Tokens-as-a-Service (Taas) Software Engineer

New

Top rated

OpenAI

–

Full-time

–

Posted

Apr 28, 2026 2:55

Develop systems and tooling to measure, monitor, and improve token throughput across first-party and partner-owned compute environments. Support performance benchmarking, tokenomics analysis, and model porting across heterogeneous infrastructure environments. Build tooling to integrate external or partner infrastructure into OpenAI’s internal compute, observability, and workload management systems. Develop and monitor operational metrics including billing, usage, SLAs, utilization, reliability, and throughput. Identify bottlenecks across hardware, networking, software, and workload enablement that prevent capacity from becoming productive tokens. Partner with compute, infrastructure, networking, finance, and operations teams to translate raw capacity into usable workload-serving capacity. Build dashboards, automation, and reporting systems that provide clear visibility into TaaS capacity, performance, and business outcomes.

$293,000 – $455,000

Undisclosed

YEAR

(USD)

San Francisco, United States

Maybe global

Remote

View detail

Software Engineer, Early Career

New

Top rated

Mirage

–

Full-time

–

Posted

Apr 27, 2026 20:53

As a Software Engineer at Mirage, you will work across product engineering, backend/platform engineering, and applied AI teams. Responsibilities include designing and building systems, APIs, and infrastructure that power products; solving challenges involving distributed systems, scaling, and performance; integrating and operating large AI models in production; building core platform components such as storage, billing, observability, and security; shipping end-to-end product experiences for creative workflows; building polished, performant user interfaces (web or native mobile); pushing the boundaries of video, graphics, and AI-powered creation tools; instrumenting, A/B testing, and iterating quickly with real user data; building and shipping AI-powered product experiences end-to-end; working with state-of-the-art models across video, audio, image, and text; designing systems for context, reasoning, and intelligent behavior; and building evals, datasets, and tooling for improving model quality.

$160,000 – $165,000

Undisclosed

YEAR

(USD)

New York, United States

Maybe global

Onsite

View detail

Staff Software Engineer, RLE

New

Top rated

Handshake

–

Full-time

–

Posted

Apr 23, 2026 21:46

Define and drive architecture for scalable, extensible Reinforcement Learning Environments (RLE) systems and data pipelines. Lead development of platform capabilities enabling rapid domain creation. Partner with Research, Product, and Operations to shape strategy and execution. Set standards for reliability, observability, performance, and data quality. Mentor engineers and elevate engineering excellence across the team. Identify and solve systemic bottlenecks in scaling environments and data generation.

$265,500 – $295,000

Undisclosed

YEAR

(USD)

United States

Maybe global

Remote

View detail

Director of Engineering, Infrastructure

New

Top rated

Zapier

–

Full-time

–

Posted

Apr 21, 2026 2:45

As the Director of Engineering for Infrastructure at Zapier, you will lead multiple multidisciplinary teams responsible for building, supporting, and evolving Zapier's core services, platforms, and infrastructure. Your responsibilities include shaping platform engineering vision, scalability, accountability mechanisms, and organizational operations. You will be responsible for defining and driving the strategy and long-term roadmap for the organization in collaboration with your teams and leadership peers, understanding and articulating how your work enables product development velocity, and how reactive and Keep-The-Lights-On (KTLO) work will be reduced to increase proactive platform improvements. You will lead the AI transformation of platform engineering by re-architecting workflows, minimizing reactive work, implementing AI-powered tooling and automation, setting AI adoption pace, and building repeatable AI-enhanced systems. Additionally, you will unblock software delivery pain points, establish data-driven approaches to measure delivery velocity and quality, ensure the reliability and uptime of core infrastructure in partnership with service-owning teams, and lead the organization during major incidents when necessary. You are also accountable for team building, talent development, recruiting, mentoring, and sustaining a compelling work culture that supports growth, with clear growth paths for managers and ICs, ultimately owning output and outcomes for your teams and their systems.

$280,400 – $420,500

Undisclosed

YEAR

(USD)

San Francisco, United States

Maybe global

Remote

View detail

RISC-V AI / HPC & Agentic Software Engineer

New

Top rated

Tenstorrent

–

Full-time

–

Posted

Apr 16, 2026 1:34

Lead and contribute to cross-functional efforts solving complex physical design challenges across IPs, projects, and advanced technology nodes. Develop and enhance RTL-to-GDS methodologies, including floorplanning, synthesis, placement and routing (P&R), static timing analysis (STA), signoff, and assembly. Architect and deploy AI/ML-driven solutions in production flows to improve engineering efficiency, turnaround time, and quality of results (QoR). Optimize EDA tools and custom CAD flows using data-driven and machine learning-based techniques, collaborating closely with verification, extraction, timing, Design for Test (DFT), and EDA vendors.

$100,000 – $500,000

Undisclosed

YEAR

(USD)

New Taipei City, Taiwan

Maybe global

Remote

View detail

Director, Engineering, Proactive Offense

New

Top rated

Horizon3ai

–

Full-time

–

Posted

Apr 7, 2026 21:09

Lead and scale Horizon3.ai's Offensive Engineering organization, overseeing teams responsible for exploit development, offensive content, and attack automation within the NodeZero platform. Set clear technical and product direction for how NodeZero identifies, exploits, and validates vulnerabilities across large, complex environments. Partner with Product, Precision Defense, and Platform teams to define and deliver offensive capabilities that influence the roadmap and enhance customer outcomes. Drive execution from proof-of-concept through production to transform cutting-edge attack research into scalable, productized features. Stay hands-on to guide architectural decisions and evaluate exploit and automation approaches, mentoring technical leads in building resilient, modular systems. Build, mentor, and scale diverse teams of software engineers, exploit developers, and offensive researchers, fostering a culture of collaboration, creativity, and engineering excellence that bridges offensive and product software development. Collaborate across engineering, product, and GTM teams to align offensive innovation with business priorities and ensure delivery of impactful capabilities for customers. This role is central to the mission of delivering continuous, autonomous security testing at scale.

$240,000 – $285,000

Undisclosed

YEAR

(USD)

US, United States

Maybe global

Remote

View detail

Technical Lead Manager, Platform (India)

New

Top rated

Cartesia

–

Full-time

–

Posted

Apr 1, 2026 5:32

Lead the design and development of low latency, scalable, and reliable model inference and serving stack for SSM foundation models. Manage and mentor a team of platform engineers maintaining a high technical bar and strong engineering culture. Work closely with research and product teams to translate research into products. Own the architecture and roadmap for model serving infrastructure, distributed systems, and data processing platforms. Build highly parallel, high quality data processing and evaluation infrastructure for foundation model training. Drive execution across ambiguous, zero-to-one engineering projects and platform initiatives. Establish best practices for reliability, observability, scalability, and performance across platform systems. Help recruit, interview, and build the engineering team in India. Have significant autonomy to shape the platform and impact how AI is applied across devices and applications.

₹10,000,000 – ₹13,000,000

Undisclosed

YEAR

(INR)

Bangalore, India

Maybe global

Onsite

View detail

IC Agentic Engineering Manager - Stargate

New

Top rated

OpenAI

–

Full-time

–

Posted

Mar 31, 2026 18:57

Design and build agent-based systems to support infrastructure deployment and operations. Identify high-impact opportunities to apply agents across workflows such as cluster bring-up and deployment readiness, incident triage and root cause analysis, system validation and health monitoring, and capacity management and operational decision-making. Lead a small team while contributing directly as an individual contributor across system design, development, and integration. Partner with infrastructure, hardware, and networking teams to integrate agentic systems into production workflows. Develop systems that leverage telemetry, logs, and system signals to enable closed-loop automation. Define evaluation frameworks to measure system effectiveness, reliability, and operational impact. Drive iteration from prototype to production, ensuring robustness and scalability.

$293,000 – $490,000

Undisclosed

YEAR

(USD)

San Francisco, United States

Maybe global

Onsite

View detail

Senior Platform Engineer, Voice AI

New

Top rated

Together AI

–

Full-time

–

Posted

Mar 31, 2026 4:40

Advance inference efficiency end-to-end by designing and prototyping algorithms, architectures, and scheduling strategies for low-latency, high-throughput inference. Implement and maintain changes in high-performance inference engines, including kernel backends, speculative decoding, and quantization. Profile and optimize performance across GPU, networking, and memory layers to improve latency, throughput, and cost. Design and operate RL and post-training pipelines that optimize algorithms and systems jointly, making workloads more efficient with inference-aware training loops and techniques such as async RL rollouts and speculative decoding. Use these pipelines to train, evaluate, and iterate on frontier models, and co-design algorithms and infrastructure tightly coupled to efficient inference. Run ablations and scale-up experiments to understand trade-offs between model quality, latency, throughput, and cost, feeding insights back into model, RL, and system design. Own critical systems at production scale by profiling, debugging, and optimizing inference and post-training services under real workloads. Drive roadmap items requiring engine modification, establish metrics, benchmarks, and experimentation frameworks to rigorously validate improvements. Provide technical leadership by setting technical direction for cross-team efforts at the intersection of inference, RL, and post-training, and mentor other engineers and researchers on full-stack ML systems and performance engineering.

$200,000 – $280,000

Undisclosed

YEAR

(USD)

San Francisco

Maybe global

Onsite

Want to see more AI Platform Engineer jobs?

View all jobs

Access all 4,256 remote & onsite AI jobs.

Join our private AI community to unlock full job access, and connect with founders, hiring managers, and top AI professionals.

Join our community

(Yes, it’s still free—your best contributions are the price of admission.)

Frequently Asked Questions

Have questions about roles, locations, or requirements for AI Platform Engineer jobs?

Question text goes here

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.

[{"question":"What does a AI Platform Engineer do?","answer":"AI Platform Engineers develop and maintain infrastructure that supports machine learning workloads. They collaborate with data scientists and software teams to deploy, manage, and optimize AI models while implementing automation for deployment and scaling. Their responsibilities include ensuring high availability, designing scalable data pipelines, integrating models, and resolving platform issues. They also monitor system performance and stay current with advancements in AI infrastructure technologies."},{"question":"What skills are required for AI Platform Engineer?","answer":"Successful AI Platform Engineers need proficiency in cloud platforms (AWS, GCP, Azure), containerization technologies like Kubernetes, and infrastructure automation tools such as Terraform. Programming skills in Python, Java, or R are essential, along with experience in AI frameworks like TensorFlow and PyTorch. Knowledge of CI/CD pipelines, DevOps practices, and data engineering are crucial. Strong problem-solving abilities and collaboration skills are also important for working across technical teams."},{"question":"What qualifications are needed for AI Platform Engineer role?","answer":"Most employers require a bachelor's degree in computer science, software engineering, or related technical field. Typically, companies seek candidates with at least 5 years of experience in DevOps and CI/CD projects. Cloud computing certifications for AWS, GCP, or Azure are highly valued. Demonstrated expertise in machine learning technologies, containerization, infrastructure automation, and security best practices is essential for succeeding in this specialized role."},{"question":"What is the salary range for AI Platform Engineer job?","answer":"The research provided doesn't include specific salary information for AI Platform Engineers. Compensation typically varies based on factors including location, company size, years of experience, and specific technical expertise. Given the specialized nature of this role combining AI, ML, and platform engineering skills, salaries are likely competitive with other advanced technical positions in the AI industry."},{"question":"How long does it take to get hired as a AI Platform Engineer?","answer":"The hiring timeline for AI Platform Engineer positions isn't specified in the research. The process typically involves technical interviews to assess cloud platform knowledge, containerization experience, and AI infrastructure skills. With companies like EY requiring minimum 5 years of DevOps experience, building the necessary qualifications takes significant time. The specialized nature of these roles, requiring both AI and platform engineering expertise, may extend the hiring process."},{"question":"Are AI Platform Engineer job in demand?","answer":"Yes, AI Platform Engineer jobs show strong demand signals. Currently, major organizations like General Motors and Millennium have open positions specifically for AI infrastructure development. As companies increasingly integrate AI into their operations, the need for specialists who can build robust platforms to support machine learning workloads continues to grow. The specialized combination of AI knowledge and platform engineering skills makes these professionals particularly valuable in today's technology landscape."}]

Find AI jobs in by countries

AI Jobs in Argentina

AI Jobs in Australia

AI Jobs in Brazil

AI Jobs in Canada

AI Jobs in China

AI Jobs in France

AI Jobs in Germany

AI Jobs in Hong Kong

AI Jobs in India

AI Jobs in Japan

AI Jobs in Mexico

AI Jobs in Poland

AI Jobs in Singapore

AI Jobs in South Korea

AI Jobs in Spain

AI Jobs in Sweden

AI Jobs in Taiwan

AI Jobs in United Kingdom

AI Jobs in United States

Find AI jobs for similar categories

AI Technical Trainer Jobs

Applied ML Engineer Jobs

AI Full-Stack Engineer Jobs

AI MLOps Engineer Jobs

AI Systems Engineer Jobs

AI Applied Data Scientist Jobs

AI Robotics Software Engineer Jobs

AI Developer Educator Jobs

Applied AI Engineer Jobs

AI Applied Research Scientist Jobs

ML Researcher Jobs

AI Backend Engineer Jobs

AI Training Data Specialist Jobs

ML Research Engineer Jobs

ML Infrastructure Engineer Jobs

AI Infrastructure Engineer Jobs

AI Autonomous Systems Engineer Jobs

AI Perception Engineer Jobs

AI Robotics Researcher Jobs