Senior Platform/DevOps Engineer (Kubernetes-Linux)
Translate business requirements into requirements for AI/ML models; prepare data to train and evaluate AI/ML/DL models; build AI/ML/DL models by applying state-of-the-art algorithms, especially transformers; leverage existing algorithms from academic or industrial research when applicable; test, evaluate, and benchmark AI/ML/DL models and publish the models, data sets, and evaluations; deploy models in production by containerizing them; work with customers and internal employees to refine model quality; establish continuous learning pipelines for models using online or transfer learning; build and deploy containerized applications on cloud or on-premise environments.
Software Engineer - Tools & Automation
As a Software Engineer and member of the Platform Stability team, you will help build, fine-tune, and maintain a novel AI-powered tool for diagnosing technical issues and identifying root causes. You will collaborate cross-functionally to gather requirements, develop AI/ML and analytical models, and drive data-driven insights as part of a high-performing team. Responsibilities include designing and implementing agentic AI systems with structured interfaces, reasoning loops, and robust error handling; building and maintaining data pipelines, scheduled workflows, and benchmarking infrastructure; developing evaluation and scoring systems to measure and improve model output quality; integrating the platform with internal and external services such as ticketing, messaging, storage, and observability; collaborating with cross-functional teams to translate business requirements into technical AI solutions; and architecting and maintaining production-grade AI solutions with a focus on scalability, reliability, and performance.
Tech Lead Manager, Data Infrastructure
The Tech Lead Manager, Data Infrastructure at Cartesia is responsible for defining the overall multi-modal data strategy across pre-training and post-training, including human, synthetic, and web-scale data sources. They lead, manage, and mentor a team of data engineers and specialists. They design and oversee the construction of robust, scalable data pipelines for text, audio, and video, establish and enforce rigorous standards for data quality across the organization, deeply understand how data affects model capability and proactively identify and source novel datasets, and manage relationships and budgets with external data vendors and partners.
Mid/Senior/Staff Software Engineer, Agents
As a Software Engineer, Agents, you will build systems that make AI agents indispensable to legal professionals by designing environments and actions for agentic professional work, making model selection decisions, managing context windows, creating optimal tools, and developing evaluation harnesses for faster iteration loops to unlock new capabilities. You will partner with customers and product managers to understand legal workflows, design practical evaluations to capture what excellence means, and ship agents that effectively complete tasks. Additionally, you will optimize agent performance through prompt engineering, model selection, tool design, skill writing, context window management, and evaluation harness development. You will work with the model infrastructure team to design and implement infrastructure for low-latency agent execution, including caching strategies, parallel tool calls, or subagent patterns. Improving observability and instrumentation to profile agent behavior, identify bottlenecks, and drive optimization decisions is also part of the role. Staying current on new developments in agentic systems and applying those insights to product development is expected.
Forward Deployed AI Engineer
Drive the end-to-end technical deployment of Latent Labs models into customer environments, ensuring seamless integration with existing scientific and IT infrastructure. Design and build production-grade API integrations, data pipelines and model-serving infrastructure tailored to each customer’s requirements. Work on-site or embedded with pharma and biotech partners to scope technical requirements, troubleshoot issues and deliver solutions. Ensure deployments meet enterprise standards for security, performance and reliability. Serve as the technical point of contact for assigned customers, building trusted relationships with their scientific and engineering teams, including spending time working on-site at international partner locations as needed. Gather and synthesise customer feedback, translating it into actionable insights for product, research and platform teams. Collaborate with internal teams to shape the product roadmap based on real-world deployment learnings. Create technical documentation, integration guides and best-practice resources for customers. Stay on top of the latest developments in ML infrastructure, model serving and cloud-native tooling. Gain a strong working understanding of protein and cell biology as it relates to the product. Participate in knowledge sharing, including organizing and presenting at internal reading groups.
Staff Engineer, G&C (R4763)
As a Guidance and Controls engineer, you will be responsible for creating and maintaining all control and autonomy algorithms within the XBAT code base. This includes algorithm development, unit tests, component tests, flight software qualification, and flight test support. You will also be responsible for helping update and validate the truth models as required.
Director, Data Center Operations
The responsibilities include advancing inference efficiency end-to-end by designing and prototyping algorithms, architectures, and scheduling strategies for low-latency, high-throughput inference. Implementing and maintaining changes in high-performance inference engines, including kernel backends and speculative decoding, profiling and optimizing performance across GPU, networking, and memory layers to improve latency, throughput, and cost. Unifying inference with RL/post-training by designing and operating RL and post-training pipelines, making RL and post-training workloads more efficient with inference-aware training loops, and using these pipelines to train, evaluate, and iterate on frontier models. Co-designing algorithms and infrastructure so that objectives, rollout collection, and evaluation are tightly coupled to efficient inference, identifying bottlenecks across the training engine, inference engine, data pipeline, and user-facing layers. Running ablations and scale-up experiments to understand trade-offs between model quality, latency, throughput, and cost, and feeding these insights back into model, RL, and system design. Owning critical systems at production scale by profiling, debugging, and optimizing inference and post-training services under real production workloads, driving roadmap items requiring engine modification, and establishing metrics, benchmarks, and experimentation frameworks to validate improvements rigorously. Providing technical leadership by setting technical direction for cross-team efforts at the intersection of inference, RL, and post-training, and mentoring other engineers and researchers on full-stack ML systems work and performance engineering.
Regional Sales Lead, Singapore
Lead and contribute to cross-functional efforts solving complex physical design challenges across IPs, projects, and advanced technology nodes. Develop and enhance RTL-to-GDS methodologies, including floorplanning, synthesis, placement and routing (P&R), static timing analysis (STA), signoff, and assembly. Architect and deploy AI/ML-driven solutions in production flows to improve engineering efficiency, turnaround time, and quality of results (QoR). Optimize EDA tools and custom CAD flows using data-driven and machine learning-based techniques, working closely with internal teams such as verification, extraction, timing, Design for Test (DFT), and electronic design automation (EDA) vendors.
Forward Deployed Engineer - Sydney
Forward Deployed Engineers lead complex end-to-end deployments of frontier models in production alongside strategic customers, owning discovery, technical scoping, system design, build, and production rollout while partnering with customer engineering and domain teams. They own technical delivery across multiple deployments from prototype to stable production, build full-stack systems to deliver customer value, embed closely with customer teams to understand needs and guide adoption, scope work, sequence delivery, and remove blockers early. They make trade-offs between scope, speed, and quality, contribute directly in the code when needed, codify working patterns into reusable tools and playbooks, share field feedback to help Research and Product improve models, and keep teams moving through clarity and follow-through.
Staff Analytics Engineer — Data Warehouse
Advance inference efficiency end-to-end by designing and prototyping algorithms, architectures, and scheduling strategies for low-latency, high-throughput inference. Implement and maintain changes in high-performance inference engines, including kernel backends, speculative decoding, and quantization. Profile and optimize performance across GPU, networking, and memory layers to improve latency, throughput, and cost. Design and operate RL and post-training pipelines where most cost is inference, jointly optimizing algorithms and systems. Make RL and post-training workloads more efficient with inference-aware training loops, async RL rollouts, and speculative decoding. Use these pipelines to train, evaluate, and iterate on frontier models. Co-design algorithms and infrastructure to tightly couple objectives, rollout collection, and evaluation with efficient inference, identifying bottlenecks across the training engine, inference engine, data pipeline, and user-facing layers. Run ablations and scale-up experiments to understand trade-offs between model quality, latency, throughput, and cost, feeding insights into model, RL, and system design. Profile, debug, and optimize inference and post-training services under real production workloads. Drive roadmap items requiring engine modification such as changing kernels, memory layouts, scheduling logic, and APIs. Establish metrics, benchmarks, and experimentation frameworks to validate improvements rigorously. Provide technical leadership to set direction for cross-team efforts in inference, RL, and post-training and mentor engineers and researchers on full-stack ML systems work and performance engineering.
Access all 4,256 remote & onsite AI jobs.
Frequently Asked Questions
Need help with something? Here are our most frequently asked questions.
Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.
