Machine Learning Engineer Jobs

Discover the latest remote and onsite Machine Learning Engineer roles across top active AI companies. Updated hourly.

Check out 1962 new Machine Learning Engineer opportunities posted on AI Chopping Block

Member of Engineering (Pre-training / Data Research)

New
Top rated
Poolside
Full-time
Full-time
Posted

Follow the latest research related to Large Language Models (LLMs) and data quality, being familiar with relevant open-source datasets and models. Design and implement complex pipelines to generate large amounts of diverse data while optimizing available resources. Collaborate closely with teams such as Pretraining, Posttraining, Evals, and Product to ensure short feedback loops on the quality of models delivered. Suggest, conduct, and analyze data ablations or training experiments to improve the quality of generated datasets using quantitative insights.

Undisclosed

()

United Kingdom
Maybe global
Remote

Director of Technology & Systems

New
Top rated
Scale AI
Full-time
Full-time
Posted

As a Production AI Ops Lead, you will design and develop the production lifecycle of full-stack AI applications, while supporting end-to-end system reliability, real-time inference observability, sovereign data orchestration, high-security software integration, and the resilient cloud infrastructure required for international government partners. You will own the production outcome by taking full accountability for the long-term performance and reliability of AI use cases deployed across international government agencies. You will ensure full-stack integrity by overseeing the end-to-end health of the platform, ensuring seamless integration between the AI core and all full-stack components, from APIs to UI, to maintain a responsive and production-ready environment. You will build automated systems to monitor model performance and data drift across geographically dispersed environments, ensuring the right levels of reliability. You will manage the technical lifecycle within diverse regulatory frameworks, lead the response for production issues in mission-critical environments ensuring rapid resolution and building guardrails to prevent recurrence. You will translate deep technical performance metrics into clear insights for senior international government officials and partner with Engineering and ML teams to ensure lessons learned in the field directly influence the technical architecture and decisions of future use cases.

Undisclosed

()

San Francisco, United States
Maybe global
Onsite

Research Engineer – Evals

New
Top rated
Firecrawl
Full-time
Full-time
Posted

Build the evaluation systems from scratch that measure whether Firecrawl's outputs are effective across scraping, crawling, extracting, and mapping. This includes designing metrics, building pipelines, curating datasets, and integrating evaluations into continuous integration and deployment to catch regressions before release. Design benchmarks that represent real customer data distribution including edge cases, and create the collection and labeling systems. Own LLM-as-judge pipelines by designing and validating automated judges for scoring extraction quality, understanding LLM evaluation failure modes, and building human review tooling. Collaborate with research engineers working on models and reinforcement learning to use evaluation metrics as training signals and feedback loops to improve models. Design, run, and communicate fast experiments that test meaningful hypotheses and enable clear decision-making across the team.

$160,000 – $240,000
Undisclosed
YEAR

(USD)

San Francisco, United States
Maybe global
Hybrid

Machine Learning Engineer (Singapore)

New
Top rated
Cantina Labs
Full-time
Full-time
Posted

Build and scale systems for ingesting, processing, and delivering large-scale video and multimodal data for model training. Own the full pipeline from raw content to curated, filtered, and training-ready datasets focusing on speed, reliability, reproducibility, and cost-efficiency. Design and scale distributed data pipelines for preprocessing, dataset generation, and repeated dataset refreshes. Own workflow orchestration, job scheduling, monitoring, and failure recovery for large-scale data processing jobs. Implement and maintain containerized pipeline infrastructure using Kubernetes or equivalent orchestration systems. Optimize cloud-based data storage and movement across providers (AWS, GCS, or Azure) for cost, throughput, and operational efficiency. Define and implement best practices for dataset storage layout, versioning, caching, retention, and access patterns. Design and implement curation pipelines for selection, filtering, and retention of video and image content for model training including image-text pair datasets. Build and improve VLM-based captioning and metadata generation workflows at scale across video and image data. Develop and apply quality and aesthetic scoring models, CLIP-based semantic filtering, and other signal-extraction approaches for data selection. Build tooling to support deduplication workflows at scale, including near-dedup and exact deduplication pipelines over large video corpora. Analyze dataset composition, identify quality issues, iterate on curation logic to improve training outcomes. Define and evolve standards for high-quality, training-ready video data across different training regimes.

Undisclosed

()

Singapore
Maybe global
Onsite

Research Engineer, Training & Inference

New
Top rated
Harmonic
Full-time
Full-time
Posted

Maintain and optimize the proprietary reinforcement learning (RL) training and serving infrastructure with total stack ownership, including the Python API to CUDA kernels, to achieve peak performance for foundation model workloads. Maximize throughput of the RL system from data generation to model training utilizing sharded multi-node training and inference algorithms. Optimize the inference stack for high-throughput RL and low-latency large language model (LLM) production traffic by tuning the inference engine, router, scheduler, and custom kernels if necessary. Identify and resolve performance bottlenecks in distributed clusters to ensure optimal throughput and memory efficiency for multi-billion parameter models, balancing memory constraints with compute-heavy training cycles.

$200,000 – $450,000
Undisclosed
YEAR

(USD)

Palo Alto, United States
Maybe global
Onsite

Director, Forward Deployed Engineering

New
Top rated
Scale AI
Full-time
Full-time
Posted

As a Production AI Ops Lead, you will design and develop the production lifecycle of full-stack AI applications, while supporting end-to-end system reliability, real-time inference observability, sovereign data orchestration, high-security software integration, and the resilient cloud infrastructure required for international government partners. You will take full accountability for the long-term performance and reliability of AI use cases deployed across international government agencies. You will oversee the end-to-end health of the platform, ensuring seamless integration between the AI core and all full-stack components from APIs to UI to maintain a responsive and production-ready environment. You will build automated systems to monitor model performance and data drift across geographically dispersed environments, ensuring the right levels of reliability. You will manage the technical lifecycle within diverse regulatory frameworks. You will lead the response for production issues in mission-critical environments, ensuring rapid resolution and building guardrails to prevent recurrence. You will translate deep technical performance metrics into clear insights for senior international government officials. You will partner with Engineering and ML teams to ensure lessons learned in the field directly influence the technical architecture and decisions of future use cases.

Undisclosed

()

London, United Kingdom
Maybe global
Onsite

Applied ML Researcher (Force Fields and Simulation)

New
Top rated
CuspAI
Full-time
Full-time
Posted

In this role, you will train, fine-tune, and distill machine learning force fields and research and develop novel ML force field architectures suited to production simulation workloads. You will integrate these models into public and in-house high-performance simulators and develop training and inference architectures for large-scale training, data generation, and simulation. You will distribute these workloads via Ray to scale across compute infrastructure and build modular systems so components can be reused across many kinds of chemistry. Additionally, you will build an active learning system that closes the loop between simulation, data generation, and training, develop interfaces that make the system easy for domain scientists to use and extend, and collaborate closely with computational chemists on density functional theory (DFT) data generation and validation.

Undisclosed

()

Amsterdam, Netherlands
Maybe global
Remote

Senior/Staff Machine Learning Engineer - Perception HD Mapping

New
Top rated
Zoox
Full-time
Full-time
Posted

Design and develop novel algorithms and machine learning models for 2D/3D machine perception and mapping in real-world environments. Contribute to large-scale, automated mapping pipelines. Serve as a technical leader on the team by maintaining coding and machine learning development best practices and making architectural decisions. Help set the vision for the team and build out technical roadmaps. Coordinate cross-functional initiatives and collaborate with engineers from Mapping, Perception, Planner, Simulation, Data Science, and more. Drive the use of metrics and tools to guide development, validate algorithms, and measure progress.

$242,000 – $333,000
Undisclosed
YEAR

(USD)

Foster City, United States
Maybe global
Onsite

Senior Design Producer

New
Top rated
HP IQ
Full-time
Full-time
Posted

As a modeling lead for the AI lab, the primary responsibilities include defining the technical roadmap for the team and supporting the modeling needs across the organization. The role involves defining and establishing best practices to manage the model life cycle from data acquisition to deployment, building tools and platforms to support building and deploying machine learning models on devices with specific constraints, and working closely with different teams to translate user needs into specific modeling requirements. The position also requires defining and driving the AI Lab technical strategy in support of HP’s AI roadmap, making decisions regarding models, runtimes, inference engines, and optimization, leading device AI strategy including tasks such as model compression, quantization, distillation, and hardware-aware optimization across CPUs, GPUs, NPUs, and TPUs, architecting and evolving tooling and platforms for the full model lifecycle including evaluation, deployment, and monitoring, establishing standards and evaluation frameworks to ensure high-quality and safe Gen AI models in production, partnering with cross-functional leaders to align technical direction with product and hardware strategy, mentoring a small group of senior engineers, and operating as a hands-on technical leader who sets direction and moves quickly.

$200,000 – $340,000
Undisclosed
YEAR

(USD)

San Francisco, United States
Maybe global
Onsite

Forward Deployed Engineer Intern

New
Top rated
Labelbox
Intern
Full-time
Posted

As an Applied Research Engineer at Labelbox, you will develop systems and methods to create, analyze, and leverage high-quality human-in-the-loop data for frontier AI model developers. This includes designing and implementing advanced systems that align human feedback into AI training processes such as Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO). You will work on techniques to measure and improve human data quality, develop AI-assisted tools to enhance the data labeling process, and investigate how different types of human feedback impact model performance and alignment. Your work will involve optimizing human feedback collection through novel algorithms, integrating breakthroughs into Labelbox's product suite to make human-AI alignment scalable, engaging with customers and the AI community to understand data needs and share best practices, publishing research, exploring new frontiers in human-AI collaboration, creating technical documentation, blog posts, and educational content, and driving industry innovation through these activities.

$250,000 – $300,000
Undisclosed
YEAR

(USD)

San Francisco or Wrocław, United States or Poland
Maybe global
Hybrid

Want to see more Machine Learning Engineer jobs?

View all jobs

Access all 4,256 remote & onsite AI jobs.

Join our private AI community to unlock full job access, and connect with founders, hiring managers, and top AI professionals.
(Yes, it’s still free—your best contributions are the price of admission.)

Frequently Asked Questions

Have questions about roles, locations, or requirements for Machine Learning Engineer jobs?

Question text goes here

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.

[{"question":"What does a Machine Learning Engineer do?","answer":"Machine Learning Engineers design, build, and deploy AI systems that solve real-world problems. They transform research prototypes into production-ready solutions by creating scalable ML pipelines, optimizing model performance, and handling data preprocessing workflows. They integrate models with applications via APIs, implement monitoring systems, and ensure models perform reliably in production environments. Daily tasks include collaborating with data scientists, fine-tuning algorithms, building deployment infrastructure, and maintaining data privacy. They work across diverse applications like recommendation engines, fraud detection systems, and computer vision tools while ensuring models remain accurate and efficient."},{"question":"What skills are required for Machine Learning Engineer jobs?","answer":"Strong programming skills in Python are fundamental, alongside proficiency with ML frameworks like TensorFlow and PyTorch. Machine Learning Engineers need solid mathematics and statistics knowledge, particularly in linear algebra, calculus, and probability theory. Experience with cloud platforms (AWS, GCP, Azure) is essential for deploying models at scale. Skills in data preprocessing, feature engineering, and model evaluation are critical for building effective systems. Engineers should understand MLOps practices, RESTful APIs, containerization tools like Docker, and version control systems. Practical experience with deep learning architectures and natural language processing is valuable for specialized roles."},{"question":"What qualifications are needed for Machine Learning Engineer jobs?","answer":"Most Machine Learning Engineer positions require a bachelor's degree in computer science, mathematics, or related field, with many employers preferring advanced degrees for senior roles. Beyond formal education, employers value demonstrated experience building and deploying machine learning models. A strong portfolio showcasing completed projects is often more important than academic credentials alone. Relevant certifications from cloud providers or in specific ML frameworks can strengthen applications. Employers look for candidates with verifiable experience in model deployment, optimization, and maintenance. Knowledge of software engineering best practices like testing, version control, and documentation is increasingly essential in this hybrid role."},{"question":"What is the salary range for Machine Learning Engineer jobs?","answer":"Machine Learning Engineer salaries vary based on several key factors. Geographic location significantly impacts compensation, with tech hubs like San Francisco, Seattle, and New York typically offering higher wages. Experience level creates substantial differences, with senior engineers earning considerably more than entry-level positions. Specialized expertise in areas like computer vision, reinforcement learning, or NLP can command premium compensation. Company size and industry also influence pay scales, with large tech companies and finance firms often offering higher salaries than startups or non-profits. Educational background, portfolio quality, and demonstrated impact on previous business outcomes further affect earning potential."},{"question":"How long does it take to get hired as a Machine Learning Engineer?","answer":"The hiring timeline for Machine Learning Engineer positions typically ranges from 4-12 weeks, depending on the company's hiring process and your qualifications. The interview process often includes technical screenings, coding challenges, system design discussions, and model implementation exercises. Candidates with strong portfolios demonstrating deployed ML projects may progress more quickly through initial screens. Specialized roles requiring expertise in deep learning or specific domain knowledge might have longer evaluation periods. Companies often test both theoretical understanding and practical implementation skills through multi-stage interviews. Building relationships with hiring managers through professional networks can sometimes accelerate the process."},{"question":"Are Machine Learning Engineer jobs in demand?","answer":"Machine Learning Engineer jobs remain in high demand across industries as organizations implement AI solutions to solve complex problems. Companies actively recruit ML Engineers for applications in recommendation systems, fraud detection, computer vision, natural language processing, and autonomous technologies. The role's hybrid nature—combining software engineering and data science expertise—makes qualified candidates particularly valuable. Organizations need specialists who can both develop models and deploy them in production environments. While the field is competitive, professionals with demonstrated experience building and maintaining ML systems at scale continue to find strong opportunities, especially those with specialized knowledge in emerging areas like reinforcement learning."},{"question":"What is the difference between Machine Learning Engineer and Data Scientist?","answer":"Machine Learning Engineers focus on implementing and deploying models in production environments, while Data Scientists concentrate on research, analysis, and prototype development. ML Engineers build scalable pipelines, optimize model performance, and create deployment infrastructure using software engineering practices. Data Scientists explore data, develop statistical insights, and experiment with algorithms to solve business problems. ML Engineers work extensively with frameworks like TensorFlow and deployment tools, whereas Data Scientists may spend more time with analytical tools and statistical methods. While Data Scientists uncover patterns and build proofs of concept, ML Engineers transform these prototypes into robust, production-ready systems that can operate at scale."}]