AI Research Scientist Jobs

Discover the latest remote and onsite AI Research Scientist roles across top active AI companies. Updated hourly.

Check out 338 new AI Research Scientist opportunities posted on AI Chopping Block

Research Engineer - Evals

New
Top rated
AGI Inc
Full-time

Build the eval harness for AGI covering model capability, agentic behavior, on-device performance, and end-user experience. Own eval suites gating every model and agent release, including capability, behavior, regressions, and human-rated rubrics. Maintain dashboards and tooling to facilitate fast researcher experiment loops and informed leadership decisions. Set and uphold the criteria for what counts as ready to ship. Assist research by ensuring measurements align with goals. Aid product engineers by instrumenting real-user behavior on devices. Support partnerships by translating performance improvements into measurable terms for OEM partners.

Undisclosed

San Francisco, United States
Maybe global
Onsite

Senior Scientist, Analytical Chemistry

New
Top rated
Osmo
Full-time

The Senior Scientist is responsible for owning the end-to-end analytical strategy for GC-MS-based programs, including method design, validation frameworks, and data quality standards for targeted and untargeted analyses. They define and evolve sample preparation methodologies for headspace, liquid-phase, and solid-phase extraction of fragrance compounds from complex matrices and consumer products. They maintain and improve Osmo's high-throughput analytical pipeline, ensuring data integrity, reproducibility, and compatibility with downstream machine learning workflows. The role involves partnering with the Platform and ML teams as the chemistry-side technical owner of the data interface, determining methods and procedures for new analytical assignments independently while coordinating execution across team members and collaborating functions. They enforce high standards of scientific rigor and data quality, mentor and develop junior and mid-level scientists, establish best practices, review work for scientific integrity, and elevate the team’s overall analytical capability. Additional responsibilities include writing, editing, and auditing analytical and experimental protocols, serving as an internal expert resource and external-facing collaborator for analytical chemistry questions across Osmo’s scientific and commercial programs.

$150,000 – $180,000 per year (USD)

Elizabeth, United States
Maybe global
Onsite

Researcher, Context - Agent Post-Training

New
Top rated
OpenAI
Full-time

As a Context Researcher on the Agent Post-Training team, the role involves designing and running experiments to improve the scaling of compute on context. The researcher will own end-to-end improvements to the post-training stack, including reinforcement learning, data pipelines, graders, reward signals, evaluations, diagnostics, and model-behavior analysis. Responsibilities include building evaluations and environments to identify model failures and turning those failures into training data, product fixes, or new research directions. The researcher will partner with Codex and ChatGPT product teams to translate product signals into model improvements and work on early-training and alignment interventions such as data mixtures, objectives, synthetic data, and evaluation loops to shape downstream agent behavior. The role involves deciding which integrations, capabilities, and fixes are ready for major model runs, improving machinery for large-scale training and launch including experiment velocity, reliability, observability, reproducibility, cost, latency, and production readiness. The researcher will take on cross-functional projects involving model training, product infrastructure, and the production agent harness and debug failures in shipped or near-shipped models by developing hypotheses, experiments, and fixes from qualitative behaviors.

$250,000 – $380,000 per year (USD)

San Francisco, United States
Maybe global
Remote

Researcher, Connectors - Agent Post-Training

New
Top rated
OpenAI
Full-time

As a member of Agent Post-Training, Connectors, you will teach models how to interface with professional software using code, helping train agents to use code, APIs, tools, and structured integrations to operate across applications like Slack, Google Workspace, GitHub, Notion, Linear, Salesforce, and other core systems. You will design and run experiments to improve agentic model behavior for complex software and plugins, own end-to-end improvements to the post-training stack including RL, data pipelines, graders, reward signals, evaluations, diagnostics, and model behavior analysis, and build evaluations and environments that expose model failures to turn those failures into training data, product fixes, or new research directions. You will partner with product teams to understand user needs and translate product signals into model improvements, work on early-training and alignment interventions such as data mixtures, objectives, synthetic data, and evaluation loops, and decide which integrations and capabilities to include in major model runs. Additionally, you will improve large-scale training and launch infrastructure for experiment velocity, reliability, observability, reproducibility, cost, latency, and production readiness, take on cross-functional projects touching model training, product infrastructure, and the production agent harness, and debug failures in shipped or near-shipped models to develop concrete hypotheses, experiments, and fixes.

$250,000 – $380,000 per year (USD)

San Francisco, United States
Maybe global
Remote

Researcher, Computer Use - Agent Post-Training

New
Top rated
OpenAI
Full-time

As a member of Agent Post-Training, Computer Use, you will teach models to operate computers, helping to train models that can navigate browsers and desktops, use tools and applications, reason through complex workflows, collaborate with users and other agents, and complete long-horizon tasks with reliability and judgment. Responsibilities include designing and running experiments to improve agentic model behavior for complex computer use, and owning end-to-end improvements to the post-training stack such as reinforcement learning, data pipelines, graders, reward signals, evaluations, diagnostics, and model-behavior analysis. You will build evaluations and environments to identify model failures and convert those into training data, product fixes, or research directions. The role involves partnering with product teams to understand user needs and translate product signals into model improvements, working on early-training and alignment interventions, deciding on suitable integrations and fixes for major model runs, and improving large-scale training and launch machinery regarding experiment velocity, reliability, observability, reproducibility, cost, latency, and production readiness. You will also handle cross-functional projects involving model training, product infrastructure, and the production agent harness, debug failures in shipped or near-shipped models, and transform qualitative model behavior into concrete hypotheses, experiments, and fixes.

$250,000 – $380,000 per year (USD)

San Francisco, United States
Maybe global
Onsite

Researcher, Artifacts - Agent Post-Training

New
Top rated
OpenAI
Full-time

As a member of Agent Post-Training, Artifacts, the role involves training frontier models to produce polished, useful work products such as documents, spreadsheets, slide decks, dashboards, reports, analyses, and other interactive or editable artifacts. Responsibilities include designing and running experiments to improve agentic model behavior for complex software and plugins, and owning end-to-end improvements to the post-training stack including reinforcement learning, data pipelines, graders, reward signals, evaluations, diagnostics, and model-behavior analysis. The role involves building evaluations and environments to identify new model failures and converting these failures into training data, product fixes, or new research paths. Collaboration with Codex and ChatGPT product teams to translate product signals into model improvements is required. Other duties include working on early-training and alignment interventions, deciding integration and capability readiness for major model runs, improving machinery for large-scale training and launch regarding experiment velocity, reliability, observability, reproducibility, cost, latency, and production readiness, and undertaking cross-functional projects that involve model training, product infrastructure, and production agent systems. Debugging hard failures in shipped or near-shipped models and transforming qualitative behaviors into hypotheses, experiments, and fixes are also part of the role.

$250,000 – $380,000 per year (USD)

San Francisco, United States
Maybe global
Remote

Applied AI Researcher, Multi-Agent Systems

New
Top rated
Distyl
Full-time

The Multi-Agent Systems team focuses on designing architectures in which multiple agents coordinate to solve problems that require structured interaction across multiple reasoning processes. Researchers build systems that structure communication, route information, and coordinate decision-making across agents operating with different views of the problem. Researchers investigate the interaction patterns that govern how agents collaborate, studying how agents exchange information, critique and refine each other’s reasoning, and coordinate execution across complex workflows. Their work identifies the mechanics behind effective communication, delegation, and coordination, establishing the design language for how systems of agents can operate as cohesive, high-performing teams, with capabilities that arise from interaction rather than individual performance.

$150,000 – $250,000 per year (USD)

San Francisco or New York, United States
Maybe global
Hybrid

Research Scientist, Safety Post Training

New
Top rated
Scale AI
Full-time

The role involves owning the production outcome and taking full accountability for the long-term performance and reliability of AI use cases deployed across international government agencies. It includes ensuring full-stack integrity by overseeing the end-to-end health of the platform and seamless integration between the AI core and all full-stack components, from APIs to UI, to maintain a responsive and production-ready environment. Responsibilities also cover scaling the feedback loop by building automated systems to monitor model performance and data drift across geographically dispersed environments for appropriate reliability. Managing the technical lifecycle within diverse regulatory frameworks and leading the response for production issues in mission-critical environments to ensure rapid resolution and prevent recurrence are also required. Additionally, the role entails translating deep technical performance metrics into clear insights for senior international government officials and partnering with Engineering and ML teams to influence the technical architecture and decisions of future use cases based on lessons learned in the field.

Undisclosed

San Francisco or New York, United States
Maybe global
Onsite

Research Scientist (Singapore)

New
Top rated
Cantina Labs
Full-time

Drive foundational research on video generation models, taking ownership across the full research cycle and driving post-training research. Collaborate closely with data, infrastructure, and adjacent modeling teams to translate research findings into durable model improvements. Build and maintain scalable systems for ingesting, preprocessing, and delivering large-scale video data for model training. Design and scale distributed data pipelines for preprocessing, dataset generation, and repeated dataset refreshes. Own workflow orchestration, job scheduling, monitoring, and failure recovery for large-scale data processing jobs. Implement and maintain containerized pipeline infrastructure using Kubernetes or equivalent orchestration systems. Optimize cloud-based data storage and movement across providers (AWS, GCS, or Azure) for cost, throughput, and operational efficiency. Define and implement best practices for dataset storage layout, versioning, caching, retention, and access patterns. Build tooling to support deduplication workflows at scale, including near-dedup pipelines over large video corpora. Research and develop distillation methods for large-scale diffusion and flow-based video generation models, including guidance distillation and adversarial distillation, focusing on preserving or improving generation quality while reducing inference cost. Develop reward models and preference-based fine-tuning pipelines that align video generation quality with human judgments across aesthetics, motion quality, and prompt adherence. Analyze the relationship between base model behavior and post-training outcomes, working with the foundation model team to inform pretraining decisions accordingly.

Undisclosed

Singapore
Maybe global
Onsite

Researcher, Alignment Oversight

New
Top rated
OpenAI
Full-time

As a researcher on the Alignment Oversight team, you will design and run experiments to improve oversight of increasingly capable AI models, involving model training, evaluation design, and research infrastructure. Responsibilities include deploying practical systems for action monitoring, red-teaming, and human-in-the-loop control; developing evaluations for alignment failure modes of frontier models, such as overeagerness and instruction following failures; analyzing deployment data to understand model failures and oversight gaps; developing techniques to feed oversight signals back into training while preserving oversight reliability; producing publishable research advancing alignment science; collaborating with research, product, security, safety, and engineering teams to implement alignment ideas; and rapidly moving from research intuition to working experiments, prototypes, and evidence that inform future model improvements.

$250,000 – $445,000 per year (USD)

San Francisco, United States
Maybe global
Hybrid

Frequently Asked Questions

Have questions about roles, locations, or requirements for AI Research Scientist jobs?

What does an AI Research Scientist do?

AI Research Scientists conduct research to advance artificial intelligence by developing novel algorithms, techniques, and methodologies. They design experiments, build models, test theories, and analyze results to create new AI capabilities. These researchers implement prototypes using machine learning frameworks, validate systems, and document findings. They frequently publish in academic journals and present at conferences. AI Research Scientists collaborate with cross-functional teams to apply research findings to real-world problems. They also mentor junior researchers, provide technical leadership, and continuously monitor emerging AI trends in specialized areas like deep learning, natural language processing, and computer vision.

What skills are required for AI Research Scientists?

AI Research Scientists need strong theoretical knowledge in mathematics, statistics, and computational methods. Programming proficiency in Python and frameworks like TensorFlow or PyTorch is essential. They must excel at experimental design, hypothesis testing, and data analysis. Critical thinking and problem-solving abilities help navigate complex research challenges. Expertise in specific AI domains such as deep learning, reinforcement learning, or natural language processing is typically required. Communication skills for publishing papers and presenting findings are crucial. Collaboration abilities support interdisciplinary work with engineers, domain experts, and stakeholders. Ethical research practices and knowledge of research methodologies round out the necessary skillset.

What qualifications are needed for AI Research Scientists?

Most AI Research Scientist positions require a PhD in artificial intelligence, machine learning, computer science, or related fields. Employers like Meta explicitly specify this educational requirement in job postings. Candidates need demonstrated expertise in specific AI subfields such as machine learning, deep learning, or specialized areas like large language models. A strong publication record in peer-reviewed journals or at major AI conferences (NeurIPS, ICML, ICLR) is typically expected. Prior research experience developing novel algorithms and conducting experiments is essential. Some positions may accept exceptional candidates with Master's degrees who have substantial research contributions or publications in relevant AI domains.

What is the salary range for AI Research Scientists?

Salaries for AI Research Scientists vary based on several factors including education level, research specialty, publication record, and prior contributions to the field. Geographic location significantly impacts compensation, with positions in tech hubs like San Francisco or New York typically paying more. Employer type affects pay scales: research positions at top tech companies often offer higher compensation than academic or nonprofit research labs. Experience level creates substantial variation, with senior scientists commanding significantly higher salaries. Specialized expertise in high-demand areas like large language models or reinforcement learning can command premium compensation. Many roles include additional compensation through research bonuses, stock options, or conference funding.

How long does it take to get hired as an AI Research Scientist?

The hiring process for AI Research Scientists typically takes 2-4 months from application to offer. The timeline includes initial screening, technical interviews assessing research expertise, and evaluation of published work. Many employers require candidates to present previous research or complete a research proposal task. PhD candidates may face longer timelines as companies evaluate their dissertation research and publication potential. The process often includes multiple rounds of interviews with research teams and leadership. Specialized positions focusing on cutting-edge areas like foundation models or AI safety may have extended evaluation periods as employers carefully assess candidates' expertise in these emerging fields.

Are AI Research Scientists in demand?

AI Research Scientists are currently in high demand, with major organizations like Meta, OpenAI, and leading research institutions actively recruiting. Demand is particularly strong in specialized areas such as large language models, generative AI, reinforcement learning, and AI safety. Research institutions, universities, tech firms, and even freelance opportunities are available across subfields like NLP, robotics, and computer vision. The push to advance AI capabilities drives consistent demand for researchers who can develop novel algorithms and techniques. Competition remains fierce for top positions, with employers seeking candidates who have demonstrated innovation through published research, conference presentations, and practical implementations of theoretical work.

What is the difference between AI Research Scientist and Data Scientist?

AI Research Scientists focus on creating new AI algorithms and advancing theoretical foundations, while Data Scientists primarily analyze existing data to extract insights and solve business problems. Research Scientists typically need PhDs and publish academic papers, whereas Data Scientists often work with Master's degrees and produce business reports. The research role requires deeper mathematical understanding and develops novel techniques, while Data Scientists apply established methods to specific datasets. AI Research Scientists work on longer-term theoretical projects that may take months or years, whereas Data Scientists typically deliver results on shorter timelines with immediate business applications. The research position emphasizes innovation, while data roles prioritize practical implementation.