
The AI job market moves fast. We keep up so you don't have to.

Fresh roles added daily, reviewed for quality — across every corner of the AI ecosystem.


New AI Opportunities


AI Deployment Engineer

OpenAI
Japan
Full-time
Remote: No
About the Team
The Technical Success team is responsible for ensuring developers and enterprises are successful in building scalable production applications with the OpenAI API platform. We guide and support customers to achieve maximum benefits, value, and adoption from deploying our highly-capable models. OpenAI's customers represent a range of diverse backgrounds and maturity, from early-stage startups to established global enterprises.

About the Role
We are looking for a technically savvy and business-minded AI Deployment Engineer to deeply partner with our most strategic and high-impact platform customers, guiding them through application ideation, development, delivery, and scale to accelerate and maximize the value of what they build with our platform. You will have the opportunity to work on the most novel and creative use cases being built on our API, serving as a critical partner for collecting and delivering high fidelity feedback to Product and Research teams.

This role is based in Tokyo, Japan. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:
- Deeply embed with our most strategic platform customers, serving as their technical thought partner in ideating and building novel applications on our API.
- Proactively provide guidance to our customers on how to maximize business impact from their applications, accelerating their time to value.
- Experiment and prototype solutions with and for your customers.
- Forge and manage relationships with our customers’ leadership and stakeholders to ensure their application’s successful deployment and scale.
- Contribute to our open-source developer and enterprise resources.
- Scale the AI Deployment Engineering function through sharing knowledge, codifying best practices, and publishing notebooks to our internal and external repositories.
- Validate, synthesize, and deliver high-signal feedback to the Product and Research teams.
- Use your expertise in programming with Python and JavaScript.

You’ll thrive in this role if you:
- Have 5+ years of technical consulting (or equivalent) experience.
- Are proficient in Python and JavaScript.
- Are fluent in both Japanese and English, as this is essential for partnering with customers, providing technical expertise, demonstrating value, and collaborating effectively with teams at headquarters.
- Have built and/or delivered prototypes on top of our API platform.
- Have led complex technical projects and programs with many stakeholders.
- Can proactively identify opportunities for maximizing our customers’ business value through leveraging the OpenAI API.
- Own problems end-to-end, and are willing to pick up whatever knowledge you're missing to get the job done to ensure both your team and our customers succeed.
- Have a humble attitude and an eagerness to help others with empathy.
- Operate with high horsepower, are adept at frequent context switching and working on multiple projects at once with expansive ownership, and ruthlessly prioritize.
- Thrive in dynamic environments and can navigate ambiguity with ease.

About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement.

Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Member of Technical Staff - Pre-Training Infra

Reflection
United States
Full-time
Remote: No
Our Mission
Reflection’s mission is to build open superintelligence and make it accessible to all.

We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders come from DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic and beyond.

About the Role
- Build and scale distributed training systems that power frontier model pre-training.
- Work closely with research teams to design and operate large-scale training runs for foundation models.
- Develop infrastructure that enables efficient training across thousands of GPUs using modern distributed training frameworks.
- Optimize training throughput, stability, and efficiency for large model training workloads.
- Collaborate directly with pre-training researchers to translate experimental ideas into scalable, production-ready training systems.
- Improve performance of distributed training workloads through optimization of communication, memory usage, and GPU utilization.
- Build and maintain training pipelines that support large-scale datasets, checkpointing, and experiment iteration.
- Debug and resolve performance bottlenecks across distributed training stacks including model parallelism, GPU communication, and training runtime systems.
- Contribute to the development of systems that enable rapid experimentation and iteration on new training techniques.

Ideal Experience
- Experience building or operating distributed training systems for large machine learning models.
- Strong experience working with modern distributed training frameworks such as Megatron, DeepSpeed, or similar large-scale training systems.
- Familiarity with large-scale model parallelism strategies (data, tensor, pipeline, or expert parallelism).
- Experience optimizing training throughput and GPU utilization in large distributed environments.
- Familiarity with GPU communication libraries such as NCCL and performance tuning for distributed workloads.
- Experience working closely with ML researchers to productionize experimental training workflows.
- Strong debugging skills across GPU compute, distributed training systems, and large-scale ML pipelines.
- Experience working with large datasets and training pipelines used for foundation model pre-training.

What We Offer:
We believe that to build superintelligence that is truly open, you need to start at the foundation. Joining Reflection means building from the ground up as part of a small talent-dense team. You will help define our future as a company, and help define the frontier of open foundational models.

We want you to do the most impactful work of your career with the confidence that you and the people you care about most are supported.

- Top-tier compensation: Salary and equity structured to recognize and retain the best talent globally.
- Health & wellness: Comprehensive medical, dental, vision, life, and disability insurance.
- Life & family: Fully paid parental leave for all new parents, including adoptive and surrogate journeys. Financial support for family planning.
- Benefits & balance: paid time off when you need it, relocation support, and more perks that optimize your time.
- Opportunities to connect with teammates: lunch and dinner are provided daily. We have regular off-sites and team celebrations.

Member of Technical Staff - Mid-Training Infra

Reflection
United States
Full-time
Remote: No
Our Mission
Reflection’s mission is to build open superintelligence and make it accessible to all.

We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders come from DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic and beyond.

About the Role
- Design, build, and operate large-scale GPU infrastructure for high-throughput model inference and mid-training workloads.
- Develop systems that power synthetic data generation and reinforcement learning pipelines at scale.
- Build high-performance inference platforms capable of serving and evaluating models across thousands of GPUs.
- Optimize throughput, latency, and GPU utilization for large language model inference and rollout workloads.
- Build infrastructure that supports reinforcement learning pipelines, including large-scale rollout generation, evaluation, and policy improvement loops.
- Work closely with research teams to support distributed RL workloads and large-scale model evaluation infrastructure.
- Improve performance of model execution through kernel-level optimization, model parallelism strategies, and GPU runtime improvements.
- Develop distributed systems that enable large-scale synthetic data generation and RL-driven training workflows.
- Diagnose and resolve performance bottlenecks across inference runtimes, GPU kernels, networking, and distributed compute systems.

Ideal Experience
- Experience deploying and operating large-scale GPU systems for inference or model serving.
- Several years of hands-on experience building and running production infrastructure.
- Strong understanding of GPU performance characteristics and optimization techniques.
- Experience working with modern inference frameworks such as SGLang, Megatron, or similar high-performance LLM runtimes.
- Familiarity with distributed reinforcement learning infrastructure or rollout generation systems.
- Experience optimizing throughput for large-scale model execution workloads.
- Experience working with GPU kernels or low-level performance optimization.
- Familiarity with infrastructure used for synthetic data pipelines or RL training workflows.
- Experience debugging performance issues across GPU, networking, and distributed execution layers.

What We Offer:
We believe that to build superintelligence that is truly open, you need to start at the foundation. Joining Reflection means building from the ground up as part of a small talent-dense team. You will help define our future as a company, and help define the frontier of open foundational models.

We want you to do the most impactful work of your career with the confidence that you and the people you care about most are supported.

- Top-tier compensation: Salary and equity structured to recognize and retain the best talent globally.
- Health & wellness: Comprehensive medical, dental, vision, life, and disability insurance.
- Life & family: Fully paid parental leave for all new parents, including adoptive and surrogate journeys. Financial support for family planning.
- Benefits & balance: paid time off when you need it, relocation support, and more perks that optimize your time.
- Opportunities to connect with teammates: lunch and dinner are provided daily. We have regular off-sites and team celebrations.

Senior Infrastructure Engineer

Bland
$120,000 – $200,000
United States
Full-time
Remote: No
About Bland
At Bland.com, our goal is to empower enterprises to make AI phone agents at scale. Based out of San Francisco, we're a quickly growing team striving to change the way customers interact with businesses. We've raised $65 million from Silicon Valley's finest, including Emergence Capital, Scale Venture Partners, YC, the founders of Twilio, Affirm, ElevenLabs, and many more.

About the Role
As a Senior Infrastructure Engineer at Bland, you'll help us build the backbone that enables millions of AI-powered phone conversations. You're not just keeping servers running; you're architecting distributed systems that handle real-time voice processing, scale ML inference, and integrate with enterprise telephony infrastructure. Your work directly determines whether our platform can handle business-defining call volumes for our customers, or leaves them with dead air.

What You'll Do
- Contribute to the design of scalable architecture: Build distributed systems using Kubernetes that handle high-volume, real-time voice processing with strict latency and reliability requirements.
- Build and support ML infrastructure: Create and optimize the infrastructure supporting our AI models, from training pipelines to real-time inference serving across multiple regions.
- Integrate with telephony: Maintain robust connections between our platform and complex enterprise phone systems, SIP trunks, and VoIP infrastructure.
- Recognize flaws, control for them: We’re building a new type of architecture that takes something from Column A and something from Column B. We’re never going to get it perfect, so you’ll be helping us keep a lookout for what we need to solve.
- Ensure reliability: Implement monitoring, alerting, and incident response systems that keep our platform running 24/7 with enterprise-grade uptime.
- Scale with growth: Anticipate and solve scaling challenges before they become problems. Our call volume grows exponentially, and infrastructure needs to stay ahead.
- Security and compliance: Implement security best practices and compliance requirements for enterprise customers in regulated industries.

Interesting Problems to Own
- Old-Meets-New: Telephone calls have been around for a while. Now, with an explosion in modern technologies, come interesting new ways to wrangle old-school protocols and techniques. You’ll have the space to be creative and really own a new emergent type of architecture.
- Sizable call volumes require new approaches: Understand and deeply invest in ensuring that we can match any call volume from our customers’ customers! We need unique solutions that you’ll help us discover along the way.
- Streaming Architectures: On top of building to support our APIs, you’ll also be helping maintain the reliability, failover, and scaling of our important stream-based traffic.

What Makes You a Great Fit
- Infrastructure expertise: 5+ years building and scaling distributed systems, with deep knowledge of cloud infrastructure (AWS/GCP preferred).
- You “get” the fundamentals, and beyond: For example, you can casually tell someone how TLS works beyond buzzwords, do a quick sketch of how different load balancing strategies work, or even tell us the obscure thing you fell asleep reading about last night. There isn’t a blank stare, there’s an excitement to share.
- Real-time systems experience: You've built systems that handle high-throughput, low-latency workloads, streaming, real-time processing, or similar.
- Startup mentality: You've worked at fast-growing companies where you wear multiple hats and solve problems as they come up.
- You’re opinionated, but you’re not alienating: You accept that opinions drive progress, but you don’t intend to break into alienating discussions at the risk of not finding compromises for our customers.
- You’re familiar with some tools/components like: Cloudflare, HAProxy, Go, TypeScript, Datadog, Terraform, Docker, Kubernetes, Nvidia hardware (NVLink, for example), and anything in between.

Bonus Points If You Have
- Experience with telephony systems (SIP, VoIP, WebRTC).
- Background in ML infrastructure, model serving, or GPU computing.
- Experience with real-time audio/video processing.

Benefits and Pay:
- Healthcare, dental, vision, all the good stuff
- Meaningful equity in a fast-growing company
- Every tool you need to succeed
- Beautiful office in Jackson Square, SF with rooftop views

If you don't have the perfect experience, that is fine! We're a bunch of drop-outs and hackers. Working at a start-up is really hard. We work a lot and we figure things out on the fly.

Compensation Range: $120,000-$200,000

Applied AI, Technical Lead, Forward Deployed AI Engineer - Montreal

Mistral AI
Canada
Full-time
Remote: No
About Mistral
At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

We democratize AI through high-performance, optimized, open-source and cutting-edge models, products, and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.

We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany, and Singapore. We are creative, low-ego, and team-spirited.

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on mistral.ai/careers.

About The Job:
Mistral AI is seeking a Technical Lead, Applied AI to drive the technical strategy, execution, and delivery of complex AI solutions for our enterprise customers. In this role, you will lead project teams of Applied AI Engineers, ensuring the successful deployment of Mistral AI products and the development of high-impact, scalable AI use cases.

You will act as the primary technical point of contact for our most strategic customers, guiding them through the entire lifecycle, from pre-sales to post-implementation, while collaborating closely with research, product, and engineering teams to shape the future of our offerings.

As a Technical Lead, you will bridge the gap between cutting-edge AI research and real-world enterprise applications, ensuring our solutions are robust, scalable, and aligned with both customer needs and Mistral’s technological vision.

What you will do
- Deliver, as an IC, the critical lines of code of our complex projects. You’ll be hands-on and de-risk the critical parts of those projects, staying deeply involved in coding, reviewing, and optimizing AI solutions.
- Lead technical teams of Applied AI Engineers, providing mentorship, technical guidance, and best practices for deploying state-of-the-art GenAI applications across industries.
- Lead technical discussions during pre-sales, translating customer requirements into actionable solutions and communicating Mistral’s technological advantages to diverse stakeholders.
- Design and oversee the implementation of complex AI systems, including fine-tuning, RAG, agentic workflows, and custom LLM applications, ensuring alignment with Mistral’s product roadmap and open-source initiatives.
- Drive innovation by identifying emerging trends in AI, evaluating new tools and methodologies, and championing best practices for fine-tuning, inference, and deployment.
- Work closely with product managers, researchers, and engineers to ensure seamless integration of customer feedback into Mistral’s product development cycle.

How We Work in Applied AI
- We care about people and outputs.
- What matters is what you ship, not the time you spend on it.
- Bureaucracy is where urgency goes to vanish. You talk to whoever you need to talk to. The best idea wins, whether it comes from a principal engineer or someone in their first week.
- Always ask why. The best solutions come from deep understanding, not from copying what worked before.
- We say what we mean. Feedback is direct, timely, and given because we care.
- No politics. Low ego, high standards.
- We embrace an unstructured environment and find joy in it.

About you
- You are fluent in French and English.
- You hold a PhD or Master’s degree in AI, Machine Learning, Computer Science, or a related field.
- You have 7/8+ years of experience in AI/ML, with at least 2 years in a technical leadership role (e.g., Tech Lead, Engineering Manager, Staff Engineer or Solutions Architect) focused on AI products or enterprise solutions.
- You have a proven track record of leading teams to deliver complex AI projects, from prototyping to production, in industries such as tech, finance, healthcare, or industrial automation.
- You possess deep expertise in fine-tuning LLMs, advanced RAG, agentic systems, and deploying NLP applications at scale.
- You are proficient in Python, PyTorch, and modern AI frameworks (e.g., LangChain, Hugging Face). Experience with cloud platforms (AWS, GCP, Azure) and MLOps tools is a plus.
- You have strong software engineering skills, including API design, backend/full-stack development, and system architecture.
- You excel in technical communication, with the ability to articulate complex concepts to both technical and non-technical audiences, including executives and engineers.
- You thrive in fast-paced, collaborative environments and are passionate about mentoring and growing technical talent.

Ideally, you have:
- Contributed to open-source projects, particularly in the LLM or AI space.
- Experience in customer-facing roles (e.g., Solutions Architect, Customer Engineer, or Technical Product Manager) with a focus on enterprise AI adoption.
- A track record of driving technical strategy and influencing product direction based on customer needs and market opportunities.

Why join us?
You’ll have the opportunity to shape the future of AI adoption in enterprises, work with a world-class team, and contribute to open-source projects that impact millions. If you’re excited about leading technical innovation and solving real-world challenges with AI, we’d love to hear from you!

Hardware Technician (TeleOperations)

Figure AI
$150,000 – $250,000
United States
Full-time
Remote: No
Figure is an AI Robotics company developing a general purpose humanoid. Our humanoid robot is designed for commercial tasks and the home. We are based in San Jose, CA and require 5 days/week in-office collaboration. It’s time to build.

Figure’s vision is to deploy autonomous humanoids at a global scale. Our Helix team is seeking an experienced AI Tooling Engineer to enhance our internal, web-based data and AI training tools. This role focuses on developing intuitive web interfaces that support key AI research functions, including robot data annotation, training dataset visualization, and experiment tracking. The ideal candidate has experience building rich, interactive web interfaces using React and TypeScript.

Responsibilities
- Design and build intuitive web interfaces for robot data annotation, dataset visualization, and experiment tracking
- Utilize data-driven techniques to optimize interfaces for efficiency and fast iteration cycles
- Integrate AI models to automate manual tasks
- Work together with AI researchers, robot operators, and annotators to support new user experiences

Requirements
- Strong software engineering fundamentals
- Bachelor's or Master's degree in Computer Science, Robotics, Engineering, or a related field
- Minimum of 4 years of professional, full-time experience building rich, interactive web interfaces
- Proficiency in React and TypeScript

Bonus Qualifications
- Experience using data stores (Postgres, MySQL, ElasticSearch, Redis, etc.)
- Experience managing cloud infrastructure (AWS, Azure, GCP)
- Experience with Tailwind CSS
- Experience building data annotation and dataset management tools

The US base salary range for this full-time position is between $150,000 - $250,000 annually. The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended.

Service Tooling Engineer

Figure AI
$150,000 – $250,000
United States
Full-time
Remote: No
Figure is an AI Robotics company developing a general purpose humanoid. Our humanoid robot is designed for commercial tasks and the home. We are based in San Jose, CA and require 5 days/week in-office collaboration. It’s time to build.

Figure’s vision is to deploy autonomous humanoids at a global scale. Our Helix team is seeking an experienced AI Tooling Engineer to enhance our internal, web-based data and AI training tools. This role focuses on developing intuitive web interfaces that support key AI research functions, including robot data annotation, training dataset visualization, and experiment tracking. The ideal candidate has experience building rich, interactive web interfaces using React and TypeScript.

Responsibilities
- Design and build intuitive web interfaces for robot data annotation, dataset visualization, and experiment tracking
- Utilize data-driven techniques to optimize interfaces for efficiency and fast iteration cycles
- Integrate AI models to automate manual tasks
- Work together with AI researchers, robot operators, and annotators to support new user experiences

Requirements
- Strong software engineering fundamentals
- Bachelor's or Master's degree in Computer Science, Robotics, Engineering, or a related field
- Minimum of 4 years of professional, full-time experience building rich, interactive web interfaces
- Proficiency in React and TypeScript

Bonus Qualifications
- Experience using data stores (Postgres, MySQL, ElasticSearch, Redis, etc.)
- Experience managing cloud infrastructure (AWS, Azure, GCP)
- Experience with Tailwind CSS
- Experience building data annotation and dataset management tools

The US base salary range for this full-time position is between $150,000 - $250,000 annually. The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended.

Deployment Logistics Coordinator

Figure AI
$150,000 – $250,000
United States
Full-time
Remote: No
Figure is an AI Robotics company developing a general purpose humanoid. Our humanoid robot is designed for commercial tasks and the home. We are based in San Jose, CA and require 5 days/week in-office collaboration. It’s time to build.

Figure’s vision is to deploy autonomous humanoids at a global scale. Our Helix team is seeking an experienced AI Tooling Engineer to enhance our internal, web-based data and AI training tools. This role focuses on developing intuitive web interfaces that support key AI research functions, including robot data annotation, training dataset visualization, and experiment tracking. The ideal candidate has experience building rich, interactive web interfaces using React and TypeScript.

Responsibilities
- Design and build intuitive web interfaces for robot data annotation, dataset visualization, and experiment tracking
- Utilize data-driven techniques to optimize interfaces for efficiency and fast iteration cycles
- Integrate AI models to automate manual tasks
- Work together with AI researchers, robot operators, and annotators to support new user experiences

Requirements
- Strong software engineering fundamentals
- Bachelor's or Master's degree in Computer Science, Robotics, Engineering, or a related field
- Minimum of 4 years of professional, full-time experience building rich, interactive web interfaces
- Proficiency in React and TypeScript

Bonus Qualifications
- Experience using data stores (Postgres, MySQL, ElasticSearch, Redis, etc.)
- Experience managing cloud infrastructure (AWS, Azure, GCP)
- Experience with Tailwind CSS
- Experience building data annotation and dataset management tools

The US base salary range for this full-time position is between $150,000 - $250,000 annually. The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended.

Agent Deployment Engineer (Residency Program)

Hippocratic AI
United States
Full-time
Remote: No
About Us
Hippocratic AI is the leading generative AI company in healthcare. We have the only system that can have safe, autonomous, clinical conversations with patients. We have trained our own LLMs as part of our Polaris constellation, resulting in a system with over 99.9% accuracy.

Why Join Our Team
Reinvent healthcare with AI that puts safety first. We’re building the world’s first healthcare-only, safety-focused LLM, a breakthrough platform designed to transform patient outcomes at a global scale. This is category creation.

Work with the people shaping the future. Hippocratic AI was co-founded by CEO Munjal Shah and a team of physicians, hospital leaders, AI pioneers, and researchers from institutions like El Camino Health, Johns Hopkins, Washington University in St. Louis, Stanford, Google, Meta, Microsoft, and NVIDIA.

Backed by the world’s leading healthcare and AI investors. We recently raised a $126M Series C at a $3.5B valuation, led by Avenir Growth, bringing total funding to $404M with participation from CapitalG, General Catalyst, a16z, Kleiner Perkins, Premji Invest, UHS, Cincinnati Children’s, WellSpan Health, John Doerr, Rick Klausner, and others.

Build alongside the best in healthcare and AI. Join experts who’ve spent their careers improving care, advancing science, and building world-changing technologies, ensuring our platform is powerful, trusted, and truly transformative.

Location Requirement
We believe the best ideas happen together. To support fast collaboration and a strong team culture, this role is expected to be in our Palo Alto office five days a week, unless otherwise specified.

About the Role
We're seeking an Agent Deployment Engineer to join our collaborative team of engineers, scientists, and healthcare professionals working on transformative AI solutions. In this role, you'll help develop and maintain our conversation layer using a mix of software and prompting skills to allow our AI agent to facilitate autonomous conversations.

What You’ll Do
- Develop complex prompts in our SOTA conversation planning layer to enable complex agentic model workflows.
- Collaborate with Product Managers, Software Engineers and Clinicians to create fully autonomous, clinical conversations.
- Write, configure and iterate on prompts to create engaging, patient-oriented conversations.
- Use advanced prompting techniques to develop and optimize prompts for language models to improve model performance, clinical safety, and patient experience.
- Conduct experiments and analyze outcomes of model outputs to refine and iterate on prompt strategies.

What You Bring
Must-Have:
- Bachelor’s degree in Computer Science, Computer Engineering, or a related field (or equivalent coursework/projects).
- 2+ years industry experience
- Experience with Python
- Experience with LLM prompting (ChatGPT, Claude, etc.) through professional or personal use
- Exposure to working with databases and building simple RESTful APIs.
- Strong problem-solving mindset and eagerness to learn new technologies.

Nice-to-Have:
- Experience with personal or academic projects involving backend development.
- Basic understanding of AI/ML concepts or a desire to learn about them.
- Knowledge of modern web frameworks (e.g., Flask, Django, or Spring Boot).
- Awareness of data privacy and security best practices, especially in regulated environments.

Please be aware of recruitment scams impersonating Hippocratic AI. All recruiting communication will come from @hippocraticai.com email addresses. We will never request payment or sensitive personal information during the hiring process.

Enterprise Solutions Engineer

Decagon
A$220,000 – A$290,000
Australia
Full-time
Remote: No
About Decagon
Decagon is the leading conversational AI platform empowering every brand to deliver concierge customer experiences.
Our technology enables industry-defining enterprises like Avis Budget Group, Block’s Cash App and Square, Chime, Oura Health, and Hunter Douglas to deploy AI agents that power personalized, deeply satisfying interactions across voice, chat, email, SMS, and every other channel.
We’re building a future where customer experiences are being redefined from support tickets and hold music to faster resolutions, richer conversations, and deeper relationships. We’re proud to be backed by world-class investors who share that vision, including a16z, Accel, Bain Capital Ventures, Coatue, and Index Ventures, along with many others.
We’re an in-office company, driven by a shared commitment to excellence and velocity. Our values (Just Get It Done, Invent What Customers Want, Winner’s Mindset, and The Polymath Principle) shape how we work and grow as a team.

About the Role
Decagon is looking for a Solutions Engineer, a foundational technical hire who will help shape the future of our AI-powered solutions in the enterprise space. This role sits at the intersection of pre-sales engineering, product strategy, and enterprise architecture. You will serve as a technical advisor and trusted consultant to enterprise prospects and customers, driving value-based solutioning and integration of Decagon’s AI-native platform into complex customer ecosystems.

Your background looks something like this
• 4-6 years of customer-facing experience in a sales engineering (pre-sales) role
• Partner with Account Executives to discover and qualify solutions that lead to strong return on investment for your customer
• Create and architect generative AI experiences for our customer’s end users
• Develop custom demonstrations using the Decagon platform tailored to a customer’s specific needs and value
• Communicate complex technical concepts clearly to diverse stakeholders, including C-level decision makers, business users, and engineering stakeholders
• Strong interpersonal and teamwork skills
• Functional understanding of GTM systems and workflows

Even better if you have
• 2+ years of AI sales engineering experience
• Experience in a high-growth startup environment

Benefits
• Medical, dental, and vision benefits
• Take-what-you-need vacation policy
• Daily lunches, dinners and snacks in the office to keep you at your best

AI Enterprise Architect Lead

Zoox
$203,000 – $255,000
United States
Full-time
Remote: No
Zoox is seeking an AI Solutions Architect who will be the primary driver for the design, development, and deployment of Generative AI (GenAI) and Large Language Model (LLM) solutions. You will work across the enterprise, partnering with Procurement, Legal, Finance, HR, and Marketing, to build intelligent agents and automated workflows that transform how our employees work and how our business makes decisions.

In this role, you will:
• Lead the development and deployment of AI-powered platforms and tools that enhance productivity across core business functions such as Finance, HR, IT, Marketing, Procurement, and Supply Chain.
• Identify, evaluate, and implement off-the-shelf AI agents to streamline operations, reduce costs, and accelerate business processes.
• Architect and deploy enterprise-grade LLM applications (RAG, Fine-tuning, Agentic workflows) to automate complex processes.
• Act as the technical consultant for non-technical leads in Finance (forecasting), Legal (contract analysis), HR (talent matching), and Marketing (content personalization).
• Build autonomous AI agents capable of navigating multi-step business tasks with minimal human intervention.
• Collaborate with the Data Analytics team to ensure all AI deployments adhere to strict data privacy, security, and "responsible AI" standards.
• Manage the lifecycle of AI products from prompt engineering and model selection (GPT-4, Claude, Llama 3, etc.) to integration with internal APIs and databases.

Qualifications:
• 5+ years in Data Engineering, Software Engineering, or Data Science, with at least 2 years of hands-on experience deploying GenAI/LLMs in a production enterprise environment.
• Deep proficiency in Python and frameworks such as LangChain, LlamaIndex, or AutoGen.
• Experience with cloud AI services (Azure OpenAI, AWS Bedrock, or Google Vertex AI) and vector databases (Pinecone, Weaviate, or Milvus).
• Proven ability to translate "business pain" into "technical requirements." You need to speak "Legal" and "Finance" as fluently as you speak "Python."

Base Salary Range: $203,000 - $255,000 a year
There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. A sign-on bonus may be offered as part of the compensation package. The listed range applies only to the base salary. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.
Zoox also offers a comprehensive package of benefits, including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.

Agent Engineer

Langdock
€80,000 – €120,000
Germany
Full-time
Remote: No
Help Companies Work Better with AI
Build something that matters.
Right now, over 5,000 companies, from DAX enterprises like Merck to fast-growing startups, use Langdock every day. Their employees open our product to draft strategies, analyze documents, automate workflows, and think through hard problems with AI.
We are the platform that makes this possible: secure, model agnostic, and deeply integrated with how work actually happens.

What "Agent Engineer" Means Here
AI agents are becoming the operating system of how companies work. Not chatbots. Not simple automations. Agents that own tasks end-to-end: reading customer tickets, updating CRMs, delegating subtasks to other agents, and knowing when to escalate to a human.
At Langdock, we are building both the product that lets our customers deploy agents and the internal infrastructure that runs our own. This role sits at that intersection. You will design, build, and operate the AI agents that power critical parts of Langdock's operations, from customer support automation to internal workflows. And the patterns you develop will directly inform how we build the agent platform for thousands of other companies.
This is not a role where you configure tools built by someone else. You will work across the full stack: prompt engineering, orchestration protocols, integrations, reliability, and cost management. You will own agents the way an engineer owns a production service.

What You Will Actually Do
• Own production agents end-to-end. You design, deploy, monitor, and improve the AI agents that handle real customer interactions and internal workflows. When an agent misbehaves, you dig into the logs to identify the root cause and fix it. When it works beautifully, that is also you.
• Build and operate customer support automation. You will monitor Langdock's support experience by supervising agents that resolve issues faster than most humans could, while knowing exactly when to hand off.
• Manage agent reliability, cost, and quality. You track how agents perform, what they cost, where they fail, and why. You set budgets, tune behavior, adjust governance rules, and make sure every agent earns its keep.
• Design the orchestration layer. You define how agent teams are structured, set up approval gates, and build the operational infrastructure that lets autonomous agents work safely at scale.
• Integrate agents with systems. You connect agents to both internal and external APIs, giving them the context and capabilities they need to do real work.
• Experiment constantly. New models drop, new techniques emerge, new use cases surface. You are the person who tries them first, benchmarks them honestly, and ships the ones that actually improve outcomes.

What Makes This Different
This role barely existed two years ago. There is no playbook for it. You are building the discipline of "agent operations" at a company that is also building the platform for it.
The agents you run will directly reach Langdock's 5,000+ customers, meaning your work has an immediate, measurable impact. And because the patterns you develop internally become the patterns we productize for customers, you are not just operating agents. You are defining how agents should be operated.
We went from 2 to 20M+ ARR in one year. You will join early enough to shape how things work, but late enough that we have traction, customers, and a product people love.
You will also learn fast. Our team is small, the scope is large, and the feedback loops are short. People who joined a year ago are now running critical functions. If you are good, you will grow.

You Might Be a Fit If...
• You have built and maintained AI agents or automations in production.
• You are deeply technical. You understand LLMs, token economics, prompt engineering, API design, and orchestration patterns. You can read a protocol spec and immediately see the edge cases.
• You obsess over experimentation and reliability. You think in terms of error rates and customer satisfaction. You instrument everything.
• You are not precious about your agents. When one fails, you find the root cause and fix it. When a simpler approach works better, you kill the clever one.
• You do not just use AI tools daily. You actively nerd out about your setup. You experiment with different models, prompts, workflows, and automations. You have strong opinions about what works and why.
• You would rather own a problem than be told exactly what to do.
• You are a kind person who cares about the people around you.

The Environment
We work from our office in Berlin, Greifswalder Strasse 212. Everyone works together in person because the hardest problems get solved faster at a whiteboard than in a Slack thread. Conversations happen faster, problems get solved quicker, and we actually know each other.
Days start at 8:30. Lunch & dinner are together. We run, go to the gym, and take care of ourselves. Health is not separate from work here, it is part of how we work well.
The vibe is calm but intense. No one is yelling or panicking. But everyone is working hard on things that matter.

Compensation
Salaries are transparent and tied to levels, not negotiation. All roles include equity.
We will figure out the right level together based on your experience and scope. Levels are about the work you own, not your title or years of experience. We narrow down the expected salary range early in the process.

Next steps
We move fast. Most processes complete within two weeks.
If this sounds like your kind of work, we would like to meet you.
Observe.AI

Lead DevOps Engineer

Observe
$108,000 – $170,000
United States
Full-time
Remote: No
(This position is based in Redwood City, CA, and applicants must be able to work on-site three days per week as part of our hybrid schedule.)

About Us
Observe.AI is the leading AI agent platform for customer experience. It enables enterprises to deploy AI agents that automate customer interactions, delivering natural conversations for customers with predictable outcomes for the business. Observe.AI combines advanced speech understanding, workflow automation, and enterprise-grade governance to execute end-to-end workflows with AI agents. It also enables teams to guide and augment human agents with AI copilots, and analyze 100% of human and AI interactions for insights, coaching, and quality management. Companies like DoorDash, Affordable Care, Signify Health, and Verida use Observe.AI to transform customer experiences every day by accelerating service speed, increasing operational efficiency, and strengthening customer loyalty across every channel.

Why Join Us
We’re looking for an AI Agent Engineer to lead the charge in building and deploying enterprise-grade Voice and Chat AI agents and AI Copilot. This role is hands-on, customer-facing, and pivotal in bringing AI solutions to life - from design and integration to deployment and optimization. You’ll own the end-to-end lifecycle of AI agents: building, integrating, testing, demoing to clients, deploying into production, and tuning performance.

What you’ll be doing
• Build & Deploy Agents: Own the full AI agent build process - prompts, workflows, integrations, telephony setup, and evaluation forms.
• Client Engagement: Lead weekly demos, show progress, gather feedback, and act as the primary technical point of contact once a solution is defined.
• Systems Integration: Configure APIs, data maps, authentication, error handling, and connect to CRMs, databases, or knowledge systems.
• Telephony Integration: Set up SIP/CCaaS/PSTN routing, pass metadata, configure fallbacks, and troubleshoot call quality.
• Optimization: Monitor performance, refine prompts, test iteratively, and ensure agents meet automation and containment targets.
• Strategic Partner: Translate customer requirements into actionable solutions; work consultatively to unblock challenges in security, connectivity, or knowledge ingestion.
• Shadow Core Engineering: Collaborate with product/engineering teams for deep technical fixes and platformization, while independently leading client delivery.

What you'll bring to the role
• 3+ years in conversational AI, ML engineering, or system integration with hands-on delivery of AI/LLM-based solutions.
• Strong skills in prompt engineering, workflow building, API integration, and telephony (SIP, Twilio, Amazon Connect, etc.).
• Familiarity with LLMs (GPT, Claude, Gemini), vector DBs, and orchestration frameworks (LangChain, LlamaIndex, etc.).
• ML expertise in embeddings, retrieval-augmented generation (RAG), evaluation frameworks, fine-tuning models, and performance optimization.
• Solid programming skills (Python, JavaScript, or similar).
• Comfort leading customer-facing discussions - from deep technical troubleshooting to weekly project demos.
• Strong problem-solving mindset: ability to find workarounds, unblock integrations, and adapt to customer-specific ecosystems.
• Bachelor’s degree in Computer Science, Engineering, or a related technical field.
• Hands-on experience with Integration Platform-as-a-Service (iPaaS) providers, such as n8n, Zapier, or similar platforms, and proficiency in API integrations and data flow management.
• Strong experience in telephony integrations, including knowledge of protocols like SIP, PSTN, and other telephony technologies.

Perks & Benefits
• Competitive compensation including equity
• Excellent medical, dental, and vision insurance options
• Flexible time off
• 10 Company holidays + Winter Break and up to 16 weeks of parental leave
• 401K plan
• Quarterly Lifestyle Spend
• Monthly Mobile + Internet Stipend
• Pre-tax Commuter Benefits

Salary Range
The base salary compensation range targeted for this full-time position is $108K - $170K per annum. Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives and equity (in the form of options). This salary range is an estimate, and the actual salary may vary based on the Company’s compensation practices.

Our Commitment to Inclusion and Belonging
Observe.AI is an Equal Employment Opportunity employer that proudly pursues and hires a diverse workforce. Observe.AI does not make hiring or employment decisions on the basis of race, color, religion or religious belief, ethnic or national origin, nationality, sex, gender, gender identity, sexual orientation, disability, age, military or veteran status, or any other basis protected by applicable local, state, or federal laws or prohibited by Company policy. Observe.AI also strives for a healthy and safe workplace and strictly prohibits harassment of any kind. We welcome all people. We celebrate diversity of all kinds and are committed to creating an inclusive culture built on a foundation of respect for all individuals. We seek to hire, develop, and retain talented people from all backgrounds. Individuals from non-traditional backgrounds, historically marginalized or underrepresented groups are strongly encouraged to apply.
If you are ambitious, make an impact wherever you go, and you're ready to shape the future of Observe.AI, we encourage you to apply. For more information, visit www.observe.ai.
Redwood City, CA (Hybrid)

Hardware / Software CoDesign Engineer - 3P

OpenAI
$342,000 – $555,000
United States
Full-time
Remote: No
About the Team
OpenAI’s Hardware organization develops silicon and system-level solutions designed for the unique demands of advanced AI workloads. The team is responsible for building the next generation of AI-native silicon while working closely with software and research partners to co-design hardware tightly integrated with AI models. In addition to delivering production-grade silicon for OpenAI’s supercomputing infrastructure, the team also creates custom design tools and methodologies that accelerate innovation and enable hardware optimized specifically for AI.

About the Role
As an Engineer on our hardware optimization and co-design team, you will co-design future hardware from different vendors for programmability and performance. You will work with our kernel, compiler and machine learning engineers to understand their unique needs related to ML techniques, algorithms, numerical approximations, programming expressivity, and compiler optimizations. You will evangelize these constraints with various vendors to develop and influence future hardware architectures towards efficient training and inference on our models. If you are excited about efficiently distributing a large language model across devices, dealing with and optimizing system-wide/rack-wide networking bottlenecks and eventually tailoring the compute pipe and memory hierarchy of the hardware platform, simulating workloads at different abstractions and working closely with our partners, this is the perfect opportunity!
This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

Key Responsibilities
• Co-design future hardware for programmability and performance with our hardware vendors.
• Assist hardware vendors in developing optimal kernels and add support for them in our compiler.
• Develop performance estimates for critical kernels for different hardware configurations and drive decisions on compute core and memory hierarchy features.
• Build system performance models at different abstraction levels and carry out analysis to drive decisions on scale up, scale out, front end networking.
• Work with machine learning engineers, kernel engineers and compiler developers to understand their vision and needs from high performance accelerators.
• Manage communication and coordination with internal and external partners.
• Influence the roadmap of hardware partners to optimize them for OpenAI’s workloads.
• Evaluate potential partners’ accelerators and platforms.
• As the scope of the role and team grows, understand and influence roadmaps for hardware partners for our datacenter networks, racks, and buildings.

Qualifications
• 4+ years of industry experience, including experience harnessing compute at scale and optimizing ML platform code to run efficiently on target hardware.
• Strong experience in software/hardware co-design.
• Deep understanding of GPU and/or other AI accelerators.
• Experience with CUDA, Triton or a related accelerator programming language.
• Experience driving Machine Learning accuracy with low precision formats.
• Experience with system performance modeling and analysis to optimize ML model deployment.
• Strong coding skills in C/C++ and Python.
• Familiarity with the fundamentals of deep learning computing and chip architecture/microarchitecture.
• Able to actively collaborate with ML engineers, kernel writers, compiler developers, system engineers, chip architects/microarchitects.

Preferred Skills
• PhD in Computer Science and Engineering with a specialization in Computer Architecture, Parallel Computing, Compilers or other Systems.
• Strong understanding of LLMs and challenges related to their training and inference.

About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.
For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement.
Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.
To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
OpenAI Global Applicant Privacy Policy
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Backend Engineer, AI

Bjak
China
Full-time
Remote: No
About the Role
A1 is building a proactive AI system that carries work forward across conversations, tools, and time.
As a Backend Engineer, AI, you own the inference and orchestration layer that powers every AI interaction in the product. Your work sits between models and users, where latency, correctness, reliability, and cost directly impact real-world experience.
You will build and operate production systems that turn model capability into fast, stable, observable APIs used across mobile and desktop clients.

Focus
• Build and operate backend systems that serve AI-powered features in production.
• Design inference pipelines, orchestration layers, and service boundaries around models.
• Own production concerns: monitoring, logging, alerting, and incident response.
• Optimize latency and throughput across inference, caching, batching, and streaming.

Ideal Experiences
• Strong backend engineering fundamentals in production environments.
• Experience running high-throughput, low-latency services.
• Familiarity with AI inference patterns (LLMs, embeddings, multimodal).
• Comfortable debugging distributed systems under load.
• Bias toward shipping and learning from production behavior.

Outcomes
• Backend systems run reliably at scale, handling production AI traffic with low latency and high throughput.
• APIs are stable, clear, and support seamless integration with frontend and ML systems.
• Production incidents are quickly detected, diagnosed, and resolved, minimizing user impact.
• Iterative improvements based on real usage continuously increase system performance and reliability.

Tech Stack
• Python
• Node.js
• PyTorch
• OpenAI / Anthropic / open-source LLMs
• SQL & NoSQL
• Kubernetes
• Docker

How We Work
The best products in the world today were built by small, world-class teams. We are a high-talent-density, hands-on team. We make decisions collectively and move at rapid speed, striking a balance between shipping high-quality work and learning. Joining our team requires the ability to bring structure, exercise judgment, and execute independently. Our goal is to put a truly magical product in the hands of our users.

Interview process
If there appears to be a fit, we'll reach out to schedule 3, but no more than 4, interviews.
Applications are evaluated by our technical team members. Interviews will be conducted via virtual meetings and/or onsite.
We value transparency and efficiency, so expect a prompt decision. If you've demonstrated the exceptional skills and mindset we're looking for, we'll extend an offer to join us. This isn't just a job offer; it's an invitation to be part of a team that's bringing the practical benefits of AI to billions globally.

Backend Engineer, AI

Bjak
United States
Full-time
Remote: No
About the Role
A1 is building a proactive AI system that carries work forward across conversations, tools, and time.
As a Backend Engineer, AI, you own the inference and orchestration layer that powers every AI interaction in the product. Your work sits between models and users, where latency, correctness, reliability, and cost directly impact real-world experience.
You will build and operate production systems that turn model capability into fast, stable, observable APIs used across mobile and desktop clients.

Focus
• Build and operate backend systems that serve AI-powered features in production.
• Design inference pipelines, orchestration layers, and service boundaries around models.
• Own production concerns: monitoring, logging, alerting, and incident response.
• Optimize latency and throughput across inference, caching, batching, and streaming.

Ideal Experiences
• Strong backend engineering fundamentals in production environments.
• Experience running high-throughput, low-latency services.
• Familiarity with AI inference patterns (LLMs, embeddings, multimodal).
• Comfortable debugging distributed systems under load.
• Bias toward shipping and learning from production behavior.

Outcomes
• Backend systems run reliably at scale, handling production AI traffic with low latency and high throughput.
• APIs are stable, clear, and support seamless integration with frontend and ML systems.
• Production incidents are quickly detected, diagnosed, and resolved, minimizing user impact.
• Iterative improvements based on real usage continuously increase system performance and reliability.

Tech Stack
• Python
• Node.js
• PyTorch
• OpenAI / Anthropic / open-source LLMs
• SQL & NoSQL
• Kubernetes
• Docker

How We Work
The best products in the world today were built by small, world-class teams. We are a high-talent-density, hands-on team. We make decisions collectively and move at rapid speed, striking a balance between shipping high-quality work and learning. Joining our team requires the ability to bring structure, exercise judgment, and execute independently. Our goal is to put a truly magical product in the hands of our users.

Interview process
If there appears to be a fit, we'll reach out to schedule 3, but no more than 4, interviews.
Applications are evaluated by our technical team members. Interviews will be conducted via virtual meetings and/or onsite.
We value transparency and efficiency, so expect a prompt decision. If you've demonstrated the exceptional skills and mindset we're looking for, we'll extend an offer to join us. This isn't just a job offer; it's an invitation to be part of a team that's bringing the practical benefits of AI to billions globally.

Applied AI, Forward Deployed Machine Learning Engineer - Montreal

Mistral AI
Canada
Full-time
Remote: No
About Mistral   At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.   We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.   We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.   Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.   About The Job   Mistral AI is seeking a Applied AI Engineer to facilitate the adoption of its products among customers and collaborate with them to address complex technical challenges.   The Applied AI Engineer will be an integral part of our Applied AI Engineering team, which is dedicated to driving the successful deployment of Mistral AI products. They will work hand-in-hand with customers from the pre-sale stage to post-implementation, ensuring our solutions meet and exceed client expectations.    In this role, you’ll manage daily customer relations involving multiple stakeholders (CEO/CTO, data scientists, and software engineers) and function as a key resource in externalising our research in production settings.   What you will do   • You’ll be responsible for onboarding customers on our products and APIs, providing guidance on prompting, evaluation, and fine-tuning, and ensuring the best production integration with back-end and front-end interfaces.  
• You’ll work on state-of-the-art GenAI applications, from consumer products to industrial use cases, driving a crucial technological transformation with our customers.
• You’ll individually help deploy use cases with considerable business impact into production across various industries.
• You’ll work in collaboration with our researchers, other AI engineers, and product engineers on our most complex customer projects, involving complex fine-tuning and state-of-the-art LLM applications, and contribute to our open source codebases for tasks such as inference and fine-tuning.
• You’ll be involved in pre-sales calls to understand potential clients' needs, challenges, and aspirations. You will provide technical guidance on our products and explain Mistral technologies to various stakeholders.
• You’ll collaborate with our product and science teams to continuously improve our product and model capabilities based on customers’ feedback.

About you

• You are fluent in English
• You hold a PhD or master's degree in AI or data science
• You have 2+ years of experience as a technical individual contributor (data scientist or software engineer) on AI-based products
• You have experience in fine-tuning LLMs and tackling advanced RAG or agentic use cases
• You have a deep understanding of the concepts and algorithms underlying machine learning and LLMs
• You're experienced with building and deploying LLMs or NLP applications
• You have proven experience in AI or machine learning product implementation with APIs, back-end and front-end interfaces
• You have strong technical coding skills in Python
• You have experience with deep learning with PyTorch
• You have experience with agent frameworks such as LangChain, and with vector DBs
• You have strong communication skills and can explain complex technical concepts in simple terms to technical and non-technical audiences

Ideally you have:

• Contributed to open-source projects, in particular in the space of LLMs
• Experience as a Customer Engineer, Forward Deployed Engineer, Sales Engineer, Solutions Architect or Technical Product Manager

Benefits

💰 Competitive salary
🚀 Generous equity
🧑‍⚕️ Health: Sun Life
👴🏻 Pension: Match up to 6% of your contributions
🏝️ PTO: 25 days
🚗 Transportation: Public transportation allowance or parking charges reimbursed
🤝 Coaching: We offer BetterUp coaching on a voluntary basis
🏀 Sport: 145 CAD/month reimbursement for gym membership
🥕 Meal stipend: 480 CAD monthly allowance for meals (solution might evolve as we grow bigger)

About The Team

At Mistral AI, we are a tight-knit, nimble team dedicated to bringing our cutting-edge AI technology to the world. Our mission is to make AI ubiquitous and open.

Our team values are reflected in our product values:

- Cool: We have a tongue-in-cheek way of looking at things; it’s hard to describe but you know it when you see it
- Precision: Our designs mirror the rigor and excellence that underpin our technology, reflecting our commitment to quality and reliability
- Human-Centric: We strive to make our technology open, approachable, and accessible
- Captivating: Our designs reflect the magic of our technology and our playful, exploratory approach to innovation
- Ambitious: We push the boundaries of what is possible, reflecting our bold vision for the future

Machine Learning Engineer

Faculty
United Kingdom
Full-time
Remote
false
Why Faculty?

We established Faculty in 2014 because we thought that AI would be the most important technology of our time. Since then, we’ve worked with over 350 global customers to transform their performance through human-centric AI. You can read about our real-world impact here.

We don’t chase hype cycles. We innovate, build and deploy responsible AI which moves the needle - and we know a thing or two about doing it well. We bring an unparalleled depth of technical, product and delivery expertise to our clients who span government, finance, retail, energy, life sciences and defence.

Our business and reputation are growing fast, and we’re always on the lookout for individuals who share our intellectual curiosity and desire to build a positive legacy through technology.

AI is an epoch-defining technology. Join a company where you’ll be empowered to envision its most powerful applications, and to make them happen.

About the team

Our Public Services Business Unit is committed to leveraging AI for the benefit of individual citizens and the public good. From our work informing strategic government decisions, to optimising our NHS, through to reducing bureaucratic backlogs - we know that AI offers opportunities to drive improvements at every level of Government and we are proud to lead on some of the most impactful work happening in the sector.

Because of the nature of the work we do with our Government clients, you may need to be eligible for UK Security Clearance (SC) and willing to work on site with these clients from time to time.

About the role

Join us as a Machine Learning Engineer to deliver bespoke, impactful AI solutions for our diverse clients.

You will be instrumental in bringing machine learning out of the lab and into the real world, contributing to scalable software architecture and defining best practices. Working with clients and cross-functional teams, you'll ensure technical feasibility and timely delivery of high-quality, production-grade ML systems.
What you'll be doing:

• Building and deploying production-grade ML software, tools, and infrastructure.
• Creating reusable, scalable solutions that accelerate the delivery of ML systems.
• Collaborating with engineers, data scientists, and commercial leads to solve critical client challenges.
• Leading technical scoping and architectural decisions to ensure project feasibility and impact.
• Defining and implementing Faculty’s standards for deploying machine learning at scale.
• Acting as a technical advisor to customers and partners, translating complex ML concepts for stakeholders.

Who we're looking for:

• You understand the full machine learning lifecycle and have experience operationalising models built with frameworks like Scikit-learn, TensorFlow, or PyTorch.
• You possess strong Python skills and solid experience in software engineering best practices.
• You bring hands-on experience with cloud platforms and infrastructure (e.g., AWS, Azure, GCP), including architecture and security.
• You've worked with container and orchestration tools such as Docker & Kubernetes to build and manage applications at scale.
• You are comfortable with core ML concepts, including probability, statistics, and common learning techniques.
• You're an excellent communicator, able to guide technical teams and confidently advise non-technical stakeholders.
• You thrive in a fast-paced environment, and enjoy the autonomy to own scope, solve and deliver solutions.

Our Interview Process

• Talent Team Screen (30 minutes)
• Pair Programming Interview (90 minutes)
• System Design Interview (90 minutes)
• Commercial Interview (60 minutes)

Our Recruitment Ethos

We aim to grow the best team - not the most similar one. We know that diversity of individuals fosters diversity of thought, and that strengthens our principle of seeking truth. And we know from experience that diverse teams deliver better work, relevant to the world in which we live.
We’re united by a deep intellectual curiosity and desire to use our abilities for measurable positive impact. We strongly encourage applications from people of all backgrounds, ethnicities, genders, religions and sexual orientations.

Some of our standout benefits:

• Unlimited Annual Leave Policy
• Private healthcare and dental
• Enhanced parental leave
• Family-Friendly Flexibility & Flexible working
• Sanctus Coaching
• Hybrid Working

If you don’t feel you meet all the requirements, but are excited by the role and know you bring some key strengths, please don't hesitate in applying as you might be right for this role, or other roles. We are open to conversations about part-time hours.

Full Stack Software Engineer

Faculty
United Kingdom
Full-time
Remote
false
Why Faculty?

We established Faculty in 2014 because we thought that AI would be the most important technology of our time. Since then, we’ve worked with over 350 global customers to transform their performance through human-centric AI. You can read about our real-world impact here.

We don’t chase hype cycles. We innovate, build and deploy responsible AI which moves the needle - and we know a thing or two about doing it well. We bring an unparalleled depth of technical, product and delivery expertise to our clients who span government, finance, retail, energy, life sciences and defence.

Our business and reputation are growing fast, and we’re always on the lookout for individuals who share our intellectual curiosity and desire to build a positive legacy through technology.

AI is an epoch-defining technology. Join a company where you’ll be empowered to envision its most powerful applications, and to make them happen.

About the team

Our Public Services Business Unit is committed to leveraging AI for the benefit of individual citizens and the public good. From our work informing strategic government decisions, to optimising our NHS, through to reducing bureaucratic backlogs - we know that AI offers opportunities to drive improvements at every level of Government and we are proud to lead on some of the most impactful work happening in the sector.
Because of the nature of the work we do with our Government clients, you may need to be eligible for UK Security Clearance (SC) and willing to work on site with these clients from time to time.

About the role

As a Full Stack Software Engineer, you will collaborate with cross-functional teams to build and scale AI-powered systems that solve unique challenges for global clients.

This is an opportunity to take ownership of the entire stack, turning sophisticated machine learning outputs into high-impact, production-ready solutions that drive real-world transformation.

What you'll be doing:

• Working in cross-functional teams, alongside Data Scientists and Product Managers, to deploy technically sophisticated, high-impact software.
• Building reliable and scalable frontend and backend architectures to enable the seamless delivery of advanced AI systems.
• Building data pipelines and cloud infrastructure as part of wider AI systems.
• Providing deep technical expertise to customers, helping them navigate complex requirements.

Who we're looking for:

• You possess solid backend development skills in Python frameworks (such as FastAPI) and are comfortable managing the full application lifecycle.
• You have full-stack experience and exposure to TypeScript and modern frontend frameworks like React or Vue.
• You have experience deploying software within cloud environments like AWS, Azure, or GCP, ideally using containerisation tools like Docker.
• You demonstrate a scientific and pragmatic mindset, testing assumptions and balancing big-picture vision with the technical details required for execution.
• You know how to maintain a balance between rapid prototyping and long-term maintainability.
• You thrive in autonomous, fast-paced environments where you can take full ownership of problems.
• You are a compelling communicator who enjoys working in a team-oriented culture and has a relentless drive to learn and apply novel technologies.

Our Interview Process

• Talent Team Screen (30 minutes)
• Pair Programming Interview (90 minutes)
• System Design Interview (90 minutes)
• Commercial Interview (60 minutes)

Our Recruitment Ethos

We aim to grow the best team - not the most similar one. We know that diversity of individuals fosters diversity of thought, and that strengthens our principle of seeking truth. And we know from experience that diverse teams deliver better work, relevant to the world in which we live. We’re united by a deep intellectual curiosity and desire to use our abilities for measurable positive impact. We strongly encourage applications from people of all backgrounds, ethnicities, genders, religions and sexual orientations.

Some of our standout benefits:

• Unlimited Annual Leave Policy
• Private healthcare and dental
• Enhanced parental leave
• Family-Friendly Flexibility & Flexible working
• Sanctus Coaching
• Hybrid Working

If you don’t feel you meet all the requirements, but are excited by the role and know you bring some key strengths, please don't hesitate in applying as you might be right for this role, or other roles. We are open to conversations about part-time hours.

Research Scientist, AI Controls and Monitoring

Scale AI
United States
Full-time
Remote
false
Role Overview

Scale’s rapidly growing Global Public Sector team is focused on using AI to address critical challenges facing the public sector around the world. Our core work consists of:

• Creating custom AI applications that will impact millions of citizens
• Generating high-quality training data for national LLMs
• Upskilling and advisory services to spread the impact of AI

As a Production AI Ops Lead, you will design and develop the production lifecycle of full-stack AI applications, while supporting end-to-end system reliability, real-time inference observability, sovereign data orchestration, high-security software integration, and the resilient cloud infrastructure required for our international government partners.

At Scale, we’re not just building AI solutions; we’re enabling the public sector to transform their operations and better serve citizens through cutting-edge technology. If you’re ready to shape the future of AI in the public sector and be a founding member of our team, we’d love to hear from you.

You will:

• Own the production outcome: Take full accountability for the long-term performance and reliability of AI use cases deployed across international government agencies.
• Ensure Full-Stack integrity: Oversee the end-to-end health of the platform, ensuring seamless integration between the AI core and all full-stack components, from APIs to UI, to maintain a responsive and production-ready environment.
• Scale the feedback loop: Build automated systems to monitor model performance and data drift across geographically dispersed environments, ensuring the right levels of reliability.
• Navigate global compliance: Manage the technical lifecycle within diverse regulatory frameworks.
• Incident command: Lead the response for production issues in mission-critical environments, ensuring rapid resolution and building the guardrails to prevent them from happening again.
• Bridge the gap: Translate deep technical performance metrics into clear insights for senior international government officials.
• Drive product evolution: Partner with our Engineering and ML teams to ensure the lessons learned in the field directly influence the technical architecture and decisions of future use cases.

Ideally, you have:

• Experience: 6+ years in a high-impact technical role (SRE, FDE or MLOps) with experience in the public sector.
• Global perspective: Familiarity with international government security standards and the complexities of deploying sovereign AI.
• System architecture proficiency: Proven experience maintaining production-grade applications with a deep understanding of the full request lifecycle, connecting frontend/API layers to the backend and AI core.
• Modern AI Stack expertise: Proficiency in coding and the modern AI infrastructure, including Kubernetes, vector databases, agentic development, and LLM observability tools.
• Ownership: You treat every production deployment as your own. You race toward solving hard problems before the customer even sees them.
• Reliability: You understand that in the public sector, a model failure may be a risk to public safety or privacy.
• Customer communication: The ability to explain to a high-ranking official why the performance of the system has degraded and how we are fixing it.

PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.

About Us:

At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Cisco, DLA Piper, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications.

We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status.

We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at accommodations@scale.com. Please see the United States Department of Labor's Know Your Rights poster for additional information.

We comply with the United States Department of Labor's Pay Transparency provision.

PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.