Research Intern, Inference (Fall 2026)
As an AI Infrastructure Engineer at Together, the responsibilities include participating in on-call rotation to respond to production incidents, building and running infrastructure using Ansible, Terraform, and Kubernetes to support scaling to a large number of concurrent users, building monitoring systems to ensure high-quality service, designing and implementing operational processes such as deployments and upgrades, debugging production issues across all services and stack levels, identifying improvements for product architecture in terms of reliability, performance, and availability, and planning the growth of Together AI's infrastructure.
Sr. Manager, Integrated Campaigns and ABX
Build and deploy AI Agents including prompt design, workflow configuration, integrations, telephony setup, and evaluation frameworks. Act as the primary technical partner for customers by leading demos, communicating progress, gathering feedback, and guiding solutions from concept to production. Configure and connect systems using APIs, handling authentication, data mapping, error handling, and integrations with CRMs, knowledge bases, and other enterprise tools. Set up telephony systems including SIP/CCaaS/PSTN routing, pass metadata, configure fallbacks, and troubleshoot call quality. Write and refine prompts for LLM-driven agents, monitor performance, and ensure agents meet automation and containment targets. Translate customer requirements into actionable solutions and work consultatively to unblock challenges in security, connectivity, or knowledge ingestion. Collaborate with product and engineering teams to address platform gaps and resolve technical issues, independently driving leading client implementations.
Senior Backend Engineer- AI Agents (Remote)
Design and build scalable backend systems powering AI Agents that operate in real-time enterprise environments. Develop agent orchestration frameworks involving multi-step reasoning, tool usage, and decisioning workflows. Build systems for agent memory, context management, and state persistence across interactions. Architect low-latency inference pipelines integrating Large Language Models, Small Language Models, and external tools/services. Implement evaluation frameworks to measure agent performance, accuracy, and reliability. Enable continuous improvement loops for AI agents in production including feedback, retraining, and deployment. Design and manage event-driven, asynchronous workflows for complex agent tasks. Optimize systems for high throughput, low latency, and cost-efficient inference at scale. Build and maintain robust APIs and service layers (REST/gRPC) for agent capabilities. Partner closely with Applied AI/ML teams to productionize models and agent behaviors. Collaborate with Product and Solutions teams to translate real customer workflows into agentic systems. Drive best practices in observability, monitoring, safety, and guardrails for AI systems. Contribute to architecture decisions for scaling multi-tenant, enterprise-grade AI platforms.
Member of Technical Staff (Machine Learning Engineer)
Translate cutting-edge research into production-ready machine learning systems. Design, build, and deploy end-to-end ML models and pipelines. Develop and optimize models for image and video processing. Own the full ML lifecycle including experimentation, training/fine-tuning, evaluation, and deployment. Rapidly prototype using open-source models and adapt them for product needs. Conduct experiments, analyze results, and iterate to improve performance. Collaborate with researchers and cross-functional teams (product, engineering, design) to deliver ML solutions at scale. Participate with advancements in machine learning and apply them to continuously improve products.
Warehouse Supervisor (Temporary)
Utilize proprietary software to provide accurate input and labels for healthcare and administration projects, ensuring high-quality data for AI model training. Deliver curated, high-quality data for scenarios involving patient care coordination, medical billing, administrative workflows, and healthcare operations. Collaborate with technical staff to support the training of new AI tasks and contribute to the development of innovative technologies. Assist in designing and improving efficient annotation tools tailored for healthcare and administration data. Select and analyze complex problems in healthcare and administration fields aligned with your expertise to enhance AI model performance. Interpret, analyze, and execute tasks based on evolving instructions, maintaining precision and adaptability.
Deployment Engineer
Translate business requirements into AI/ML model requirements. Prepare data to train and evaluate AI/ML/DL models. Build AI/ML/DL models using state-of-the-art algorithms, especially transformers, sometimes leveraging existing algorithms from research. Test and evaluate models, benchmark quality, and publish models, datasets, and evaluations. Deploy models in production by containerizing them. Work with customers and internal employees to refine model quality. Establish continuous learning pipelines for models with online or transfer learning. Build and deploy containerized applications on cloud or on-premise environments.
Software Engineer, AI Product (Canada)
As a Senior Applied AI Engineer at Vanta, you will work cross-functionally to design and implement AI-powered features that deliver customer value and integrate large language models (LLMs) with Vanta's existing products and systems. You will collaborate with product engineers across Vanta to understand how AI systems can accelerate product adoption, instrument evaluations, guardrails, and monitoring, and review customer usage to continually improve quality. Additionally, you will collaborate with AI Platform engineers on foundational AI systems and tooling to accelerate product teams, make pragmatic tradeoffs considering business priorities, user experience, and sustainable technical foundation, mentor engineers, champion good technical and product instincts, and model a collaborative, high-ownership engineering culture.
Medical Review Nurse - Clinical Validation
Design agent systems from first principles including deciding the loop, tools, context strategy, evaluation harness, and system topology. Engineer the context by focusing on prompt construction, context windows, tool surfaces, structured outputs, and citation grounding. Drive evaluation rigor by building evaluations prior to agent construction, diagnosing failures, fixing root causes, and proving improvements through metrics. Use AI tooling such as Claude Code and Codex extensively to plan, scaffold, refactor, and debug work. Become a domain expert in healthcare claims, coding guidelines, and medical records as an integral part of the job.
Engineering Manager, AI
As an Engineering Manager at Vanta, you will build and scale a high-performing team by hiring strategically to fill skill gaps as the team grows. You will coach, mentor, and create an environment that enables your team to do their best work and deliver for the business. You will set direction and guide technical strategy for AI agent and downmarket products, ensuring long-term value aligned with Vanta's business priorities. You will partner closely with product, design, and AI platform teams to ship customer-facing AI features that automate audit work while maintaining human-in-the-loop controls. Additionally, you will champion best practices for applied AI, including prompt engineering, retrieval-augmented generation (RAG), agentic frameworks, and quality evaluation. You will also navigate rapid change and ambiguity with adaptability, iterating quickly on roadmaps as the team's charter and direction evolve.
Software Engineer, ML Data Infrastructure
The Software Engineer, ML Data Infrastructure will collaborate with engineers to build advanced AI design experiences, tackle complex technical challenges including scaling distributed systems and enabling generative media experiences, build robust data infrastructure at petabyte scale ensuring reliability and performance across multi-modal training pipelines, optimize data processing workflows for high throughput involving distributed systems, TPU infrastructure, and large-scale storage, and partner with research scientists to understand data requirements and translate them into production-grade systems to accelerate model development cycles.
Access all 4,256 remote & onsite AI jobs.
Frequently Asked Questions
Need help with something? Here are our most frequently asked questions.
Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.
