TensorFlow AI Jobs

Discover the latest remote and onsite TensorFlow AI roles across top active AI companies. Updated hourly.

Check out 230 new TensorFlow AI roles opportunities posted on AI Chopping Block

AI Inference Engineer - Model Optimization & Deployment

New
Top rated
Zoox
Full-time
Full-time
Posted

As a Model Optimization & Deployment Engineer, you will optimize large-scale models (LLMs, VLMs) using advanced quantization techniques such as PTQ, QAT, mixed-precision inference workflows, and parameter-efficient fine-tuning methods like LoRA and QLoRA. You will architect and implement model conversion and compilation pipelines using TensorRT and TensorRT-LLM for deployment on edge devices. The role involves performing rigorous parity checking, accuracy recovery, and latency benchmarking between PyTorch frameworks and compiled edge binaries. You are responsible for writing and optimizing custom CUDA kernels and TensorRT Plugins to maximize memory bandwidth and minimize latency on AI accelerators. Furthermore, you will write production-level, highly concurrent, memory-safe C++ and Python code to ensure real-time, deterministic execution of inference on vehicle System on Chips (SOCs).

$242,000 – $290,000
Undisclosed
YEAR

(USD)

Foster City, United States
Maybe global
Onsite
Python
C++
PyTorch
TensorFlow
CUDA

Deployment Lead

New
Top rated
PhysicsX
Full-time
Full-time
Posted

The Deployment Lead will work closely with Simulation Engineers, Machine Learning Engineers, and customers to understand and define engineering and physics challenges, providing technical leadership to their team. Responsibilities include leading pre-processing and analysis of complex data for predictive modelling, establishing best practices, architecting and developing innovative deep learning models combined with optimisation methods, taking responsibility for the quality and impact of their work and their team's work, designing and testing robust, scalable data pipelines for production environments, leading cross-functional collaboration for seamless integration of data science models with simulations, driving internal R&D and product development to refine models and identify new applications, mentoring junior team members, leading communication and presentations to technical teams and customers, onboarding users and co-developing solutions with customers. The role involves representing the company as a technical authority at customer sites internationally, collaborating on-site to build solutions, influencing technical direction, shaping future solutions and products, and developing leadership skills.

Undisclosed

()

London, United Kingdom
Maybe global
Hybrid
Python
NumPy
Pandas
TensorFlow
PyTorch

Senior Platform/DevOps Engineer (Kubernetes-Linux)

New
Top rated
Armada
Full-time
Full-time
Posted

Translate business requirements into requirements for AI/ML models; prepare data to train and evaluate AI/ML/DL models; build AI/ML/DL models by applying state-of-the-art algorithms, especially transformers; leverage existing algorithms from academic or industrial research when applicable; test, evaluate, and benchmark AI/ML/DL models and publish the models, data sets, and evaluations; deploy models in production by containerizing them; work with customers and internal employees to refine model quality; establish continuous learning pipelines for models using online or transfer learning; build and deploy containerized applications on cloud or on-premise environments.

$154,560 – $193,200
Undisclosed
YEAR

(USD)

Bellevue, United States
Maybe global
Onsite
Python
Java
C++
Docker
Kubernetes

Machine Learning PhDs - AI Trainer

New
Top rated
Handshake
Contractor
Full-time
Posted

Use machine learning expertise to create domain-relevant questions and review AI-generated responses for accuracy, rigor, and relevance to real-world physics research and practice.

$75 – $75 / hour
Undisclosed
HOUR

(USD)

United States
Maybe global
Remote
Machine Learning
Model Evaluation
Python
TensorFlow
PyTorch

Real Estate, Workplace Programs and User Experience Lead

New
Top rated
Intrinsic
Full-time
Full-time
Posted

Lead the research and development of novel deep learning algorithms that enable robots to perform complex, contact-rich manipulation tasks. Explore the intersection of computer vision and robotic control, designing systems that allow robots to perceive and interact with objects in dynamic environments. Create models that integrate visual data to guide physical manipulation, enabling sophisticated handling of diverse items. Collaborate with a multidisciplinary team of engineers and researchers to translate concepts into robust capabilities deployable on physical hardware for industrial applications. Research and develop deep learning architectures for visual perception and sensorimotor control in contact-rich scenarios. Design algorithms for high precision manipulation of complex or deformable objects. Collaborate with software engineers to optimize and deploy research prototypes on robotic hardware. Evaluate model performance in simulation and real-world environments to ensure robustness and reliability. Identify opportunities to apply advancements in computer vision and robot learning to practical industrial problems. Mentor junior researchers and contribute to the technical direction of the manipulation research roadmap.

Undisclosed

()

Mountain View, United States
Maybe global
Onsite
Python
C++
PyTorch
TensorFlow
JAX

Forward Deployed AI Engineer

New
Top rated
Talent Labs
Full-time
Full-time
Posted

Drive the end-to-end technical deployment of Latent Labs models into customer environments, ensuring seamless integration with existing scientific and IT infrastructure. Design and build production-grade API integrations, data pipelines and model-serving infrastructure tailored to each customer’s requirements. Work on-site or embedded with pharma and biotech partners to scope technical requirements, troubleshoot issues and deliver solutions. Ensure deployments meet enterprise standards for security, performance and reliability. Serve as the technical point of contact for assigned customers, building trusted relationships with their scientific and engineering teams, including spending time working on-site at international partner locations as needed. Gather and synthesise customer feedback, translating it into actionable insights for product, research and platform teams. Collaborate with internal teams to shape the product roadmap based on real-world deployment learnings. Create technical documentation, integration guides and best-practice resources for customers. Stay on top of the latest developments in ML infrastructure, model serving and cloud-native tooling. Gain a strong working understanding of protein and cell biology as it relates to the product. Participate in knowledge sharing, including organizing and presenting at internal reading groups.

Undisclosed

()

San Francisco, United States
Maybe global
Hybrid
Python
AWS
Docker
Kubernetes
CI/CD

Senior Machine Learning Scientist

New
Top rated
Chattermill
Full-time
Full-time
Posted

The Senior Machine Learning Scientist will train, evaluate, and iterate on ML models and agentic systems for customer feedback, including owning custom fine-tuning pipelines. They will run experiments end-to-end, track results rigorously, and make recommendations on what to ship, iterate, or retire. The role involves building and maintaining LLM-powered features such as retrieval pipelines, reranking systems, insight agents, data mining agents, and automated taxonomy generation. The scientist will design and run robust evaluation frameworks including building test sets, defining metrics, evaluating non-deterministic systems, handling class imbalance, and automating checkpoint comparisons. They will improve and extend semantic search and retrieval methods, write production-quality code, and collaborate closely with Engineering on productionisation, model serving, data pipelines, and monitoring. The role includes working with Product and Commercial teams to translate business needs into practical ML solutions and supporting client evaluations and accuracy benchmarking. Additionally, the scientist will mentor team members, review code and research, and integrate relevant advances from literature into the product.

Undisclosed

()

United Kingdom
Maybe global
Remote
Python
PyTorch
TensorFlow
Transformers
Model Evaluation

Director, Data Center Operations

New
Top rated
Together AI
Full-time
Full-time
Posted

The responsibilities include advancing inference efficiency end-to-end by designing and prototyping algorithms, architectures, and scheduling strategies for low-latency, high-throughput inference. Implementing and maintaining changes in high-performance inference engines, including kernel backends and speculative decoding, profiling and optimizing performance across GPU, networking, and memory layers to improve latency, throughput, and cost. Unifying inference with RL/post-training by designing and operating RL and post-training pipelines, making RL and post-training workloads more efficient with inference-aware training loops, and using these pipelines to train, evaluate, and iterate on frontier models. Co-designing algorithms and infrastructure so that objectives, rollout collection, and evaluation are tightly coupled to efficient inference, identifying bottlenecks across the training engine, inference engine, data pipeline, and user-facing layers. Running ablations and scale-up experiments to understand trade-offs between model quality, latency, throughput, and cost, and feeding these insights back into model, RL, and system design. Owning critical systems at production scale by profiling, debugging, and optimizing inference and post-training services under real production workloads, driving roadmap items requiring engine modification, and establishing metrics, benchmarks, and experimentation frameworks to validate improvements rigorously. Providing technical leadership by setting technical direction for cross-team efforts at the intersection of inference, RL, and post-training, and mentoring other engineers and researchers on full-stack ML systems work and performance engineering.

$200,000 – $280,000
Undisclosed
YEAR

(USD)

San Francisco
Maybe global
Onsite
Python
PyTorch
TensorFlow
MLOps
Model Evaluation

Regional Sales Lead, Singapore

New
Top rated
Tenstorrent
Full-time
Full-time
Posted

Lead and contribute to cross-functional efforts solving complex physical design challenges across IPs, projects, and advanced technology nodes. Develop and enhance RTL-to-GDS methodologies, including floorplanning, synthesis, placement and routing (P&R), static timing analysis (STA), signoff, and assembly. Architect and deploy AI/ML-driven solutions in production flows to improve engineering efficiency, turnaround time, and quality of results (QoR). Optimize EDA tools and custom CAD flows using data-driven and machine learning-based techniques, working closely with internal teams such as verification, extraction, timing, Design for Test (DFT), and electronic design automation (EDA) vendors.

$100,000 – $500,000
Undisclosed
YEAR

(USD)

Santa Clara or Austin or Fort Collins, United States
Maybe global
Remote
Python
PyTorch
TensorFlow
MLOps
Docker

Head of ISV Partnerships, Experience GTM

New
Top rated
Tenstorrent
Full-time
Full-time
Posted

Lead and contribute to cross-functional efforts solving complex physical design challenges across IPs, projects, and advanced technology nodes. Develop and enhance RTL-to-GDS methodologies, including floorplanning, synthesis, placement and routing (P&R), static timing analysis (STA), signoff, and assembly. Architect and deploy AI/ML-driven solutions in production physical design flows to improve engineering efficiency, turnaround time, and quality of results (QoR). Optimize EDA tools and custom CAD flows using data-driven and ML-based techniques, collaborating closely with verification, extraction, timing, design-for-test (DFT), and EDA vendor teams.

$100,000 – $500,000
Undisclosed
YEAR

(USD)

Santa Clara or Austin or Fort Collins, United States
Maybe global
Hybrid
Python
PyTorch
TensorFlow
Data Pipelines
MLOps

Want to see more AI Egnineer jobs?

View all jobs

Access all 4,256 remote & onsite AI jobs.

Join our private AI community to unlock full job access, and connect with founders, hiring managers, and top AI professionals.
(Yes, it’s still free—your best contributions are the price of admission.)

Frequently Asked Questions

Need help with something? Here are our most frequently asked questions.

Question text goes here

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.

[{"question":"What are TensorFlow AI jobs?","answer":"Jobs that involve building, training, and deploying machine learning models using the TensorFlow framework. These positions focus on developing solutions for image recognition, natural language processing, computational graphs, and deep learning neural networks. Professionals in these roles create AI applications for mobile devices, web platforms, and cloud services across various industries."},{"question":"What roles commonly require TensorFlow skills?","answer":"Software developers implementing machine learning for mobile, web, and cloud applications. Machine learning developers working on natural language processing and computer vision systems. AI engineers building convolutional neural networks. Developers creating consumer products with AI capabilities. Backend engineers developing production ML pipelines and services that leverage deep learning models."},{"question":"What skills are typically required alongside TensorFlow?","answer":"Python or C++ programming proficiency is essential. Knowledge of neural networks, data preprocessing, and model training workflows is required. Experience with Keras, TensorBoard, and TFX strengthens candidacy. Familiarity with data structures, estimators, and inference processes is valuable. Skills in handling diverse datasets and understanding computational graphs are frequently requested by employers."},{"question":"What experience level do TensorFlow AI jobs usually require?","answer":"Experience requirements vary widely based on the role. Entry-level positions often require understanding of machine learning fundamentals and basic TensorFlow implementation. Mid-level roles typically seek 2-3 years of hands-on experience building and deploying models. Senior positions generally demand deep expertise in production ML pipelines, distributed training, and optimization techniques across platforms."},{"question":"What is the salary range for TensorFlow AI jobs?","answer":"Salaries for AI jobs utilizing this framework vary based on location, experience, industry, and specific role. Machine learning engineers and AI developers command competitive compensation. Higher salaries typically correlate with expertise in production deployment, cross-platform implementation, and specialized applications like computer vision or NLP. The growing demand for these skills continues to drive favorable compensation."},{"question":"Are TensorFlow AI jobs in demand?","answer":"Yes, these jobs are in high demand across multiple industries including information technology, cybersecurity, e-commerce, social media, and healthcare. Major companies like Coca-Cola use this technology for applications such as product recognition. The demand is particularly strong for professionals who can deploy scalable production models in real-world applications across mobile, web, and cloud platforms."},{"question":"What is the difference between TensorFlow and PyTorch in AI roles?","answer":"PyTorch offers a more intuitive, user-friendly experience ideal for beginners and research, while TensorFlow emphasizes production readiness and deployment at scale. TensorFlow excels in cross-platform compatibility with dedicated tools for mobile (TensorFlow Lite) and web (TensorFlow.js). PyTorch provides a more dynamic computational approach, whereas TensorFlow's structured graph execution supports enterprise-level production systems and distributed training."}]