Automotive Engineering & Python Expert - Freelance AI Trainer
Contributors may design graduate- and industry-level automotive engineering problems grounded in real practice; evaluate AI-generated solutions for correctness, assumptions, and engineering logic; validate analytical or numerical results using Python (NumPy, SciPy, Pandas); improve AI reasoning to align with first principles and accepted engineering standards; and apply structured scoring criteria to assess multi-step problem solving.
Machine Learning Developer (Freelance)
Design original computational STEM problems that simulate real scientific workflows. Create problems that require Python programming to solve. Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks). Develop problems requiring non-trivial reasoning chains and creative problem-solving approaches. Verify solutions using Python with standard libraries (numpy, pandas, scipy, sklearn). Document problem statements clearly and provide verified correct answers.
Data Scientist (Python & SQL) - Freelance AI Trainer
Design original computational data science problems simulating real-world analytical workflows across industries such as telecom, finance, government, e-commerce, and healthcare. Create problems requiring Python programming to solve using libraries like pandas, numpy, scipy, sklearn, statsmodels, matplotlib, and seaborn. Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes. Develop problems requiring non-trivial reasoning chains in data processing, statistical analysis, feature engineering, predictive modeling, and insight extraction. Create deterministic problems with reproducible answers, avoiding stochastic elements or using fixed random seeds. Base problems on real business challenges including customer analytics, risk assessment, fraud detection, forecasting, optimization, and operational efficiency. Design end-to-end problems covering the complete data science pipeline from data ingestion to deployment considerations. Incorporate big data processing scenarios requiring scalable computational approaches. Verify solutions using Python with standard data science libraries and statistical methods. Document problem statements clearly with realistic business contexts and provide verified correct answers.
Freelance AI Evaluation Engineer (Python/Full-Stack)
Create challenging coding test cases to push AI coding systems to their limits by reviewing and refining realistic coding tasks based on provided production codebases with realistic scope, requirements, and information sources. Write comprehensive functional tests that validate actual end-to-end behavior and edge-cases. Craft challenges that are fair but hard, where the AI has all the context it needs, requiring complex reasoning with information scattered across files and external sources. Analyze AI failures to understand the model's struggles and strengths. Iterate based on feedback from expert QA reviewers who score work on seven quality criteria.
Mechanical Engineer & Python Expert - Freelance AI Trainer
Contributors design graduate- and industry-level mechanical engineering problems grounded in real practice, evaluate AI-generated solutions for correctness, assumptions, and engineering logic, validate analytical or numerical results using Python (NumPy, SciPy, Pandas), improve AI reasoning to align with first principles and accepted engineering standards, and apply structured scoring criteria to assess multi-step problem solving.
Electrical Engineer & Python Expert - Freelance AI Trainer
Contributors may design rigorous electrical engineering problems reflecting professional practice, evaluate AI solutions for correctness, assumptions, and constraints, validate calculations or simulations using Python (NumPy, Pandas, SciPy), improve AI reasoning to align with industry-standard logic, and apply structured scoring criteria to multi-step problems.
Senior Product Designer, Mobile
Own the observability and lifecycle management of AI features across the organization. Build tools and infrastructure to enable teams to develop, monitor, and optimize LLM-powered features. Design and implement closed-loop evaluation pipelines that automatically validate prompt changes. Develop comprehensive metrics and dashboards to track LLM usage, including cost per feature, token patterns, and latency. Create systems that tie user feedback to specific prompts and LLM calls. Establish best practices and processes for the full lifecycle of prompts, including development, testing, deployment, and monitoring. Collaborate with engineering teams across the organization to ensure they have the tools and visibility needed to build high-quality AI features.
Senior ML Operations (MLOps) Engineer
The Senior ML Operations (MLOps) Engineer at Eight Sleep is responsible for introducing and implementing cutting-edge ML technologies, owning the design and operation of robust ML infrastructure including scalable data, model, and deployment pipelines to ensure reliable model delivery to production. They collaborate cross-functionally with R&D, firmware, data, and backend teams to ensure reliable and scalable ML inference on Pods. They optimize ML systems for cost, scalability, and performance across training and inference, and develop tooling, microservices, and frameworks to streamline data processing, experimentation, and deployment. The role requires effective communication in a remote work environment.
Manual Quality Assurance Engineer, Web Core Product
Work alongside machine learning researchers, engineers, and product managers to bring AI Voices to customers for diverse use cases. Deploy and operate the core ML inference workloads for the AI Voices serving pipeline. Introduce new techniques, tools, and architecture that improve performance, latency, throughput, and efficiency of deployed models. Build tools to identify bottlenecks and sources of instability and design and implement solutions to address the highest priority issues.
Safety Engineer
The AI Safety Engineer is responsible for designing and building scalable backend infrastructure for content moderation, abuse detection, and agents guardrails by deploying AI/ML models into production systems. They will architect robust APIs, data pipelines, and service architectures to support real-time and batch moderation workflows. The role includes implementing comprehensive monitoring, alerting, and observability systems, establishing SLIs, SLOs, and performance benchmarks. The engineer will collaborate with ML engineers to translate research models into production-ready systems and integrate them across the product suite. Additionally, they will drive technical decisions and contribute to the vision for the safety roadmap to build next-generation platform guardrails for scale and precision.
Access all 4,256 remote & onsite AI jobs.
Frequently Asked Questions
Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.
