Full Stack Engineer
Build and maintain features for the web-based property management platform using TypeScript, React, Node.js, PostgreSQL, and AWS. Contribute to a monorepo architecture, working within two-week sprint cycles to deliver high-quality code. Implement integrations including DocuSign, Plaid, Stripe, and ownership group payout systems. Optimize platform performance and user experience by replacing legacy systems. Build and integrate AI agents using Claude and other AI APIs to automate organizational processes, developing API integrations and custom agents. Collaborate with the CEO on prioritizing automation opportunities. Take ownership of tasks, independently research and implement solutions to challenges, proactively identify and implement improvements, and contribute ideas to platform architecture and development priorities.
Senior Software Engineer, Agents
Design and build AI agents that outperform human agents in managing complex customer interactions and driving customer retention. Identify cross-customer trends that guide the evolution of Decagon’s agent building platform and research efforts. Experiment with and run evaluations on the latest text and voice models, then integrate them at scale with large enterprise-grade customers.
Senior Software Engineer, Agents
Design and build AI agents that outperform human agents in managing complex customer interactions and driving customer retention. Identify cross-customer trends that guide the evolution of Decagon’s agent building platform and research efforts. Experiment with and run evaluations on the latest text and voice models, then integrate them at scale with large enterprise-grade customers. Have complete ownership and autonomy in building and shipping best-in-class AI agents, from initial implementation through continuous iteration, working directly with leaders across industries like finance, healthcare, and hospitality to solve their users’ needs with reliable and intuitive AI agents. Dive deep into complex system challenges and build elegant solutions that scale to millions of users.
Product Engineer
As a Product Engineer, you will dream up, build, and ship LM Studio features to millions of users worldwide at a fast pace. Your work will intermingle UI development with systems engineering, design, and applied AI/agentic engineering. You are expected to have a holistic understanding of software systems and the ability to work across the stack.
Freelance AI Evaluation Engineer (Python/Full-Stack)
Create challenging coding test cases that push AI coding systems to their limits. Review and refine realistic coding tasks based on provided production codebases with realistic scope, requirements, and information sources. Write comprehensive functional tests that validate actual end-to-end behavior and edge-cases, not just superficial checks. Craft fair but hard challenges where the AI has all the context it needs but must work for it, involving information scattered across files and external sources and requiring complex reasoning. Analyze AI failures to understand what the model struggles with versus what it masters. Iterate based on feedback from expert QA reviewers who score work on seven quality criteria.
Freelance AI Evaluation Engineer (Python/Full-Stack)
Create challenging coding test cases that push AI coding systems to their limits by reviewing and refining realistic coding tasks based on provided production codebases with realistic scope, requirements, and information sources. Write comprehensive functional tests that validate actual end-to-end behavior and edge-cases, not just superficial checks. Craft "fair but hard" challenges where the AI has all the context it needs but must work for it, involving information scattered across files and external sources and requiring complex reasoning. Analyze AI failures to understand areas where the model struggles versus what it masters. Iterate based on feedback from expert QA reviewers who score the work on seven quality criteria.
Freelance AI Evaluation Engineer (Python/Full-Stack)
Create challenging coding test cases that push AI coding systems to their limits by reviewing and refining realistic coding tasks based on provided production codebases with realistic scope, requirements, and information sources. Write comprehensive functional tests that validate actual end-to-end behavior and edge cases, not just superficial checks. Craft "fair but hard" challenges where the AI has all the context it needs but must work for it, involving information scattered across files and external sources and complex reasoning. Analyze AI failures to understand what the model struggles with versus what it masters. Iterate based on feedback from expert QA reviewers who score work on seven quality criteria.
Freelance AI Evaluation Engineer (Python/Full-Stack)
Create challenging coding test cases to push AI coding systems to their limits by reviewing and refining realistic coding tasks based on provided production codebases with realistic scope, requirements, and information sources. Write comprehensive functional tests that validate actual end-to-end behavior and edge-cases. Craft challenges that are fair but hard, where the AI has all the context it needs, requiring complex reasoning with information scattered across files and external sources. Analyze AI failures to understand the model's struggles and strengths. Iterate based on feedback from expert QA reviewers who score work on seven quality criteria.
Freelance AI Evaluation Engineer (Python/Full-Stack)
Create challenging coding test cases that push AI coding systems to their limits by reviewing and refining realistic coding tasks based on provided production codebases. Write comprehensive functional tests that validate actual end-to-end behavior and edge cases, craft fair but hard challenges requiring complex reasoning and scattered information, analyze AI failures to understand model strengths and weaknesses, and iterate based on feedback from expert QA reviewers who score work on seven quality criteria.
Freelance AI Evaluation Engineer (Python/Full-Stack)
You will create challenging coding test cases to push AI coding systems to their limits by reviewing and refining realistic coding tasks based on provided production codebases with realistic scope, requirements, and information sources. You will write comprehensive functional tests that validate actual end-to-end behavior and edge cases, not just superficial checks. You are to craft "fair but hard" challenges where the AI has all the necessary context but must work through scattered information and complex reasoning. Additionally, you will analyze AI failures to understand what the model struggles with versus what it masters, and iterate your work based on feedback from expert QA reviewers who score your work on seven quality criteria.
Access all 4,256 remote & onsite AI jobs.
Frequently Asked Questions
Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.
