    Applied AI Inference - Forward Deployed Engineer

    Baseten

    US Remote

    About the role

    Baseten is hiring a Forward Deployed Engineer on its Applied AI Inference team.

    Baseten, which recently raised a $300M Series E, runs production inference for AI companies like Cursor, Notion and Writer. This role pairs you directly with those customers: you architect, build and deploy high-scale AI applications on Baseten's platform, owning the path from initial exploration to production with clear targets for quality, latency and cost. It is explicitly an engineering job with hands-on coding (Python preferred for the ML work), plus elements of product management, technical customer success and pre-sales solution engineering.

    You turn vague business goals into specs and PoCs, ship well-tested services, and optimize AI/ML projects across the stack. Baseten asks for 2+ years of experience in a fast-paced environment, familiarity with ML pipelines and the full model development-to-deployment lifecycle, and strong communication on complex technical topics.

    What stands out is the front-row seat: you see how dozens of frontier AI companies actually put models into production, exposure that is rare in a single role.

    This recap is dataskew's editorial summary, not the company's copy.