[Remote] AI Field Engineer - AI Natives

Remote · USA · Full-time

Note: This is a remote role open to candidates in the USA. Fireworks AI is a leading company in generative AI infrastructure, focused on delivering high-quality models with scalable inference. The company is seeking an AI Field Engineer to work closely with customers: building production systems, enhancing their AI capabilities, and handling technical discussions and stakeholder management.

Responsibilities

  • Build end-to-end POCs and MVPs alongside customer engineering teams, working inside their codebases, infrastructure, and constraints
  • For customers whose core product is built on GenAI, architect the inference foundations that capability depends on, and size deployments so they can scale in their market without infrastructure becoming the bottleneck
  • Run load tests and establish latency, throughput, and cost baselines against realistic customer traffic profiles, and tune deployments to hit those targets
  • Deploy and validate new model families on inference frameworks (vLLM, SGLang), determining optimal shapes, quantization configs, and serving patterns across workloads
  • Guide customers on model selection, fine-tuning strategy (SFT, DPO, RFT), and evaluation methodology
  • Build and run fine-tuning pipelines directly with customers, navigating trade-offs between model families, compute cost, and quality targets
  • Design and implement evaluation frameworks that measure production-quality metrics, not just benchmark scores
  • Help customers bake frontier model capabilities into their core offering and turn that into a durable competitive edge
  • Lead structured discovery conversations to unpack customer pain points, constraints, and success criteria before proposing solutions
  • Own the technical relationship from first engagement through production deployment. Embed with their engineering team as a peer; your credibility comes from what you build alongside them
  • Spend time on-site with customers. Build trust and momentum in person, embedding with their teams where the work happens
  • Identify recurring customer pain points and translate them into concrete product proposals, working directly with engineering and product to ship fixes and features
  • Codify repeatable deployment patterns and contribute them back to internal tooling, documentation, and the platform itself
  • Feed customer signals (deployment patterns, failure modes, feature gaps) back into the product roadmap with specificity and urgency

Skills

  • 5+ years in a hands-on, customer-facing technical role: Forward Deployed Engineer, Applied AI Engineer, Solutions Architect, ML Engineer with field exposure, or technical founder
  • Demonstrated ability to build production software with customers, not just advise on it. You have shipped code running in someone else's production environment
  • Strong Python skills. Comfortable reading, writing, and debugging production code. Familiarity with Kubernetes and infrastructure engineering
  • Working knowledge of the LLM stack: inference trade-offs, model serving, fine-tuning workflows (SFT at minimum; DPO/RFT a strong plus)
  • Experience with cloud infrastructure (AWS, Azure, GCP) and deploying models on GPU infrastructure
  • Exceptional communication: able to run a sharp discovery call, present to a VP, and debug a latency issue with an ML engineer in the same afternoon
  • Experience building or integrating agentic systems, tool-use chains, or AI-native developer toolchains
  • 10+ years in technical field or engineering roles
  • Experience with inference serving frameworks (vLLM, SGLang, TensorRT-LLM) and tuning deployments for real workloads
  • Prior experience at a company with a forward-deployed or embedded engineering model (Palantir, Scale AI, Anthropic, OpenAI, BCG X, McKinsey QuantumBlack, AI-native startups with FDE motions)
  • Prior experience as a technical founder or early engineer at an AI-native company is a strong signal
  • Track record taking GenAI POCs from prototype to production-scale deployments
  • Experience with hyperscaler AI platforms (Azure AI Foundry, AWS Bedrock/SageMaker, GCP Vertex)

Benefits

  • Meaningful equity in a fast-growing startup
  • Competitive salary
  • Comprehensive benefits package

Company Overview

  • Fireworks AI is an advanced platform that enables users to build, tune, and scale AI applications using open-source models. It was founded in 2022 and is headquartered in Redwood City, California, USA, with a workforce of 51-200 employees. Its website is https://fireworks.ai.

Company H1B Sponsorship

  • Fireworks AI has a track record of offering H1B sponsorships, with 11 in 2026, 9 in 2025, 2 in 2024, and 1 in 2023. Please note that this does not guarantee sponsorship for this specific role.