Software Engineer, Inference AI/ML

Remote · USA · Full-time

CoreWeave is The Essential Cloud for AI™, providing a platform for innovators to build and scale AI. The role involves joining the Inference team to implement features that enhance model serving on the GPU platform, focusing on improving latency, reliability, and cost.

Responsibilities

  • Implement well-scoped features and fixes in Python/Go/C++ for model-serving services (e.g., Triton, vLLM, TensorRT-LLM, Ray Serve)
  • Write tests, code comments, and short design docs; participate in code reviews
  • Add basic metrics and dashboards; assist with alarms and runbooks
  • Follow on-call runbooks and learn incident response in a guided rotation
  • Contribute to performance experiments (e.g., request batching, concurrency, caching) with guidance

Skills

  • BS/MS in CS, EE, or related field, or equivalent practical experience
  • Foundations in data structures, algorithms, and networked services
  • Experience with Python or Go (C++ a plus) and Linux fundamentals; Git/CI basics
  • Exposure to containers and Kubernetes (coursework or projects welcome)
  • Curiosity about GPU inference concepts (micro-batching, KV cache, streaming)
  • Internship or project that deployed a microservice or ML inference demo
  • Coursework/research with PyTorch or TensorFlow; simple CUDA projects a plus
  • Familiarity with Grafana/Prometheus/OpenTelemetry or similar tooling

Benefits

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short- and long-term disability insurance
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement
  • Ability to Participate in Employee Stock Purchase Program (ESPP)
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption

Company Overview

  • CoreWeave is a cloud-based AI infrastructure company offering GPU cloud services to simplify AI and machine learning workloads. It was founded in 2017 and is headquartered in Livingston, New Jersey, USA, with a workforce of 1001-5000 employees. Its website is https://www.coreweave.com.
