[Remote] Machine Learning Operations Engineer

Remote · USA · Full-time

Note: This is a remote position open to candidates in the USA. The Associated Press is an independent global news organization dedicated to factual reporting. The Machine Learning Operations Engineer will own the production lifecycle of machine-learning systems, ensuring reliable, secure, and cost-effective operation of ML workloads in production environments.

Responsibilities

  • Design, deploy, and operate end‑to‑end production ML pipelines across Dev, QA, and Prod environments
  • Set up and manage AWS SageMaker pipelines, endpoints, and monitoring for large-scale inference workloads, including embedding generation, named entity recognition, reranking, and video processing
  • Own GPU and CPU infrastructure selection, scaling, and optimization, including instance benchmarking, autoscaling behavior, and load testing
  • Deploy, monitor, and operate inference services that support hundreds of thousands of queries per day across text, image, and video pipelines
  • Establish standardized ML deployment patterns at AP, including containerization and orchestration strategies; environment isolation (Dev / QA / Prod); and versioned promotion, rollback, and recovery mechanisms
  • Implement monitoring, alerting, drift detection, and evaluation metrics for production ML systems, tracking latency, error rates, throughput, and model/data drift
  • Enable A/B testing and controlled rollout strategies for ML models in production, in partnership with engineering and product teams
  • Partner closely with ML Engineers, Data Scientists, DevOps, and Platform teams to operationalize new models and pipeline improvements, promote systems across environments safely, and ensure deployments meet reliability, scale, and cost targets
  • Manage high-throughput I/O and data movement for large collections of media assets (text, images, video), avoiding CPU, network, and storage bottlenecks
  • Reduce operational risk by enforcing reproducibility, observability, security, and cost controls across all production ML systems

Skills

  • 5+ years of experience deploying and operating ML inference systems in production
  • Strong experience with AWS SageMaker, including pipelines, endpoints, monitoring, and multi‑environment deployments
  • Expertise deploying ML models using PyTorch and TensorFlow from an operational and serving perspective
  • Proven experience with model deployment and orchestration, including containerized inference and autoscaling
  • Experience selecting, evaluating, and optimizing compute resources (GPU/CPU) for production ML workloads
  • Experience setting up monitoring, evaluation metrics, and A/B testing frameworks for ML systems in production
  • Ability to collaborate effectively with ML Engineers, Data Scientists, and platform teams in a shared ownership model
  • Operational experience supporting ML systems involving transformer‑based NLP models (e.g., BERT‑family models), computer vision models, and ranking and reranking systems
  • Familiarity with running systems that use common ML model types such as convolutional and feed‑forward neural networks, ranking algorithms, and approximate nearest neighbor methods (e.g., HNSW)
  • Experience running ML workloads over large‑scale text, image, and video datasets

Benefits

  • Competitive medical, dental and vision coverage
  • Retirement benefits
  • Company-paid life insurance
  • Paid vacation and sick days
  • Paid parental leave for any new parent
  • Mental well-being resources

Company Overview

  • The Associated Press is a source of independent newsgathering, supplying a steady stream of news to its members, and more. It was founded in 1846, and is headquartered in New York, New York, USA, with a workforce of 1001-5000 employees. Its website is http://www.ap.org.

Company H1B Sponsorship

  • The Associated Press has a track record of offering H1B sponsorships, with 1 in 2026, 2 in 2025, 7 in 2024, 4 in 2023, 4 in 2022, 2 in 2021, 2 in 2020. Please note that this does not guarantee sponsorship for this specific role.