All roles

[Remote] Lead Data Scientist

Remote · USA Full-time New today

Note: The job is a remote job and is open to candidates in USA. Smarsh empowers its customers to manage risk and unleash intelligence in their digital communications. As a Lead Data Scientist (NLP & Financial Compliance), you will develop NLP and large language model solutions for compliance and surveillance systems, working with data to uncover misconduct and risk while mentoring junior team members.

Responsibilities

  • Collect, analyze, and interpret small/large datasets to uncover meaningful insights to support the development of statistical methods / machine learning algorithms
  • Lead the design, training, and deployment of NLP and transformer-based models for financial surveillance and supervisory use cases (e.g., misconduct detection, market abuse, trade manipulation, insider communication)
  • Development of machine learning models and other analytics following established workflows, while also looking for optimization and improvement opportunities
  • Data annotation and quality review
  • Exploratory data analysis and model fail state analysis
  • Contribute to model governance, documentation, and explainability frameworks aligned with internal and regulatory AI standards
  • Client/prospect guidance in machine learning model and analytic fine-tuning/development processes
  • Provide guidance to junior team members on model development and EDA
  • Work with Product Manager(s) to intake project/product requirements and translate these to technical tasks within the team’s tooling, technique and procedures
  • Continued self-led personal development

Skills

  • Strong understanding of financial markets, compliance, surveillance, supervision, or regulatory technology
  • Experience with one or more data science and machine/deep learning frameworks and tooling, including scikit-learn, H2O, keras, pytorch, tensorflow, pandas, numpy, carot, tidyverse
  • Command of data science and statistics principles (regression, Bayes, time series, clustering, P/R, AUROC, exploratory data analysis etc…)
  • Strong knowledge of key programming concepts (e.g. split-apply-combine, data structures, object-oriented programming)
  • Solid statistics knowledge (hypothesis testing, ANOVA, chi-square tests, etc…)
  • Knowledge of NLP transfer learning, including word embedding models (gloVe, fastText, word2vec) and transformer models (Bert, SBert, HuggingFace, and GPT-x etc.)
  • Experience with natural language processing toolkits like NLTK, spaCy, Nvidia NeMo
  • Knowledge of microservices architecture and continuous delivery concepts in machine learning and related technologies such as helm, Docker and Kubernetes
  • Familiarity with Deep Learning techniques for NLP
  • Familiarity with LLMs - using ollama & Langchain
  • Excellent verbal and written skills
  • Proven collaborator, thriving on teamwork
  • Master's or Doctor of Philosophy degree in Computer Science, Applied Math, Statistics, or a scientific field
  • Familiarity with cloud computing platforms (AWS, GCS, Azure)
  • Experience with automated supervision/surveillance/compliance tools

Company Overview

  • Smarsh manage the risk and see the value in their communications data. It was founded in 2001, and is headquartered in Portland, Oregon, USA, with a workforce of 1001-5000 employees. Its website is http://www.smarsh.com.
  • Company H1B Sponsorship

  • Smarsh has a track record of offering H1B sponsorships, with 16 in 2025, 5 in 2024, 12 in 2023, 22 in 2022, 2 in 2021, 1 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    Related roles

    [Remote] Environmental Project Manager

    Remote · USA Full-time

    [Remote] Senior Financial Analytics Advisor-Remote

    Remote · USA Full-time

    [Remote] Product Manager – IT Cooling Systems

    Remote · USA Full-time

    [Remote] Director-Delivery Operations - CDH - Remote

    Remote · USA Full-time

    [Remote] Manufacturing Engineering Technician

    Remote · USA Full-time

    [Remote] Project Manager, Data & Insights Solutions

    Remote · USA Full-time

    [Remote] Bilingual Customer Service Representative (Spanish)

    Remote · USA Full-time

    [Remote] Enterprise Account Manager

    Remote · USA Full-time

    [Remote] Head of Marketing

    Remote · USA Full-time

    [Remote] Assistant Manager, Marketing & Communications (Contract Employee)

    Remote · USA Full-time

    CSR Information

    Remote · USA Full-time

    Experienced Remote Customer Service Representative – Delivering Exceptional Patient Care and Support in a Dynamic Telehealth Environment

    Remote · USA Full-time

    [Remote] Account Executive, SC SaaS - CPG & Mfg. - North America

    Remote · USA Full-time

    ServiceNow Architect/Developer (Remote)

    Remote · USA Full-time

    Data Entry Specialist – Student & Fresh Graduate Opportunity at arenaflex – Earn While You Learn

    Remote · USA Full-time

    Experienced Chat Support Officer – Enhancing Customer Satisfaction through Proactive Support and Collaboration

    Remote · USA Full-time

    Remote Senior Scrum Master, Agile Delivery (Enterprise Data / Transformation)

    Remote · USA Full-time

    Remote Sales Lead

    Remote · USA Full-time

    Experienced Remote Client Support Sales Assistant – Debt Solutions and Client Success

    Remote · USA Full-time

    Art Museum Student Receptionist (AY 25-26 900111)

    Remote · USA Full-time