Назад
Company hidden
13 часов назад

Senior Data Scientist (Big Data)

140 000 - 170 000$
Формат работы
remote (только USA)/hybrid
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Senior Data Scientist (Big Data): Designing and deploying advanced ML and graph algorithms for entity resolution and identity matching within a large-scale PII ecosystem with an accent on graph-based identity representations and scalable data pipelines. Focus on optimizing match rates, reducing false positives/negatives, and implementing rigorous A/B testing for identity trust scoring.

Location: Remote (US), but must be located within 45 miles of a hirify.global talent hub

Salary: $140,000 - $170,000

Company

hirify.global builds identity trust infrastructure for the digital economy, verifying identities in real-time to stop fraud and enable secure digital transactions.

What you will do

  • Own the design and evaluation of ML, statistical, and graph-based algorithms for entity resolution and anomaly detection on massive datasets.
  • Architect and optimize graph-based identity representations to improve match rates and support KYC models.
  • Build and maintain scalable data pipelines and feature stores using Spark/PySpark in AWS/Databricks environments.
  • Lead A/B tests and offline/online experimentation, defining success metrics and ensuring rigorous validation before rollout.
  • Evaluate internal and external data sources to quantify incremental value and provide vendor selection recommendations.
  • Collaborate with product and engineering teams to translate regulatory requirements into concrete modeling and data roadmaps.

Requirements

  • Master's degree with 3+ years of industry experience, or Ph.D. with 1+ years in applied ML/data science roles.
  • Strong proficiency in Python or Scala, and ML libraries such as scikit-learn, XGBoost, TensorFlow, or PyTorch.
  • Extensive experience with Spark/PySpark and distributed systems like AWS EMR or Databricks.
  • Deep understanding of supervised/unsupervised learning, feature engineering, and experiment design.
  • Experience developing production-quality data pipelines using Airflow or similar orchestration tools.
  • Must be located within 45 miles of a talent hub in the US; sponsorship is not available.

Nice to have

  • Familiarity with graph databases and frameworks (Neo4j, AWS Neptune, GraphFrames, DGL, PyTorch Geometric).
  • Experience in identity verification, fraud detection, or credit risk domains.

Culture & Benefits

  • Opportunity to work on complex, high-impact problems with a high technical bar.
  • Ownership-driven environment where critical thinking and speed are valued.
  • Collaborative culture focusing on precision and solving real-world customer problems.
  • Commitment to diversity, equity, and inclusion as an equal opportunity employer.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →