Senior Data Scientist (Big Data)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Data Scientist (Big Data): Designing and deploying advanced ML and graph algorithms for entity resolution and identity matching within a large-scale PII ecosystem with an accent on graph-based identity representations and scalable data pipelines. Focus on optimizing match rates, reducing false positives/negatives, and implementing rigorous A/B testing for identity trust scoring.
Location: Remote (US), but must be located within 45 miles of a talent hub
Salary: $140,000 - $170,000
Company
builds identity trust infrastructure for the digital economy, verifying identities in real-time to stop fraud and enable secure digital transactions.
What you will do
- Own the design and evaluation of ML, statistical, and graph-based algorithms for entity resolution and anomaly detection on massive datasets.
- Architect and optimize graph-based identity representations to improve match rates and support KYC models.
- Build and maintain scalable data pipelines and feature stores using Spark/PySpark in AWS/Databricks environments.
- Lead A/B tests and offline/online experimentation, defining success metrics and ensuring rigorous validation before rollout.
- Evaluate internal and external data sources to quantify incremental value and provide vendor selection recommendations.
- Collaborate with product and engineering teams to translate regulatory requirements into concrete modeling and data roadmaps.
Requirements
- Master's degree with 3+ years of industry experience, or Ph.D. with 1+ years in applied ML/data science roles.
- Strong proficiency in Python or Scala, and ML libraries such as scikit-learn, XGBoost, TensorFlow, or PyTorch.
- Extensive experience with Spark/PySpark and distributed systems like AWS EMR or Databricks.
- Deep understanding of supervised/unsupervised learning, feature engineering, and experiment design.
- Experience developing production-quality data pipelines using Airflow or similar orchestration tools.
- Must be located within 45 miles of a talent hub in the US; sponsorship is not available.
Nice to have
- Familiarity with graph databases and frameworks (Neo4j, AWS Neptune, GraphFrames, DGL, PyTorch Geometric).
- Experience in identity verification, fraud detection, or credit risk domains.
Culture & Benefits
- Opportunity to work on complex, high-impact problems with a high technical bar.
- Ownership-driven environment where critical thinking and speed are valued.
- Collaborative culture focusing on precision and solving real-world customer problems.
- Commitment to diversity, equity, and inclusion as an equal opportunity employer.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →