Company hidden
4 days ago

Trust & Safety Engineer (AI)

Salary: $210,000 – $265,000
Work format: hybrid
Employment type: full-time
Grade: middle
English: B2
Country: US
Job from Hirify Global, a list of international tech companies

Job description

TL;DR

Trust & Safety Engineer (AI): Design and implement LLM guardrails and automated abuse detection systems for an AI-native software creation platform, with an emphasis on anti-abuse, phishing prevention, and LLM-specific attack vectors. The role focuses on building AI-powered detection systems, automating response mechanisms, and mitigating adversarial attacks at scale.

Location: Hybrid in Foster City, CA (In-office requirement: Monday, Wednesday, Friday)

Salary: $210,000 – $265,000 + Equity

Company

hirify.global is an agentic software creation platform that democratizes software development by enabling anyone to build applications using natural language.

What you will do

  • Design and implement LLM guardrails to detect abuse scenarios in AI-generated code and agent interactions.
  • Build AI-powered detection systems using LLMs to identify malicious patterns, classify threats, and automate response decisions.
  • Operate abuse detection systems to identify phishing, cryptomining, account takeover, and financial fraud across millions of actions.
  • Design and implement automated response mechanisms to enforce platform policies without manual intervention.
  • Analyze attack patterns using BigQuery and Hex to translate investigation findings into new detection rules.
  • Integrate and tune security scanners (SAST, SCA) within CI pipelines to maintain strict performance SLAs.

Requirements

  • 4+ years of experience in security engineering, anti-abuse, trust & safety, or fraud detection.
  • Strong programming skills in Python and/or TypeScript for building detection systems and automation.
  • Experience with SQL and data analysis at scale using BigQuery, Snowflake, or similar tools.
  • Experience building or fine-tuning ML/LLM-based classifiers for security or abuse detection.
  • Familiarity with prompt injection, jailbreaking, and other LLM-specific attack vectors.
  • Must be based in or able to work from the Foster City, CA office on a hybrid schedule.

Nice to have

  • Experience at platform companies dealing with user-generated content or compute abuse.
  • Background in fraud detection, payment abuse, or financial crime.
  • Familiarity with device fingerprinting, IP reputation, and email validation services.
  • Knowledge of container security, Linux internals, or cloud infrastructure (GCP preferred).
  • Experience with CI/CD security tooling like Dependabot, Snyk, or SAST/SCA scanners.

Culture & Benefits

  • Competitive salary and equity packages.
  • 401(k) program with a 4% match (US only).
  • Comprehensive health, dental, vision, and life insurance.
  • Flexible Time Off (FTO), holidays, and paid parental/medical leave.
  • Monthly wellness stipend and in-office setup reimbursement.
  • Autonomous work environment with quarterly team gatherings.
