Trust & Safety Engineer (AI)

210 000 - 265 000$

Формат работы

hybrid

Тип работы

fulltime

Грейд

middle

Английский

Страна

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Trust & Safety Engineer (AI): Designing and implementing LLM guardrails and automated abuse detection systems for an AI-native software creation platform with an accent on anti-abuse, phishing prevention, and LLM-specific attack vectors. Focus on building AI-powered detection systems, automating response mechanisms, and mitigating adversarial attacks at scale.

Location: Hybrid in Foster City, CA (In-office requirement: Monday, Wednesday, Friday)

Salary: $210,000 – $265,000 + Equity

Company

hirify.global is an agentic software creation platform that democratizes software development by enabling anyone to build applications using natural language.

What you will do

Design and implement LLM guardrails to detect abuse scenarios in AI-generated code and agent interactions.
Build AI-powered detection systems using LLMs to identify malicious patterns, classify threats, and automate response decisions.
Operate abuse detection systems to identify phishing, cryptomining, account takeover, and financial fraud across millions of actions.
Design and implement automated response mechanisms to enforce platform policies without manual intervention.
Analyze attack patterns using BigQuery and Hex to translate investigation findings into new detection rules.
Integrate and tune security scanners (SAST, SCA) within CI pipelines to maintain strict performance SLAs.

Requirements

4+ years of experience in security engineering, anti-abuse, trust & safety, or fraud detection.
Strong programming skills in Python and/or TypeScript for building detection systems and automation.
Experience with SQL and data analysis at scale using BigQuery, Snowflake, or similar tools.
Experience building or fine-tuning ML/LLM-based classifiers for security or abuse detection.
Familiarity with prompt injection, jailbreaking, and other LLM-specific attack vectors.
Must be based in or able to work from the Foster City, CA office on a hybrid schedule.

Nice to have

Experience at platform companies dealing with user-generated content or compute abuse.
Background in fraud detection, payment abuse, or financial crime.
Familiarity with device fingerprinting, IP reputation, and email validation services.
Knowledge of container security, Linux internals, or cloud infrastructure (GCP preferred).
Experience with CI/CD security tooling like Dependabot, Snyk, or SAST/SCA scanners.

Culture & Benefits

Competitive salary and equity packages.
401(k) program with a 4% match (US only).
Comprehensive health, dental, vision, and life insurance.
Flexible Time Off (FTO), holidays, and paid parental/medical leave.
Monthly wellness stipend and in-office setup reimbursement.
Autonomous work environment with quarterly team gatherings.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →