TL;DR
AI Engineer (Gen AI/SWE): Designs, implements, and evaluates LLM applications and agents with cutting-edge techniques. Focus on rapid prototyping, careful evaluation, and production-grade reference implementations with clear trade-offs.
Location: Livingston, NJ / New York, NY / San Francisco, CA / Sunnyvale, CA / Bellevue, WA / Remove - US.
Salary: $188,000 to $275,000
Company
hirify.global is The Essential Cloud for AI™ that delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence.
What you will do
- Ship end-to-end GenAI workflows (prompting → RAG → tools/agents → eval → serve) with reproducible repos, W&B Reports, and dashboards others can run.
- Build agentic systems (tool use, function calling, multi-step planners) with MCP servers/clients and secure tool/resource integrations.
- Design evaluation harnesses (RAG/agent evals, golden sets, regression tests, telemetry) and drive continuous improvement via offline + online metrics.
- Build in public: Publish engineering artifacts (code, docs, talks, tutorials) and engage with OSS and customer engineers; turn repeated patterns into reusable templates.
- Partner with product/solutions to launch LLM-powered features with clear latency/cost/SLO targets and safety/guardrail checks.
- Run growth experiments to track the usage of the Weights & Biases suite of products from the artifacts built.
Requirements
- Software engineering: 6+ years building production systems; strong Python or TypeScript + system design, testing, CI/CD, observability.
- GenAI apps: shipped LLM-powered features (tools/agents/function calling), with measurable impact (latency/cost/reliability).
- Agentic patterns: implemented planners/executors, tool orchestration, sandboxing, and failure taxonomies; familiarity with agent infra concerns.
- RAG: pragmatic mastery of chunking, embeddings, vector/hybrid search, rerankers; experience with vector DBs/search indices and retrieval policy design.
- Evaluation: designed LLM/RAG/agent evals (offline golden sets, counterfactuals, user studies, guardrail tests); stats literacy (variance, CIs, power).
- Serving & productization: comfortable with queueing, caching, streaming, and cost controls; can debug latency at model, retrieval, and network layers.
- Public signal: 2+ substantial OSS repos/blog posts/talks/videos with adoption (stars, forks, downloads, views) and reproducible artifacts.
Nice to have
- Experience building with AI SDKs / agent frameworks (e.g., TypeScript/Python SDKs, planning libraries) and shipping developer-facing examples.
- Production agent security/sandboxing, red-teaming, and policy/PII enforcement.
- Operated eval platforms or built judge models/heuristics; experience leading metrics reviews with product/UX.
- Customer-facing enablement: templates or reference implementations adopted by external teams at scale.
Culture & Benefits
- Medical, dental, and vision insurance - 100% paid for by hirify.global
- Flexible PTO
- A casual work environment
- A work culture focused on innovative disruption
- 401(k) with a generous employer match
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →