TL;DR
AI Solutions Engineer (AI): Helping customers design, deploy, and scale modern ML and GenAI systems using Weights & Biases and hirify.global’s AI cloud with an accent on building real systems and solving complex problems. Focus on shaping how AI is developed and operationalized at scale.
Location: Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA / Philadelphia, PA / San Francisco, CA / Washington, D.C. While we prioritize a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office.
Salary: $165,000 to $242,000
Company
hirify.global delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence.
What you will do
- Drive discovery, solution design, and successful technical evaluations by building proof of value (PoVs).
- Lead deep technical conversations with ML engineers, researchers, and platform teams to understand their AI/GenAI workloads and architect scalable solutions.
- Design and deliver compelling live demos, proof-of-value engagements, and rapid prototypes showcasing experiment tracking, distributed training, agentic workflows, fine-tuning, evaluation, and inference at scale.
- Translate business objectives into measurable technical outcomes such as model quality improvements, cost reductions, performance gains, and faster iteration cycles.
- Serve as a trusted technical advisor, influencing architectural decisions and best practices across MLOps, LLMOps, and platform engineering.
- Partner closely with Product and Engineering to relay customer feedback, shape roadmap priorities, and improve the end-to-end AI developer experience.
Requirements
- 2+ years of experience in Solutions Engineering, ML Engineering, AI Engineering, or a similar customer-facing technical role.
- Strong proficiency in Python and experience building production-grade ML or GenAI applications.
- Hands-on experience training, fine-tuning, evaluating, debugging, and deploying deep learning or LLM-based systems.
- Experience with modern GenAI architectures including retrieval-augmented generation (RAG), tool-using agents, evaluation frameworks, and guardrails.
- Comfortable designing cloud-native architectures across AWS, GCP, or Azure, with working knowledge of containers, networking fundamentals, and security best practices.
- Applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency.
Nice to have
- Experience building and evaluating agentic applications using frameworks such as LangChain, DSPy, LlamaIndex, CrewAI, OpenClaw, or similar ecosystems.
- Familiarity with vector databases, embeddings workflows, and LLM evaluation frameworks.
- Hands-on experience with Hugging Face Transformers, PEFT/LoRA/QLoRA fine-tuning, TRL, Unsloth, or similar fine-tuning stacks.
- Experience operating distributed workloads using Slurm, Ray, or Kubernetes.
- Experience with inference optimization and serving frameworks such as vLLM, TensorRT-LLM, Triton, or TGI.
Culture & Benefits
- Medical, dental, and vision insurance - 100% paid for by hirify.global.
- Flexible Spending Account and Health Savings Account.
- Tuition Reimbursement and Employee Stock Purchase Program (ESPP).
- Flexible PTO.
- Catered lunch each day in our office and data center locations.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →