TL;DR
Staff AI Engineer (AI/ML): Building and delivering high-performance AI features for observability tools with an accent on detecting, triaging, and resolving incidents using AI-driven capabilities. Focus on rapid experimentation, shipping scalable LLM/agent-powered workflows, and improving infrastructure quality through automation.
Location: Remote from USA time zones only
Salary: USD 174,986 - USD 209,983
Company
hirify.global is a remote-first, open-source company providing observability tools and the Grafana LGTM Stack to over 3,000 companies globally.
What you will do
- Build and deliver high-performance AI features to help users detect, triage, and resolve incidents using observability data and tools.
- Implement rapid experimentation where you quickly prototype, test, and validate LLM- or agent-powered workflows.
- Collaborate cross-functionally with data analysts, product managers, and designers to shape AI-driven product features.
- Utilize AI and automation tools to enhance product functionality and development workflows.
- Take full ownership of the AI solutions you develop, ensuring scalability, maintainability, and alignment with user workflows.
Requirements
- Strong engineering skills with solid experience building production software systems (backend and/or full stack).
- Familiarity with AI technologies and frameworks, including LLMs, prompt engineering, and building applications powered by GenAI.
- Quick iteration and experimentation mindset.
- Proven track record of delivering software that made it into production and is actively used by users.
- Exposure to working in cloud-native environments (e.g., AWS, GCP, Azure) and using observability tools to understand and troubleshoot system behavior.
Nice to have
- Experience building or working with agent frameworks or multi-agent workflows.
- Experience with infrastructure/devops related tooling: Kubernetes, Docker, Terraform or similar.
- Familiarity with model fine-tuning techniques.
- Experience building observability tooling.
Culture & Benefits
- 100% remote, global culture that thrives on transparency, autonomy, and trust.
- Equity (Restricted Stock Units) for every team member.
- Investment in developer productivity, including access to modern AI coding assistants and frontier models.
- Global annual leave policy of 30 days per annum, including 3 Grafana Shutdown Days.
- In-person onboarding for new hires.
- High trust, low ego culture that values outcomes over optics.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →