Назад
Company hidden
1 день назад

Machine Learning Engineer (AI)

172 000 - 250 000$
Формат работы
onsite
Тип работы
fulltime
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Machine Learning Engineer (AI): Designing and scaling systems that bring AI models from research into production-grade deployments with an accent on large-scale ML systems and low-latency inference. Focus on distributed infrastructure, product integration, ensuring performance, reliability, and continuous improvement of AI systems.

Location: Palo Alto, California, United States

Salary: $172,000.00 to $250,000.00

Company

hirify.global harnesses the power of AI to improve human well-being and productivity.

What you will do

  • Design and implement scalable, low-latency model-serving infrastructure for large language models and multimodal systems.
  • Build and maintain robust APIs and services to support real-time conversational workloads.
  • Architect and improve end-to-end ML pipelines spanning training, evaluation, deployment, monitoring, and rollback.
  • Define data requirements and feedback loops to enable continuous model improvement.
  • Lead architectural decisions that balance performance, scalability, safety, and maintainability.

Requirements

  • 1-4 years of experience in machine learning engineering, backend systems, or distributed infrastructure.
  • Proven experience deploying and operating ML models in production environments.
  • Strong programming skills in Python and/or C++ (or equivalent systems language).
  • Experience with large-scale model serving (LLMs, transformers, or similar architectures).
  • Deep understanding of distributed systems, API design, and cloud infrastructure.
  • Experience with MLOps tools and workflows (CI/CD, model monitoring, experiment tracking).

Nice to have

  • Experience scaling high-throughput, low-latency inference systems.
  • Familiarity with GPU acceleration, model optimization (quantization, batching, caching), and performance tuning.
  • Experience working with conversational AI systems or real-time user-facing AI products.
  • Knowledge of ML evaluation methodologies, safety systems, and guardrail design.
  • Background collaborating closely with research teams in fast-paced AI environments.

Culture & Benefits

  • Diverse medical, dental and vision options
  • 401k matching program
  • Unlimited paid time off
  • Parental leave and flexibility for all parents and caregivers
  • Support of country-specific visa needs for international employees living in the Bay Area

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Текст вакансии взят без изменений

Источник - загрузка...