Назад
Company hidden
4 дня назад

Principal Software Engineer (AI)

139 900 - 274 800$
Формат работы
hybrid
Тип работы
fulltime
Грейд
principal
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Principal Software Engineer (AI): Building and implementing complex inferencing capabilities for state-of-the-art deep learning models, driving innovations in AI infrastructure with an accent on optimizing for cost and leveraging open-source projects. Focus on model performance and deployment, identifying CPU/GPU bottlenecks, and optimizing latency‑critical online services.

Location: Expected to work from the office at least four days per week, if you live within a 50- mile commute of a designated hirify.global office in the U.S.

Salary: USD $139,900 – $274,800 per year.

Company

hirify.global’s mission is to empower every person and every organization on the planet to achieve more.

What you will do

  • Engage directly with key partners to understand, design, and implement complex inferencing capabilities for state-of-the-art deep learning models, driving innovations in AI infrastructure.
  • Work with cutting-edge hardware and software stacks to deliver best-in-class inference performance while optimizing for cost, leveraging open-source projects to advance deep learning applications.
  • Collaborate with external and internal teams to identify new areas for improvement and contribute to innovations that enhance model performance and deployment.
  • Discover/solve impactful technical problems, advance state-of-the-art technologies, and translate ideas into production.
  • Developing internal tools to support the AI lifecycle, including experiment tracking, model versioning, and performance monitoring.
  • Create deep connections within our communities, focus on increasing representation, retaining, and growing our current team members, while fostering awareness and growth through an inclusive environment.

Requirements

  • Bachelor’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • Ability to meet hirify.global, customer and/or government security screening requirements.

Nice to have

  • Experience with model compression (quantization, distillation, SVD, low‑rank methods).
  • Experience in building high‑throughput inference serving stacks (continuous batching, KV‑cache optimizations, routing).
  • Familiarity with hirify.global’s DLIS, Talon routing, Triton/TensorRT‑LLM stack, and Azure/H100/A100 GPU environments.
  • Publications, competition wins, or real‑world deployments related to model efficiency.
  • Solid experience in GPU inference optimization (CUDA, TensorRT, Triton, or custom GPU kernels).
  • Proficiency in profiling tools (Nsight, TensorBoard, PyTorch profiler) and ability to identify CPU/GPU bottlenecks.
  • Deep understanding of LLM/SLM architectures (attention, embeddings, MoE, decoders).
  • Experience optimizing latency‑critical online services.
  • Master’s Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor’s Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.

Culture & Benefits

  • Employees come together with a growth mindset, innovate to empower others, and collaborate to realize shared goals.
  • Build on values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →