TL;DR
Member of Technical Staff (Multimodal): Building and advancing multimodal AI systems with an accent on image, video, audio, and text generation and understanding across the full stack. Focus on designing large-scale distributed training systems, optimizing data pipelines at petabyte scale, and delivering frontier capabilities in multimodal reasoning and agentic behaviors.
Location: Must be based in Palo Alto, CA
Salary: $180,000–$440,000 USD
Company
hirify.global is a mission-driven research organization focused on developing advanced AI systems that can accurately understand the universe and aid humanity.
What you will do
- Design and optimize large-scale distributed systems for multimodal pre-training, inference, and data processing.
- Develop high-throughput pipelines for data acquisition, filtering, and management across image, video, and audio modalities.
- Advance core multimodal capabilities including cross-modal alignment, world modeling, and reasoning.
- Drive data quality through scalable curation techniques, analysis, and synthetic data generation.
- Build evaluation frameworks, benchmarks, and reward models to capture real-world performance and human-AI synergy.
- Collaborate across teams to enable reasoning, tool calling, and agentic behaviors in real-time.
Requirements
- Hands-on experience with multimodal pre-training, post-training, or fine-tuning in vision, audio, or video.
- Expert-level proficiency in Python and experience with JAX, PyTorch, or XLA.
- Proven track record of building large-scale distributed ML systems and optimizing GPU utilization.
- Deep experience managing data pipelines at scale, including curation, filtering, and quality studies.
- Strong fundamentals in evaluation design, reward modeling, or RL techniques for interactive systems.
- Ownership-driven mindset with the ability to thrive in a high-intensity, flat organizational structure.
Nice to have
- Proficiency in Rust or C++ for performance-critical development.
- Experience with large-scale orchestration tools such as Spark, Ray, or Kubernetes.
- Background building full-stack tooling and interactive, real-time research demos.
- Deep knowledge of scaling laws, tokenizers, or compression techniques.
Culture & Benefits
- Competitive total rewards package including equity.
- Comprehensive medical, vision, and dental coverage.
- Participation in a 401(k) retirement plan.
- Life insurance and short & long-term disability coverage.
- Work in a small, highly motivated team with a flat organizational structure.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →