Data/Infrastructure Advocate Engineer (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Data/Infrastructure Advocate Engineer (AI): Bridging the gap between data infrastructure and the global AI community by championing Xet storage and open data workflows on the Hub with an accent on storage optimization and community engagement. Focus on creating technical content, building demos with Parquet and S3, and growing the open-source data engineering ecosystem.
Location: Remote (distributed team with offices in Paris and New York City)
Company
is the fastest growing platform for AI builders, dedicated to democratizing AI through open-source libraries and a hub for sharing models and datasets.
What you will do
- Grow and nurture the open-source data/infra community by launching initiatives and collaborating with groups like Apache Parquet.
- Promote the Hub as the primary platform for data storage, versioning, and collaboration.
- Produce high-quality technical content, including tutorials, blog posts, videos, and Colab notebooks.
- Create demos and benchmarks to illustrate best practices for storage optimization, deduplication, and Parquet editing.
- Engage with developers on Discord, GitHub, and forums to answer questions and foster collaboration.
- Ensure datasets and tools released on the Hub are well-documented with clear examples and benchmarks.
Requirements
- 3+ years in developer relations or advocacy for data engineering, infrastructure, or ML tools.
- Established public technical presence with a track record of publishing content and an engaged audience on LinkedIn and X.
- Portfolio of developer-facing content such as tutorials, blog posts, videos, or conference talks.
- Strong Python skills and experience with pandas, pyarrow, and huggingface/datasets.
- Practical experience with S3, Parquet, Open Table Formats, and dataset versioning/compression.
- Fluent written and spoken English.
Nice to have
- Experience with the Hub, datasets ecosystem, or Xet.
- Open-source maintainer or contributor experience.
- Familiarity with large-scale data pipelines and data engineering workflows.
- Experience producing Colab notebooks for tutorials and benchmarks.
Culture & Benefits
- Distributed-first culture with flexible working hours and remote options.
- Comprehensive health, dental, and vision benefits for employees and their dependents.
- Company equity provided as part of the compensation package.
- Reimbursement for relevant conferences, training, and professional education.
- Flexible paid time off and parental leave.
- Workstation outfit provided for remote employees to ensure success.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →