TL;DR
Site Reliability Engineer (Fintech): Build and maintain cloud-native infrastructure and tools for scalable, reliable systems, with an emphasis on CI/CD pipeline automation, database monitoring, and observability. The focus is on deploying infrastructure resources via Terraform, managing Kubernetes and Istio, and leveraging LLMs for tooling.
Location: Chicago, IL and Redwood City, CA. This role follows a hybrid schedule: Redwood City requires in-office attendance on Mondays plus three days of your choice from Tuesday through Friday; Chicago requires four days in the office and one day remote.
Company
hirify.global empowers consumers to leverage their data in exchange for modern financial services, aiming to build a more equitable and efficient data sharing ecosystem.
What you will do
- Build and maintain cloud-native infrastructure by writing Terraform modules and developing Helm charts.
- Define metrics, network policies, and routing rules for the Istio service mesh.
- Monitor and maintain GCP BigQuery and Spanner databases.
- Pipe metrics to Google-managed Prometheus and build Grafana dashboards and alerts.
- Experiment with GCP offerings, third-party vendors, and open-source tools to automate and secure operations.
- Participate in architecture design and capacity planning discussions to ensure system scalability, maintainability, reliability, and security.
Requirements
- 4+ years of experience building and maintaining large-scale cloud-native infrastructure (AWS and/or GCP).
- Experience working with container, orchestration, and service mesh technologies such as Docker, Kubernetes, and Istio.
- Experience with SQL database technologies including MySQL, Google BigQuery, and Google Spanner.
- Experience with streaming technologies like Kafka and Amazon Kinesis.
- Experience with pub/sub technologies such as AWS SNS and Google Pub/Sub.
- Experience with serverless computing technologies like AWS Lambda and Google Cloud Functions/Run.
- Proficiency with infrastructure-as-code tools such as Terraform.
- Experience with observability tools including Datadog, Prometheus, and Grafana.
Nice to have
- Experience with SOC 2 compliance processes and requirements.
Culture & Benefits
- Work directly with hands-on leaders and mission-driven individuals.
- Opportunity to learn and teach in a fast-paced, collaborative environment.
- A team with a strong desire to automate processes and reduce reliance on manual tasks.
- Opportunity to experiment with new technologies and stress test them.
- A culture of regularly providing and proactively seeking constructive feedback for continuous improvement.