Ready to apply? Sign up free to apply for jobs, save favorites, and track your applications!
Supercompute Infrastructure Engineer
Senior LevelEngineeringWorldwide
Over $120K
USD per year
Job Description
- About Periodic Labs:
- An AI + physical sciences lab building state-of-the-art models for novel scientific discoveries.
- Well funded and rapidly growing.
- Team members act as owners who identify and solve problems without boundaries or bureaucracy.
- Emphasis on learning new tools and science to advance their mission.
- About the Role:
- Lead, design, build, and operate large-scale compute clusters for AI scientific research.
- Write software to orchestrate large GPU and CPU clusters, manage resource allocation, and automate cluster lifecycle operations.
- Involved in bringup, operations, and maintenance of all cluster aspects.
- Build tools and participate directly in large-scale frontier research experiments.
- Aim to make Periodic Labs the world's best AI + science lab for physicists, computational materials scientists, AI researchers, and engineers.
- Ideal Candidate Experience:
- Managing >=5,000 GPU clusters.
- Cluster scheduling and orchestration tools like Kubernetes (k8s) and Slurm.
- Cloud environments such as GCP, AWS, or Azure.
- Observability and monitoring tools like DataDog, Prometheus, Grafana, or VictoriaMetrics.
- Infrastructure as Code (IaC) tools like Terraform and Ansible.
- GitOps tools like GitHub CI and ArgoCD.
How to Apply
About Periodic Labs
Periodic Labs aims to create an AI scientist and autonomous laboratories for them to operate, focusing on accelerating science in the physical sciences. They build AI scientists and autonomous labs to generate high-quality experimental data, enabling new scientific discoveries and applications such as discovering higher-temperature superconductors and aiding semiconductor manufacturers.
View Company Profile