USD per year
Staff Infrastructure Software Engineer, Enterprise AI
Location: New York, NY; San Francisco, CA Join the team shaping the future of AI at Scale. Apply Now→ Scale GP is building the infrastructure that makes enterprise AI seamless. We are looking for a Senior or Staff Infrastructure Engineer to act as a primary technical lead, engineering the 'paved road' for our knowledge retrieval and inference engines. You won't just be managing resources; you’ll be defining the deployment standards for Agentic workflows at scale. Your mission is to bridge the gap between complex AI orchestration and world-class infrastructure, ensuring our platform remains the most reliable destination for enterprise agents. The ideal candidate thrives in a fast-paced environment, has a passion for both deep technical work and mentoring, and is capable of setting a long-term technical strategy for a critical domain while maintaining a strong, hands-on delivery focus. You will architect and implement solutions across multiple cloud providers (GCP, Azure, AWS) for customers in diverse, highly-regulated industries like healthcare, telecom, finance, and retail.
What You’ll Do:
- Architect multi-cloud systems and abstractions to allow the SGP platform to run on top of existing Cloud providers.
- Use our own data and AI platform to analyze build and test logs and metrics to identify areas for improvement.
- Define the architectural patterns for our multi-cloud infrastructure to support secure, reliable, and scalable Agentic workflows for enterprise customers.
- Enhance engineering and infrastructure efficiency, reliability, accuracy, and response times, including CI/CD processes, test frameworks, data quality assurance, end-to-end reconciliation, and anomaly detection.
- Collaborate with platform and product teams to develop and implement innovative infrastructure that scales to meet evolving needs.
- Design and champion highly scalable, reliable, and low-latency infrastructure and frameworks for building,...
Scale AI provides high-quality data and full-stack technologies that power the world’s leading models and enable enterprises and governments to build, deploy, and oversee AI applications that deliver real impact. They offer a data-centric, end-to-end solution to manage the entire machine learning lifecycle, combining cutting edge technology with operational excellence to help teams develop the highest-quality datasets.
View Company Profile