Member of Technical Staff (Data): World Models
Remote
Full Time
Senior Level
Engineering
Worldwide
$80K - $120K USD per year

Job Description

Location

Remote

Employment Type

Full time

Location Type

Remote

Department

Artificial Intelligence, AI Engineering

Your Charter

  • Data at Scale: Own the pipelines and storage systems that feed petabyte-scale multimodal datasets into model training.
  • Sustainable Platforms: Build automated, efficient tooling and systems that process data at scale and handle many small, heterogeneous datasets.

Required Skillsets

  • Data Engineering: Knowledge of Python ETL pipelines and supporting infrastructure, data formats, and storage systems at scale.
  • ML Data Ops: Experience managing datasets, annotations, and data versioning for model training.
  • Basic ML Knowledge: Solid grasp of ML fundamentals is essential to collaborate effectively with researchers and make sound data platform decisions.
  • Agentic Engineering: Skilled at writing high-quality specifications for AI agents, while maintaining effective human review of AI-generated work.

Responsibilities

  • Design, automate, maintain, and optimize Python ETL pipelines (Spark/Ray) for large-scale multimodal data.
  • Build and maintain data cataloging, lineage, quality tooling, integrity verification, access controls, and lifecycle management systems.
  • Provide guidance, internal tools, and documentation to colleagues on data best practices.
  • Serve as a custodian of the company’s datasets, ensuring overall data health, quality, and discoverability.

Challenges You'll Tackle

  • Implement high-performance, multimodal data pipelines capable of processing petabyte-scale datasets on tens of thousands of CPUs and hundreds of GPUs.
  • Evolve data formats,...
About Moonvalley

Studio-grade quality. Frame-perfect control. Trained on licensed data. Powered by Marey. Fully licensed, commercially safe. Model controls that respond to precise inputs. Natural results that match your vision. Features include Pose Transfer, Camera Control, Motion Transfer, Trajectory Control. Marey generates image-to-video and text-to-video with cinematic detail, motion, and lighting. Built for the screen, proven in the frame. Advanced lighting, natural physics, dynamic motion, composable layers. Ethical AI video model for filmmakers.
