Midjourney is seeking a Senior Data Engineer to join its data team in the San Francisco Bay Area. The role involves processing, filtering, and managing datasets for image generation models.
About the Role
As a Senior Data Engineer, you will build large-scale dataset processing and filtering pipelines, train classifiers for content moderation, develop models for data quality evaluation, and create data visualization tools. You will also work on performance optimization and infrastructure scaling, and occasionally take part in inference optimization projects. The role requires adaptability across different environments and technologies, including PySpark and distributed systems.
About You
Required:
Strong experience in data engineering or machine learning pipelines at scale.
Experience with cloud infrastructure and distributed systems.
Preferred:
Familiarity with PySpark and Slurm.
Comfort with adjacent technologies and a willingness to learn.
Benefits
Opportunity to work on cutting-edge generative AI projects.
Flexible work location, with exceptions possible for exceptional candidates.
Midjourney
Midjourney is an independent research lab exploring new mediums of thought and expanding the imaginative powers of the human species.