Senior MLOps Engineer Job at DeepRec.ai, San Jose, CA

U2JRRmZYRlpFUmsrTWhCV2xLSE1BeGVndlE9PQ==
  • DeepRec.ai
  • San Jose, CA

Job Description

Senior MLOps Engineer

We are hiring for an MLOps Engineer for a fast-moving AI startup who are building a worldclass AI-powered video platform.

We are looking for a skilled and hands-on MLOps Engineer to join their growing team. You will play a critical role in deploying, scaling, and maintaining their machine learning infrastructure, supporting a range of tools that enable the controlled generation of high-quality animated videos.

Key Responsibilities

  • Design, deploy, and maintain scalable training and data-processing pipelines on distributed compute clusters (e.g., Slurm, Kubernetes, or cloud-native equivalents).
  • Optimize inference systems for latency and cost in a production setting.
  • Collaborate closely with ML researchers and engineers to productionize deep learning models.
  • Implement robust monitoring, logging, and alerting systems for model performance and infrastructure reliability.
  • Automate model testing, validation, and deployment processes across staging and production environments.
  • Ensure efficient usage of compute resources, including GPU clusters, and help identify bottlenecks or cost-saving opportunities.

Requirements

  • Proven experience in MLOps, ML infrastructure, or related roles.
  • Deep expertise in deploying and maintaining ML training pipelines on distributed systems.
  • Strong knowledge of inference optimization techniques, especially in reducing latency and cost at scale.
  • Proficiency with cloud platforms (AWS, GCP, Azure) and orchestration tools (Kubernetes, Docker).
  • Experience working with GPU scheduling, distributed training (e.g., PyTorch DDP), and model serving frameworks (e.g., Triton, TorchServe).
  • Familiarity with CI/CD for ML workflows.
  • Strong Python skills and experience with ML/DL frameworks like PyTorch or TensorFlow.

Bonus Points

  • Experience working in the creative media or animation industry.
  • Exposure to video processing, generative AI, or large-scale content production systems.
  • Experience collaborating with research teams or integrating research code into production pipelines.

Please apply for more information

Job Tags

Similar Jobs

Zobility

Production Associate (6 PM to 6 AM) Job at Zobility

Looking for a Production Employee to work onsite in Casa Grande, AZ. Pay: $22/hr, Overtime Pay: $33/hr Shift: 6 PM to 6 AM (12 hours; 8 hours are paid at 22/hr, 4 hours OT pay 33/hr) Benefits: 2 weeks PTO, 8 paid holidays, 401k, Insurance, $2000/year tuition reimbursement...

PEA Group

Environmental Project Manager Job at PEA Group

 ...Overview ASTI Environmental, now a division of PEA Group, has led the way in environmental consulting, engineering, and remediation services since 1985. Our commitment to delivering tailored, innovative solutions goes beyond traditional services. Our team includes highly... 

Pop-Up Talent

Legal Assistant Job at Pop-Up Talent

 ...Legal Assistant Los Angeles, CA 90045 SALARY: $54-58k a year based on experience BENEFITS: ~ Medical and Dental Insurance...  ...Paid Vacation and sick days ~12 paid holidays ~ Weekly 1 work from home day -assigned date based on team availability ~401k and... 

CrewSeekers Ltd

Crossing the Atlantic from Chesapeake to UK Job at CrewSeekers Ltd

 ...our boat across the Atlantic. Direct route to the UK between 40 and 38 deg latitude for most of the way across.She's a heavy boat and handles the weather and the seas well. She's sailed up to Canada and down to the Bahamas offshore. Boat and skipper are ready to go...

BlackRock Resources LLC

Chief Investigator Job at BlackRock Resources LLC

You must be able to work in the U.S. without sponsorship. No C2C or 3rd parties, please. Now Hiring: Sr. Engineer / Staff Engineer Safety, Incident Response & Technical Leadership Are you an experienced engineer ready to lead high-impact safety investigations ...