Machine Learning Engineer Job at Evolve Group, Fremont, CA

TGpreVo3cTVxdnZaUFRybE1QaDVSY0tvbkE9PQ==
  • Evolve Group
  • Fremont, CA

Job Description

Machine Learning Engineer

Tech start-up

San Fransisco based

We’ve partnered with one of the most ambitious and technically rigorous AI research labs in the world. Based in San Francisco, this team is building foundation models entirely from scratch.

They are now hiring ML Infrastructure Engineers to design and scale the systems that power large-scale, distributed model training. If you’ve built infrastructure that runs across hundreds of GPUs, thrive under technical complexity, and want to work side-by-side with elite AI researchers — this is the role.

Key Responsibilities:

  • Build and scale distributed training systems for large-scale model training across LLMs, vision, and robotics.
  • Set up and run large-scale training across many GPUs using tools like Kubernetes, DeepSpeed, and FSDP.
  • Troubleshoot system issues (GPU errors, network problems) and build tools to monitor and recover from failures.
  • Optimize PyTorch pipelines, sharding, and sampling strategies.
  • Collaborate closely with researchers to support novel model training at scale.

Requirements:

  • 3–15 years in ML infrastructure, systems, or research engineering roles.
  • Proven experience scaling distributed training for large models.
  • Strong with PyTorch, CUDA, NCCL, Kubernetes.
  • Familiar with setting up distributed training clusters.
  • Deep understanding of PyTorch dataloaders, data sharding, and sampling.
  • Strong communicator with a collaborative, mission-driven mindset.

This is a fully in-person role based in San Francisco , it's ideal for engineers excited to build at the edge of what's possible in AI.

Job Tags

Immediate start,

Similar Jobs

DataAnnotation

Physics Tutor Job at DataAnnotation

 ...Astrophysics, Biophysics, Electrical Engineering, Nuclear Engineering, Chemical Engineering, Mathematics. Benefits: This is a full-time or part-time REMOTE position Youll be able to choose which projects you want to work on You can work on your own schedule... 

KPC GLOBAL MEDICAL CENTERS INC.

Tasting Room Server - Mt Palomar Winery - Temecula, CA Job at KPC GLOBAL MEDICAL CENTERS INC.

 ...Job Description Job Description Job Description: The Tasting Room Server is responsible for ensuring that guests are offered an...  ... Replenish supplies in the tasting bar area, including wine, beer, glassware, napkins and other related items. Be prepared... 

Pacific Habitat Services, Inc

Live Chat Agents Job at Pacific Habitat Services, Inc

We are seeking a skilled and customer-focused Live Chat Agent to provide real-time support to customers through our website and/or app chat platform. As a key part of the customer service team, you will handle inquiries, provide accurate information, and resolve issues... 

JDEE Transport Services

LOCAL TRUCK DRIVER / CDL A / HOME DAILY Job at JDEE Transport Services

 ...JDEE Transport Services is a Class A employment agency that places drivers in permanent positions across the United...  ...seeking experienced drivers with a CDL A for a Localposition out of...  ...Cheyenne, WY and drivers will still be home daily. Job description: Shift:... 

Jobot

Executive Assistant Job at Jobot

 ...stakeholders Draft and follow up on outbound sales messaging (email, phone, text) Manage lead follow-up and outreach tracking in HubSpot CRM Provide regular updates and priorities through Monday.com Coordinate meetings both on and off site, prepare materials,...