Solutions Architect Job at Quantiphi, Dallas, TX

  • Quantiphi
  • Dallas, TX

Job Description

Job Summary:

We are seeking experienced Platform Engineers with expertise in MLOps and distributed systems, particularly Kubernetes, along with a strong background in managing multi-GPU, multi-node deep learning job and inference scheduling. Candidates should be proficient in Linux (Ubuntu) systems, able to write intricate shell scripts, comfortable working with configuration management tools, and have a sufficient understanding of deep learning workflows.

Required Skills & Qualifications:

● Experience:

○ 3+ years of experience in platform engineering, DevOps, or systems engineering, with a strong focus on machine learning and AI workloads.

○ Proven experience working with LLM workflows and GPU-based machine learning infrastructure.

○ Hands-on experience in managing distributed computing systems, training large-scale models, and deploying AI systems in cloud environments.

○ Knowledge of GPU architectures (e.g., NVIDIA A100, V100), multi-GPU systems, and optimization techniques for AI workloads.

● Technical Skills:

○ Proficiency in Linux systems and command-line tools. Strong scripting skills (Python, Bash, or similar).

○ Expertise in containerization and orchestration technologies (e.g., Docker, Kubernetes, Helm).

○ Experience with a cloud platform (AWS), tools such as Terraform/Terragrunt or similar infrastructure-as-code solutions, and exposure to automating CI/CD pipelines using Jenkins, GitLab, GitHub, etc.

○ Familiarity with machine learning frameworks (TensorFlow, PyTorch, etc.) and deep learning model deployment pipelines. Exposure to vLLM or the NVIDIA software stack for data and model management is preferred.

○ Expertise in performance optimization tools and techniques for GPUs, including memory management, parallel processing, and hardware acceleration.

● Soft Skills:

○ Strong problem-solving skills and ability to work on complex system-level challenges.

○ Excellent communication skills, with the ability to collaborate across technical and non-technical teams.

○ Self-motivated and capable of driving initiatives in a fast-paced environment.

Good to Have Skills:

● Experience in building or managing machine learning platforms, specifically for generative AI models or large-scale NLP tasks.

● Familiarity with distributed computing frameworks (e.g., Dask, MPI, PyTorch DDP) and data pipeline orchestration tools (e.g., AWS Glue, Apache Airflow).

● Knowledge of AI model deployment frameworks such as TensorFlow Serving, TorchServe, vLLM, and Triton Inference Server.

● Good understanding of LLM inference and how to optimize self-managed infrastructure.

● Understanding of AI model explainability, fairness, and ethical AI considerations.

● Experience in automating and scaling the deployment of AI models on a global infrastructure.
