Job Description
Minimum qualifications: Bachelor's degree in Computer Science, Electrical Engineering, or a related field or equivalent practical experience. 8 years of experience in optimizing machine learning models for resource-constrained environments. Experience in inference for Large Language Models (LLMs), including architectures like Mixture of Experts or diffusion models. Preferred qualifications: Experience with core software engineering and building highly available systems. Experience with ML frame…