Senior Software Engineer, Inference

Pika•Palo Alto HQ

Full-timeSenior

$185k - $250k

per year

👁️ 1 views•📝 0 applications•Posted 6/23/2026•Expires 7/24/2026

Tailor Resume for This Job Check ATS Score View Original Posting ↗

Job Description

About the Role We are seeking a Senior Inference Engineer to accelerate the performance of Pika's AI-driven products. In this highly technical role, you will operate at the intersection of cutting-edge inference acceleration, GPU parallelism, advanced model deployment, and video generation technologies. Your expertise will drive significant improvements to model speed and efficiency, ensuring our creative AI systems deliver industry-leading user experiences at scale. You will design and optimize inference pipelines, implement state-of-the-art acceleration techniques, and work closely with researchers and engineers across the team to push the boundaries of what’s possible in real-time AI deployment. Your efforts will play a foundational role in powering the next generation of Pika’s video and language models. What You’ll Do Accelerate Inference : Lead and implement advanced inference acceleration techniques, including attention optimization and quantization for efficient model serving. Maximize GPU Parallelism : Engineer and optimize GPU strategies across tensor, sequence, and pipeline parallelism (TP, SP, PP) for maximal efficiency and scalability. Programming for Performance : Develop and optimize high-performance computing kernels and distributed workloads using CUDA and NCCL. Advance AI Deployment : Collaborate with research and engineering teams to bring state-of-the-art videogen and large language models into production. Improve Training Efficiency : (Bonus) Contribute to improvements in model training speed, stability, and resource utilization as part of our deployment lifecycle. Technical Excellence : Drive rigorous code reviews, participate in technical discussions, and mentor fellow engineers on best practices in inference and GPU programming. What We’re Looking For Experience : 5+ years engineering experience, with a strong track record in inference acceleration and model deployment at scale. Inference Mastery : Proven expertise in inference optimization,

Similar Jobs

Other open roles matched to this job's skills and location.

Junior Electrical Software Engineer

Company not specified

📍 Johannesburg, GautengFull-time

Senior Software Development Engineer, Assistant Vice President

STATE STREET BANK INTERNATIONAL GMBH

📍 Kraków, małopolskieFull-time

Senior Software Engineer (Java, TypeScript, React) @ Edvantis

Edvantis

📍 Rzeszów, podkarpackieFull-time

Multimedia Software Engineer / Senior Specialist @ TOYOTA CENTRAL EUROPE SPÓŁKA Z OGRANICZONĄ ODPOWIEDZIALNOŚCIĄ

TOYOTA CENTRAL EUROPE SPÓŁKA Z OGRANICZONĄ ODPOWIEDZIALNOŚCIĄ

📍 Wrocław, dolnośląskieFull-time

Senior Software Engineer @ Brighstar Lottery

Brighstar Lottery

Tailor Resume for This Job