AI Inference Engineer Job at Signify Technology, Santa Clara, CA

TlJURUdJelF0WWFoNjBnSkNZRWxnZUUvWWc9PQ==
  • Signify Technology
  • Santa Clara, CA

Job Description

AI Inference Engineer – Stealth Startup | San Fransisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.

Job Tags

Similar Jobs

St. Louis Park Public Schools

2025-26 Math Teacher - High School Job at St. Louis Park Public Schools

 ...Title: Teacher-Secondary DBM Classification: C43/Grade14 Department: Mathematics Salary Range: $47,066 - $103,6...  ...learning. Monitor student progress through assessments and provide timely feedback. Establish and maintain a positive and productive... 

Girl Scouts of Colorado

Product Program Specialist - Girl Scouts of Colorado - Denver Metro Area Job at Girl Scouts of Colorado

 ...will be joining a supportive and flexible work environment with team members who work together...  ...and direction to implement the highest quality customer service and Girl Scout program...  ...some overnight stays. Able to work from home or remote location with secure internet access... 

DHL Supply Chain

CDL - Class A Local Delivery Driver Job at DHL Supply Chain

 ...Shares Program Requirements: Minimum of 6-months verifiable Class A driving experience. Valid Class A operator's license. Be a minimum 21 years of age. Safe driving record. Want to see what it's like to drive for DHL? Check out this short video .... 

Auto-Chlor System, LLC

Territory Sales-Restaurant Equipment Job at Auto-Chlor System, LLC

 ...and Flexibility in the Bronx! Auto-Chlor is seeking an Outside Sales Representative Commercial Dishwasher & Chemicals to focus on new...  ...'s Degree preferred Preferred New business-to-business (B2B) sales experience Hunter sales mentality - goal driven and self... 

Pure Power Engineering

Civil Engineering Manager Job at Pure Power Engineering

Description: About Pure Power Pure Power is an engineering firm specializing in designing big, challenging, and high-profile Solar PV systems. As a full-service engineering firm, we create the electrical and structural drawings and calculation packages for bidding,...