AI Performance Software Engineer Job at Signify Technology, San Francisco, CA

VjZnMVRMdkJRNnFxd1dXS3ZQczZMeXVQb2c9PQ==
  • Signify Technology
  • San Francisco, CA

Job Description

AI Performance Engineer – CUDA & PyTorch Focus

Location: San Fransisco, CA

Compensation: $200,000-$300,000

A stealth-mode AI systems company is reimagining how large-scale inference is done. With generative AI workloads scaling rapidly, inference efficiency has become a critical bottleneck. We're building an integrated hardware-software platform that brings breakthrough performance and usability to production-scale LLM applications.

This is an opportunity to work on a highly technical team spun out of top-tier academic research, focused on the cutting edge of AI, distributed systems, and performance optimization.

What You’ll Do:

  • Drive core research and implementation of performance optimizations for modern AI models
  • Implement advanced techniques like FlashAttention, KV caching, quantization, and model compression
  • Design and build scalable, distributed compute strategies across GPU-based systems
  • Profile, benchmark, and optimize CUDA kernels and AI runtime performance across inference stacks
  • Work across frameworks like PyTorch, ONNX, and vLLM to improve end-to-end efficiency

What We're Looking For:

  • Strong background in CUDA and low-level GPU performance tuning
  • Proven experience building with PyTorch and deploying high-performance ML models
  • Proficiency in Python and C++
  • Experience with large-scale distributed systems in cloud environments (AWS, GCP, or Azure)
  • Exposure to AI compilers or frameworks like MLIR is a plus
  • Interest in system design, scalability, and accelerating LLM workloads in real production environments

If you’ve spent your time making large models faster, leaner, and more efficient—and want to solve hard technical problems at the core of GenAI infrastructure—this role is for you.

Reach out to learn more.

Job Tags

Similar Jobs

Leo Meyer Grain Production

Seasonal Harvest Operator Job at Leo Meyer Grain Production

Job Description Job Description The successful candidate will be part of an engaging family farm team to bring in the 2025 harvest on a grain farm in northern Alberta. As part of the farm team, the candidate will be working to harvest canola, wheat, oats, peas, and...

The LiRo Group

Healthcare Project Manager Job at The LiRo Group

Healthcare Project Manager US-NY-Syosset Job ID: 2024-3024 Type: Regular Full-Time # of Openings: 1 Category: Management The LiRo Group Overview We are currently seeking a Healthcare Project Manager for Nassau County projects. As a leader of Program... 

Pure Freight Lines, LTD.

Dedicated Runs Class A Owner Operator Job at Pure Freight Lines, LTD.

 ...Pure Freight Lines is Looking for Class A Team Owner Operators to Run Dedicated Runs out of Forest Park , IL~ OTR Teams out 3 weeks~ Great Time off~ Gross 12-15 k weekly ~ The tractor needs to be 5 years or newer~ Minimum of 2 years Experience. ~ apply... 

Jobs via Dice

Applications Systems Analyst Sr - Epic MyChart Job at Jobs via Dice

Applications Systems Analyst Sr - Epic MyChart 2 days ago Be among the first 25 applicantsDice is the leading career destination for tech experts at every stage of their careers. Our client, UNC Health Care, is seeking the following. Apply via Dice today!Description... 

HireTalent - Staffing & Recruiting Firm

Technical Writer II Job at HireTalent - Staffing & Recruiting Firm

 ...Job Title: Technical Writer II Location: Swiftwater, PA- 18370 Duration: 6-month contract Responsibilities: Responsible for...  ...required technical documentation. Responsible for technical writing/editing for all types of documentation produced within a modern...