GPU / CUDA Engineers (Multiple Openings)

Greylock Partners
3 months ago
San Francisco, CA, United States
On-site
Full-time
Junior Level (1-3 years)

Job Description

Several of our growth-stage investments in San Francisco, CA are looking for experts in GPU optimization and inference acceleration.

In general, these are the responsibilities:

  • Focus primarily on GPGPU programming to increase product performance: writing, debugging, and optimizing CUDA code from the GPU kernel level upward to improve the end-to-end performance of new AI models
  • Play a key role in creating the tooling and associated infrastructure that increase the performance of the company, ranging from fairly straightforward projects (profilers) to highly complex ones (new inference engines)

In general, these are the expectations:

  • Proven background in CPU acceleration and/or GPU optimization (the latter preferred), with a strong preference for candidates who have expertise in CUDA kernel hacking
  • Experience working in deep learning environments and/or on products targeting high-performance ML systems
  • Strong coding skills in high-performance environments (C/C++)

About Us:

Greylock is an early-stage investor in hundreds of remarkable companies, including Airbnb, LinkedIn, Dropbox, Workday, Cloudera, Facebook, Instagram, Roblox, Coinbase, and Palo Alto Networks. You can learn more about us here: https://greylock.com/

How We Work:

We are full-time, salaried employees of Greylock and provide free candidate referrals and introductions to our active investments. We will promptly contact anyone who looks like a potential match to schedule a call.

Due to the selective nature of this service and the volume of applicants we typically receive from our job postings, a follow-up email will not be sent until a match is identified with one of our investments.

Please note: We are not recruiting for any roles within Greylock at this time. This job posting is for direct employment with a startup in our portfolio.

Required Skills

High-performance ML systems
C/C++ coding
Deep learning
CPU acceleration
CUDA
GPGPU programming
GPU optimization