← all jobs

[Remote] Software Engineer - GPU Kernels

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. Baseten is an innovative company powering AI solutions for leading firms like Notion and OpenEvidence, and they are seeking a GPU Kernel Engineer to enhance AI model performance. This role focuses on designing high-performance GPU kernels and optimizing computation for machine learning operations, directly impacting production systems for millions of users.

Responsibilities

  • Design and implement high-performance GPU kernels for key ML operations, including matrix multiplications, attention mechanisms, and mixture-of-experts routing
  • Write and optimize code using CUDA, PTX assembly, and architecture-specific techniques
  • Apply advanced performance optimization methods such as memory coalescing, warp-level programming, tensor core acceleration, and compute/memory overlap
  • Implement cutting-edge features like quantization (FP8/FP4), sparsity, and compute/communication overlap
  • Identify and resolve performance bottlenecks using tools like Nsight Systems, Nsight Compute, and Torch Profiler
  • Collaborate with research teams to productionize theoretical advancements
  • Contribute to internal and open-source GPU libraries
  • Present technical contributions at industry conferences (e.g., NVIDIA GTC, AWS re:Invent)

Skills

  • Strong understanding of GPU architecture and programming paradigms: Memory hierarchy (global, shared, registers, L1/L2 cache), Thread/block/grid organization, Synchronization techniques and race condition mitigation
  • Proficient in C++ and GPU performance profiling tools
  • Knowledge of: CUDA C++ API, Memory access patterns and bandwidth optimization, Numerical precision and quantization strategies, Modern GPU features (e.g., tensor cores, async operations)
  • Experience with Transformer models and attention optimization (e.g., Flash Attention)
  • Familiarity with GPU kernel libraries: Cutlass, Triton, Thrust, CUB
  • Background in GEMM tuning and distributed/multi-GPU compute
  • Contributions to open-source GPU projects
  • Research publications or conference presentations on GPU performance

Benefits

  • Competitive compensation, including meaningful equity.
  • 100% coverage of medical, dental, and vision insurance for employee and dependents
  • Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
  • Paid parental leave
  • Fertility and family-building stipend through Carrot
  • Company-facilitated 401(k)
  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Company Overview

  • Baseten is an AI infrastructure company that integrates machine learning into business operations, production, and processes. It was founded in 2019, and is headquartered in San Francisco, California, USA, with a workforce of 201-500 employees. Its website is https://www.baseten.co.
  • Company H1B Sponsorship

  • Baseten has a track record of offering H1B sponsorships, with 1 in 2026, 6 in 2025, 8 in 2024, 1 in 2023, 1 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • More open positions

    [Remote] Senior Business Solutions Analyst (Pension Administration)

    Work from home Full-time role

    [Remote] Technical Recruiter

    Work from home Full-time role

    [Remote] Senior Data Engineer (Healthcare Data)

    Work from home Full-time role

    [Remote] Frontend Engineer (US)

    Work from home Full-time role

    [Remote] Solution Architect – Customer Engagements (Strategic Accounts)

    Work from home Full-time role

    Tech Lead, Android Core Product - Champaign, IL, USA

    Work from home Full-time role

    Vice President, M&A and Corporate Advisory

    Work from home Full-time role

    Founding Forward Deployed Engineer

    Work from home Full-time role

    Experienced Customer Success Advocate – Unlock Global Opportunity for Every Person, Team, and Business

    Work from home Full-time role

    Coord.- Pharmacy Care

    Work from home Full-time role

    Manager, Workforce Management Systems & Support

    Work from home Full-time role

    Senior Concept Artist - Treyarch (Los Angeles) [Remote]

    Work from home Full-time role

    Change Management Project Manager (Global Supply Chain)

    Work from home Full-time role

    Project Engagement Manager - SaaS

    Work from home Full-time role

    [Remote] Sales & Customer Service Officer (Remote – Night Shift)

    Work from home Full-time role

    [Remote] Customer Marketing Lead

    Work from home Full-time role

    Remote US Call Center Customer Support Agent – Healthcare Services – Inbound Provider Inquiries & Member Assistance

    Work from home Full-time role

    Specialist, logistics- weekends

    Work from home Full-time role

    Principal Competitive Intelligence Analyst – Automation & SOC AI (Cortex Agentix)

    Work from home Full-time role

    Director, Social Media

    Work from home Full-time role

    Application Security Engineer

    Work from home Full-time role