Skip to content
View LwhJesse's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report LwhJesse

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
LwhJesse/README.md

Jesse signature banner Jesse signature banner

GPU Numerical Linear Algebra · Scientific Computing · Open Source Systems

CUDA · Sparse Solvers · CFD Linear Algebra · Linux Graphics · Arch Linux


About

I am an Engineering Mechanics undergraduate working on GPU numerical linear algebra, scientific computing, and open-source systems.

My recent work focuses on CUDA performance engineering, sparse matrix operations, CFD linear algebra infrastructure, and graphics-pipeline debugging, especially where numerical methods meet low-level system behavior.

Current Focus

  • GPU performance engineering — CUDA data movement, sparse matrix-vector products, batched GEMM paths, and solver-side performance.
  • Sparse solver infrastructure — cuSPARSE, cuBLAS, Krylov-solver data flow, and CFD linear algebra paths.
  • Solver correctness and validation — block-sparse matrix operations, minimal numerical counterexamples, and reproducible correctness tests.
  • Scientific computing — numerical methods, nonlinear mechanics, FEM validation, and simulation reliability.

Selected Work

  • SU2 CUDA linear algebra path
    Reducing redundant Jacobian uploads during linear solves and investigating CUDA block-sparse matvec correctness in the GPU solver path.

  • CUDA sparse solver optimization
    Reusing cuSPARSE SpMV preprocessing in amgcl's CUDA CSR backend to reduce repeated CSR partition/preprocessing overhead in iterative solves.

  • ArrayFire CUDA batched GEMM
    Adding a strided-batched GEMM fast path for compatible batch layouts to avoid pointer-array setup and host-to-device pointer copies.

  • CUTLASS runtime datatype mapping
    Improving runtime datatype mapping paths in CUTLASS library tooling.

  • Hyprland ICC / blur rendering investigation
    Debugging ICC-enabled blur transparency and color-pipeline interactions in the compositor render path.

  • Nonlinear beam deflection computation
    Numerical calculation and FEM validation for the failure boundary of linear beam theory under large deflection.

Technical Stack

C++ · CUDA · Python · Linux · Arch Linux

cuSPARSE · cuBLAS · SpMV · Batched GEMM · Krylov Solvers · Sparse Linear Algebra

SU2 · amgcl · ArrayFire · CUTLASS · OpenSees

GitHub Activity

GitHub profile details GitHub profile details
External contribution languages External contribution languages Own repository languages Own repository languages

Jesse signature Jesse signature

Pinned Loading

  1. Euler-Elastica-Py Euler-Elastica-Py Public

    A highly robust Python framework for geometrically nonlinear beams (Euler Elastica)

    Python 1

  2. su2code/SU2 su2code/SU2 Public

    SU2: An Open-Source Suite for Multiphysics Simulation and Design

    C++ 1.7k 977

  3. NVIDIA/open-gpu-kernel-modules NVIDIA/open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C 17k 1.7k

  4. NVIDIA/cutlass NVIDIA/cutlass Public

    CUDA Templates and Python DSLs for High-Performance Linear Algebra

    C++ 9.7k 1.9k

  5. Binance-Ultra-HFT Binance-Ultra-HFT Public

    ⚡ Bare-metal Binance tick processor: zero-copy DMA, lockless pipelines, C++/CUDA, microsecond latency.

    C++ 1

  6. orthographic-blockcode orthographic-blockcode Public

    Experimental toolkit for evaluating and searching orthographic block-code text-entry mappings.

    Python 1