Efficient LLM offloading

Pipeline scheduling for mobile GPUs

  • Role
    First author
  • Timeline
    2024
  • Stack
    CUDA, PyTorch, FlexGen, TensorRT

Overview

Research findings

Impact