EmbroiderSnow

Zhaoyuan Bi EmbroiderSnow

A student at Peking University.

Peking University
Beijing, China
EmbroiderSnow/README.md

Hi there 👋

I'm Zhaoyuan Bi, an undergraduate student in Computer Science at Peking University (Class of 2027).

🔭 What I'm working on

I focus on Machine Learning Systems, especially GPU kernel optimization for LLM inference.

Recently, I have been working on:

CUDA kernel development in GGML / Llama.cpp
Optimization of quantization, RoPE, and vecdot kernels
Performance analysis using Nsight (memory access, CPI, bottlenecks)
Improving end-to-end inference throughput

⚙️ Interests

GPU Computing (CUDA)
LLM Inference Optimization
Parallel Algorithms & Memory Optimization
Systems for Machine Learning (MLSys)

🌱 Currently exploring

Parallel primitives (e.g., scan, reduction)
Performance-critical kernel design
Memory-bound optimization in GPU workloads

🛠 Languages & Tools

Popular repositories

MIT-6.828-JOS-DOC-Beautify Public

HTML 1
arap_deformation Public

Lab of Frontiers of Geometric Computation (Spring 2025, PKU)

C++
pointconv_pytorch Public

Reproduction of PointConv: Deep Convolutional Networks on 3D Point Clouds (CVPR 2019).

Python
Point2Mesh-via-SDF Public

Lab of Frontiers of Geometric Computation (Spring 2025, PKU)

Python
RISC-V-Simulator Public

A RISC-V simulator that implements RV-IM.

C
CacheSimulator Public

A simple cache simulator with prefetch.

Python