I am a PhD student in Computer Science at Yonsei University, where I conduct research in the Artificial Intelligence and Information Systems Lab under the supervision of Professor Albert No. My research focuses on efficient large language models, with particular interest in quantization and low-rank adaptation. I am broadly interested in model compression, and my work revisits assumptions that have long been treated as standard practice.

News

Jul 2026
One paper was accepted to ICML 2026.
Apr 2026
One paper was accepted to ICLR 2026.
Jun 2025
One paper was accepted to ICML 2025 TTODLer-FM Workshop as an oral.
May 2025
One paper was accepted to ACL 2025 Findings.

Selected Publications

Preserve-Then-Quantize: Balancing Rank Budgets for Quantization Error Reconstruction in LLMs
[arXiv]
Assigning Distinct Roles to Quantized and Low-Rank Matrices Toward Optimal Weight Decomposition
[arXiv]

* equal contribution    corresponding author