(at Lake Louise, Alberta, Canada) |
Hanshi Sun   孙寒石I am a M.S. student in ECE at Carnegie Mellon University. Now I am working on LLM efficiency with Prof. Beidi Chen. I also work closely with Prof. Andrea Zanette. I obtained my bachelor degree at Southeast University. I was a
member of the PAttern Learning and Mining (PALM)
Lab , where I am fortunate to be advised by Prof. Yi Zhou. Before, I was
a research intern at the University of Alberta, supervised by Prof. Xingyu Li.
@preminstrel
|
[2024/10/01] Our Speculative Rejection has been accepted by NeurIPS 2024! See you in Vancouver!
[2024/07/09] Our TriForce has been accepted by 🦙 COLM 2024! See you in Philadelphia!
[2024/06/03] Work as a MLSys Research Intern in Seed-Foundation team at ByteDance (Seattle Office) .
[2023/11/06] Join Prof. Beidi Chen's group at CMU .
[2023/06/20] Graduate from Southeast University with a bachelor degree .
[2022/11/15] I am working at Apple as an intern in EE/SW team for eight months.
[2022/07/06] I am working as a research intern during summer 2022 in Prof. Xingyu Li’s group at the University of Alberta .
/ . MLSys papers are highlighted.
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
Hanshi Sun, Li-Wen Chang, Wenlei Bao, Size Zheng, Ningxin Zheng, Xin Liu, Harry Dong, Yuejie Chi, and Beidi Chen ArXiv, 2024 arXiv / website / code / bibtex High-Throughput Long-Context LLM Inference System |
|
Fast Best-of-N Decoding via Speculative Rejection
Hanshi Sun*, Momin Haider*, Ruiqi Zhang*, Huitao Yang, Jiahao Qiu, Ming Yin, Mengdi Wang, Peter Bartlett, and Andrea Zanette* (* for core authors) Conference on Neural Information Processing Systems (NeurIPS), 2024 arXiv / website / code / bibtex Fast Inference-time Aligment Algorithm |
|
|
*TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
Hanshi Sun, Zhuoming Chen, Xinyu Yang, Yuandong Tian, and Beidi Chen Conference on Language Modeling (COLM), 2024 arXiv / website / code / demo / bibtex Training-free Lossless Long Sequence Generation Acceleration |
BMAD: Benchmarks for Medical Anomaly Detection
Jinan Bao, Hanshi Sun, Hanqiu Deng, Zhaoxiang Zhang, and Xingyu Li Computer Vision and Pattern Recognition (CVPR) Workshop, 2024 This benchmark encompasses six reorganized datasets from five medical domains (i.e. brain MRI, liver CT, retinal OCT, chest X-ray, and digital histopathology) and three key evaluation metrics, and includes a total of fourteen state-of-the-art AD algorithms. |
|
Carnegie Mellon University, Pittsburgh, United States (Aug 2023 - Dec 2024) M.S. in Electrical & Computer Engineering
|
||
Southeast University, Nanjing, China (Sep 2019 - Jul 2023) B.E. in Electronic Science and Technology
|
|
ByteDance Inc., Seattle, United States (Jun 2024 - Present) Machine Learning System Research Intern in Seed-Foundation Team
|
|
Apple Inc., Shenzhen, China (Nov 2022 - Jun 2023) R&D Intern in iPad System EE
|
|
© Hanshi Sun 2024