arxiv:2502.06282
Haiduo Huang
Hhaiduo
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 17 hours ago
Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding upvoted a paper 2 days ago
OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond upvoted a paper about 2 months ago
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation