Publications
You can also find my articles on my Google Scholar profile.
Hossein Entezari Zarch, Lei Gao, Chaoyi Jiang, Murali Annavaram “DELTA: Dynamic Layer-Aware Token Attention for Efficient Long-Context Reasoning.” (under review). [Link]
Hossein Entezari Zarch*, Lei Gao*, Chaoyi Jiang, Murali Annavaram “DEL: Context-Aware Dynamic Exit Layer for Efficient Self-Speculative Decoding.” Proceedings of the Conference on Language Modeling (COLM) 2025. [Link]
Chaoyi Jiang, Lei Gao, Hossein Entezari Zarch, Murali Annavaram “KVPR: Efficient LLM Inference with I/O-Aware KV Cache Partial Recomputation.” Findings of the Association for Computational Linguistics (ACL) 2025. [Link]
Chaoyi Jiang, Sungwoo Kim, Lei Gao, Hossein Entezari Zarch, Won Woo Ro, Murali Annavaram “MARché: Fast Masked Autoregressive Image Generation with Cache-Aware Attention.” (under review). [Link]
Arun Ramachandran, R. Govindarajan, Prakash Raghavendra, Murali Annavaram, Hossein Entezari Zarch, Chaoyi Jiang, Lei Gao “Balancing Memory and Compute (BMC) of Attention Blocks: An Effective Technique for Speculative LLM Inferencing.” (under review).
Hossein Entezari Zarch, Abdulla Alshabanah, Chaoyi Jiang, Murali Annavaram “CADC: Encoding User-Item Interactions for Compressing Recommendation Model Training Data.” ACM RecSys Workshop, 2024. [Link]
Chaoyi Jiang*, Abdulla Alshabanah*, Hossein Entezari Zarch, Keshav Balasubramanian, Murali Annavaram “HuffmanEmbed: Using Huffman Coding for Embedding Table Compression in Deep Learning Recommendation Models.” EuroSys 2025 Poster. [Link]
Milad Soltany*, Hesam Mojtahedi*, Hossein Entezari Zarch*, Amirhossein Kazerouni*, Alireza Morsali, Azra Abtahi, Farokh Marvasti “Ensemble Neural Representation Networks.” arXiv:2110.04124. [Link]
Seyed Masoud Rezaeijo, Hossein Entezari Zarch, Hesam Mojtahedi, Nahid Chegeni, Amir Danyaei “Feasibility Study of Synthetic DW-MR Images Using GANs.” Applied Magnetic Resonance, 2022. [Link]
Seyed Masoud Rezaeijo, Mohammadreza Ghorvei, Razzagh Abedi-Firouzjah, Hesam Mojtahedi, Hossein Entezari Zarch “Detecting COVID-19 in Chest Images via Transfer Learning.” Egyptian Journal of Radiology and Nuclear Medicine, 2021. [Link]