School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen
*Equal contribution †Corresponding author
*Equal contribution †Corresponding author
- [07/2024] Arxiv paper released.
If you find this work useful for your research, please kindly cite our paper:
@misc{zhang2024token,
title={Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding},
author={Renshan Zhang, Yibo Lyu, Rui Shao, Gongwei Chen, Weili Guan and Liqiang Nie},
year={2024},
eprint={2407.14439},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2407.14439},
}
