TAB: Transformer Attention Bottlenecks enable User Intervention and Debugging in Vision-Language Models
Published in IEEE/CVF International Conference on Computer Vision (ICCV), 2025
Recommended citation: Rahmanzadehgervi, P., Nguyen, H.H., Liu, R., Mai, L. and Nguyen, A.T., 2024. TAB: Transformer Attention Bottlenecks enable User Intervention and Debugging in Vision-Language Models. arXiv preprint arXiv:2412.18675. https://arxiv.org/pdf/2412.18675