CB-HVT Net: A Channel-Boosted Hybrid Vision Transformer Network for Lymphocyte Detection in Histopathological Images

Published in IEEE Access, 2023

This study presents a novel Channel Boosted Hybrid Vision Transformer (CB-HVT) architecture for detecting lymphocytes in histopathological images. By combining the local feature learning capabilities of CNNs with the global contextual awareness of Vision Transformers, the model effectively addresses challenges such as overlapping boundaries, artifacts, and morphological diversity of lymphocytes. The network incorporates multiple specialized modules, including channel generation, exploitation, and merging components, alongside a region-aware attention mechanism and a detection head. A feature fusion block with attention enhances discriminative learning. The CB-HVT was evaluated on two benchmark datasets (LYSTO and NuClick), achieving F-Scores of 0.88 and 0.82 respectively, and demonstrated robust performance on unseen test sets—highlighting its potential for real-time clinical application in pathology.

Gesture Detection Architecture Figure 1: Overall and detailed workflow of the proposed CB-HVT Net. The training and testing phases of the proposed approach are also elaborated.

Recommended citation: M. L. Ali, Z. Rauf, A. Khan, A. Sohail, R. Ullah and J. Gwak, "CB-HVT Net: A Channel-Boosted Hybrid Vision Transformer Network for Lymphocyte Detection in Histopathological Images," in IEEE Access, vol. 11, pp. 115740-115750, 2023, doi: 10.1109/ACCESS.2023.3324383
Download Paper