Abstract: In the field of autonomous driving, 3-D object detection is a crucial technology. Visual sensors are essential in this area and are widely used for 3-D object detection tasks. Recent ...
Abstract: Over the past few years especially in the context of communication and information processing the importance of Natural language processing which demands efficient deep learning models has ...
Abstract: Hybrid Mamba-Transformer networks have recently garnered broad attention. These networks can leverage the scalability of Transformers while capitalizing on Mamba’s strengths in long-context ...
Abstract: We present a simple approach to make pre-trained Vision Transformers (ViTs) interpretable for fine-grained analysis, aiming to identify and localize the traits that distinguish visually ...
Abstract: Vision Transformer (ViT) is an image recognition model that uses transformer architecture, which has a numerous advantage over Convolution Neural Networks (CNN). It offers improved accuracy, ...
Abstract: Skin cancer is a common cancer, and early detection is vital for effective treatment. Automated skin cancer segmentation, especially through deep learning models, has gained significant ...
Abstract: In recent years, the complementary advantages of convolutional neural networks (CNNs) and Transformers have been utilized to achieve significant results in image classification tasks.
Abstract: Vision transformers have demonstrated remarkable performance in hyperspectral image classification tasks. However, their complex computational mechanisms and excessive parameterization ...
Abstract: Skin cancer is a serious health concern globally, and melanoma is the cause of most deaths as it is usually diagnosed too late or misdiagnosed. Early diagnosis of skin lesion identifications ...
Abstract: High-mobility scenarios in next-generation wireless networks, such as those involving vehicular communications, require ultra-reliable and low-latency communications (URLLC). However, ...
Abstract: Accurate brain tumor detection is vital for effective diagnosis and treatment. This paper presents ViRCNN, a hybrid model that integrates Faster R-CNN and Vision Transformers (ViT) to ...
Abstract: Accurately detecting human attention levels is a key challenge in cognitive neuroscience, with broad application value in improving productivity. Although Electroencephalography (EEG) ...