Author: "Naseer, Muzammal" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Naseer, Muzammal"' showing total 81 results

Start Over Author "Naseer, Muzammal"

81 results on '"Naseer, Muzammal"'

51. On Generating Transferable Targeted Perturbations

Author: Naseer, Muzammal, Khan, Salman, Hayat, Munawar, Khan, Fahad Shahbaz, and Porikli, Fatih
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: While the untargeted black-box transferability of adversarial perturbations has been extensively studied before, changing an unseen model's decisions to a specific `targeted' class remains a challenging feat. In this paper, we propose a new generative approach for highly transferable targeted perturbations (\ours). We note that the existing methods are less suitable for this task due to their reliance on class-boundary information that changes from one model to another, thus reducing transferability. In contrast, our approach matches the perturbed image `distribution' with that of the target class, leading to high targeted transferability rates. To this end, we propose a new objective function that not only aligns the global distributions of source and target images, but also matches the local neighbourhood structure between the two domains. Based on the proposed objective, we train a generator function that can adaptively synthesize perturbations specific to a given input. Our generative approach is independent of the source or target domain labels, while consistently performs well against state-of-the-art methods on a wide range of attack settings. As an example, we achieve $32.63\%$ target transferability from (an adversarially weak) VGG19$_{BN}$ to (a strong) WideResNet on ImageNet val. set, which is 4$\times$ higher than the previous best generative attack and 16$\times$ better than instance-specific iterative attack. Code is available at: {\small\url{https://github.com/Muzammal-Naseer/TTP}}., Comment: ICCV, 2021. Code is available at https://github.com/Muzammal-Naseer/TTP
Published: 2021

52. Orthogonal Projection Loss

Author: Ranasinghe, Kanchana, Naseer, Muzammal, Hayat, Munawar, Khan, Salman, and Khan, Fahad Shahbaz
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Deep neural networks have achieved remarkable performance on a range of classification tasks, with softmax cross-entropy (CE) loss emerging as the de-facto objective function. The CE loss encourages features of a class to have a higher projection score on the true class-vector compared to the negative classes. However, this is a relative constraint and does not explicitly force different class features to be well-separated. Motivated by the observation that ground-truth class representations in CE loss are orthogonal (one-hot encoded vectors), we develop a novel loss function termed `Orthogonal Projection Loss' (OPL) which imposes orthogonality in the feature space. OPL augments the properties of CE loss and directly enforces inter-class separation alongside intra-class clustering in the feature space through orthogonality constraints on the mini-batch level. As compared to other alternatives of CE, OPL offers unique advantages e.g., no additional learnable parameters, does not require careful negative mining and is not sensitive to the batch size. Given the plug-and-play nature of OPL, we evaluate it on a diverse range of tasks including image recognition (CIFAR-100), large-scale classification (ImageNet), domain generalization (PACS) and few-shot learning (miniImageNet, CIFAR-FS, tiered-ImageNet and Meta-dataset) and demonstrate its effectiveness across the board. Furthermore, OPL offers better robustness against practical nuisances such as adversarial attacks and label noise. Code is available at: https://github.com/kahnchana/opl.
Published: 2021

53. Transformers in Vision: A Survey

Author: Khan, Salman, Naseer, Muzammal, Hayat, Munawar, Zamir, Syed Waqas, Khan, Fahad Shahbaz, and Shah, Mubarak
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Astounding results from Transformer models on natural language tasks have intrigued the vision community to study their application to computer vision problems. Among their salient benefits, Transformers enable modeling long dependencies between input sequence elements and support parallel processing of sequence as compared to recurrent networks e.g., Long short-term memory (LSTM). Different from convolutional networks, Transformers require minimal inductive biases for their design and are naturally suited as set-functions. Furthermore, the straightforward design of Transformers allows processing multiple modalities (e.g., images, videos, text and speech) using similar processing blocks and demonstrates excellent scalability to very large capacity networks and huge datasets. These strengths have led to exciting progress on a number of vision tasks using Transformer networks. This survey aims to provide a comprehensive overview of the Transformer models in the computer vision discipline. We start with an introduction to fundamental concepts behind the success of Transformers i.e., self-attention, large-scale pre-training, and bidirectional encoding. We then cover extensive applications of transformers in vision including popular recognition tasks (e.g., image classification, object detection, action recognition, and segmentation), generative modeling, multi-modal tasks (e.g., visual-question answering, visual reasoning, and visual grounding), video processing (e.g., activity recognition, video forecasting), low-level vision (e.g., image super-resolution, image enhancement, and colorization) and 3D analysis (e.g., point cloud classification and segmentation). We compare the respective advantages and limitations of popular techniques both in terms of architectural design and their experimental value. Finally, we provide an analysis on open research directions and possible future works., Comment: 30 pages (Accepted in ACM Computing Surveys December 2021)
Published: 2021
Full Text: View/download PDF

54. Self-distilled Vision Transformer for Domain Generalization

Author: Sultana, Maryam, Naseer, Muzammal, Khan, Muhammad Haris, Khan, Salman, Khan, Fahad Shahbaz, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Wang, Lei, editor, Gall, Juergen, editor, Chin, Tat-Jun, editor, Sato, Imari, editor, and Chellappa, Rama, editor
Published: 2023
Full Text: View/download PDF

55. Stylized Adversarial Defense

Author: Naseer, Muzammal, Khan, Salman, Hayat, Munawar, Khan, Fahad Shahbaz, and Porikli, Fatih
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Deep Convolution Neural Networks (CNNs) can easily be fooled by subtle, imperceptible changes to the input images. To address this vulnerability, adversarial training creates perturbation patterns and includes them in the training set to robustify the model. In contrast to existing adversarial training methods that only use class-boundary information (e.g., using a cross-entropy loss), we propose to exploit additional information from the feature space to craft stronger adversaries that are in turn used to learn a robust model. Specifically, we use the style and content information of the target sample from another class, alongside its class-boundary information to create adversarial perturbations. We apply our proposed multi-task objective in a deeply supervised manner, extracting multi-scale feature knowledge to create maximally separating adversaries. Subsequently, we propose a max-margin adversarial training approach that minimizes the distance between source image and its adversary and maximizes the distance between the adversary and the target image. Our adversarial training approach demonstrates strong robustness compared to state-of-the-art defenses, generalizes well to naturally occurring corruptions and data distributional shifts, and retains the model accuracy on clean examples., Comment: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
Published: 2020

56. A Self-supervised Approach for Adversarial Robustness

Author: Naseer, Muzammal, Khan, Salman, Hayat, Munawar, Khan, Fahad Shahbaz, and Porikli, Fatih
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Adversarial examples can cause catastrophic mistakes in Deep Neural Network (DNNs) based vision systems e.g., for classification, segmentation and object detection. The vulnerability of DNNs against such attacks can prove a major roadblock towards their real-world deployment. Transferability of adversarial examples demand generalizable defenses that can provide cross-task protection. Adversarial training that enhances robustness by modifying target model's parameters lacks such generalizability. On the other hand, different input processing based defenses fall short in the face of continuously evolving attacks. In this paper, we take the first step to combine the benefits of both approaches and propose a self-supervised adversarial training mechanism in the input space. By design, our defense is a generalizable approach and provides significant robustness against the \textbf{unseen} adversarial attacks (\eg by reducing the success rate of translation-invariant \textbf{ensemble} attack from 82.6\% to 31.9\% in comparison to previous state-of-the-art). It can be deployed as a plug-and-play solution to protect a variety of vision systems, as we demonstrate for the case of classification, segmentation and detection. Code is available at: {\small\url{https://github.com/Muzammal-Naseer/NRP}}., Comment: CVPR-2020 (Oral). Code this http https://github.com/Muzammal-Naseer/NRP}
Published: 2020

57. Cross-Domain Transferability of Adversarial Perturbations

Author: Naseer, Muzammal, Khan, Salman H., Khan, Harris, Khan, Fahad Shahbaz, and Porikli, Fatih
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Adversarial examples reveal the blind spots of deep neural networks (DNNs) and represent a major concern for security-critical applications. The transferability of adversarial examples makes real-world attacks possible in black-box settings, where the attacker is forbidden to access the internal parameters of the model. The underlying assumption in most adversary generation methods, whether learning an instance-specific or an instance-agnostic perturbation, is the direct or indirect reliance on the original domain-specific data distribution. In this work, for the first time, we demonstrate the existence of domain-invariant adversaries, thereby showing common adversarial space among different datasets and models. To this end, we propose a framework capable of launching highly transferable attacks that crafts adversarial patterns to mislead networks trained on wholly different domains. For instance, an adversarial function learned on Paintings, Cartoons or Medical images can successfully perturb ImageNet samples to fool the classifier, with success rates as high as $\sim$99\% ($\ell_{\infty} \le 10$). The core of our proposed adversarial function is a generative network that is trained using a relativistic supervisory signal that enables domain-invariant perturbations. Our approach sets the new state-of-the-art for fooling rates, both under the white-box and black-box scenarios. Furthermore, despite being an instance-agnostic perturbation function, our attack outperforms the conventionally much stronger instance-specific attack methods., Comment: Accepted at NeurIPS 2019 (Camera Ready). Source Code along with pretrained adversarial generators is available at https://github.com/Muzammal-Naseer/Cross-domain-perturbations
Published: 2019

58. Self-distilled Vision Transformer for Domain Generalization

Author: Sultana, Maryam, primary, Naseer, Muzammal, additional, Khan, Muhammad Haris, additional, Khan, Salman, additional, and Khan, Fahad Shahbaz, additional
Published: 2023
Full Text: View/download PDF

59. Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation

Author: Hanif, Asif, primary, Naseer, Muzammal, additional, Khan, Salman, additional, Shah, Mubarak, additional, and Khan, Fahad Shahbaz, additional
Published: 2023
Full Text: View/download PDF

60. Task-generalizable Adversarial Attack based on Perceptual Metric

Author: Naseer, Muzammal, Khan, Salman H., Rahman, Shafin, and Porikli, Fatih
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Deep neural networks (DNNs) can be easily fooled by adding human imperceptible perturbations to the images. These perturbed images are known as `adversarial examples' and pose a serious threat to security and safety critical systems. A litmus test for the strength of adversarial examples is their transferability across different DNN models in a black box setting (i.e. when the target model's architecture and parameters are not known to attacker). Current attack algorithms that seek to enhance adversarial transferability work on the decision level i.e. generate perturbations that alter the network decisions. This leads to two key limitations: (a) An attack is dependent on the task-specific loss function (e.g. softmax cross-entropy for object recognition) and therefore does not generalize beyond its original task. (b) The adversarial examples are specific to the network architecture and demonstrate poor transferability to other network architectures. We propose a novel approach to create adversarial examples that can broadly fool different networks on multiple tasks. Our approach is based on the following intuition: "Perpetual metrics based on neural network features are highly generalizable and show excellent performance in measuring and stabilizing input distortions. Therefore an ideal attack that creates maximum distortions in the network feature space should realize highly transferable examples". We report extensive experiments to show how adversarial examples generalize across multiple networks for classification, object detection and segmentation tasks.
Published: 2018

61. Local Gradients Smoothing: Defense against localized adversarial attacks

Author: Naseer, Muzammal, Khan, Salman H., and Porikli, Fatih
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Deep neural networks (DNNs) have shown vulnerability to adversarial attacks, i.e., carefully perturbed inputs designed to mislead the network at inference time. Recently introduced localized attacks, Localized and Visible Adversarial Noise (LaVAN) and Adversarial patch, pose a new challenge to deep learning security by adding adversarial noise only within a specific region without affecting the salient objects in an image. Driven by the observation that such attacks introduce concentrated high-frequency changes at a particular image location, we have developed an effective method to estimate noise location in gradient domain and transform those high activation regions caused by adversarial noise in image domain while having minimal effect on the salient object that is important for correct classification. Our proposed Local Gradients Smoothing (LGS) scheme achieves this by regularizing gradients in the estimated noisy region before feeding the image to DNN for inference. We have shown the effectiveness of our method in comparison to other defense methods including Digital Watermarking, JPEG compression, Total Variance Minimization (TVM) and Feature squeezing on ImageNet dataset. In addition, we systematically study the robustness of the proposed defense mechanism against Back Pass Differentiable Approximation (BPDA), a state of the art attack recently developed to break defenses that transform an input sample to minimize the adversarial effect. Compared to other defense mechanisms, LGS is by far the most resistant to BPDA in localized adversarial attack setting., Comment: Accepted At WACV-2019
Published: 2018

62. Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey

Author: Naseer, Muzammal, Khan, Salman H, and Porikli, Fatih
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: With the availability of low-cost and compact 2.5/3D visual sensing devices, computer vision community is experiencing a growing interest in visual scene understanding of indoor environments. This survey paper provides a comprehensive background to this research topic. We begin with a historical perspective, followed by popular 3D data representations and a comparative analysis of available datasets. Before delving into the application specific details, this survey provides a succinct introduction to the core technologies that are the underlying methods extensively used in the literature. Afterwards, we review the developed techniques according to a taxonomy based on the scene understanding tasks. This covers holistic indoor scene understanding as well as subtasks such as scene classification, object detection, pose estimation, semantic segmentation, 3D reconstruction, saliency detection, physics-based reasoning and affordance prediction. Later on, we summarize the performance metrics used for evaluation in different tasks and a quantitative comparison among the recent state-of-the-art techniques. We conclude this review with the current challenges and an outlook towards the open research problems requiring further investigation., Comment: IEEE Access
Published: 2018
Full Text: View/download PDF

63. Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting

Author: Wasim, Syed Talal, primary, Naseer, Muzammal, additional, Khan, Salman, additional, Khan, Fahad Shahbaz, additional, and Shah, Mubarak, additional
Published: 2023
Full Text: View/download PDF

64. PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery

Author: Zhang, Sheng, primary, Khan, Salman, additional, Shen, Zhiqiang, additional, Naseer, Muzammal, additional, Chen, Guangyi, additional, and Khan, Fahad Shahbaz, additional
Published: 2023
Full Text: View/download PDF

65. CLIP2Protect: Protecting Facial Privacy Using Text-Guided Makeup via Adversarial Latent Search

Author: Shamshad, Fahad, primary, Naseer, Muzammal, additional, and Nandakumar, Karthik, additional
Published: 2023
Full Text: View/download PDF

66. PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery

Author: Zhang, Sheng, Khan, Salman, Shen, Zhiqiang, Naseer, Muzammal, Chen, Guangyi, Khan, Fahad, Zhang, Sheng, Khan, Salman, Shen, Zhiqiang, Naseer, Muzammal, Chen, Guangyi, and Khan, Fahad
Abstract: Although existing semi-supervised learning models achieve remarkable success in learning with unannotated in-distribution data, they mostly fail to learn on unlabeled data sampled from novel semantic classes due to their closed-set assumption. In this work, we target a pragmatic but under-explored Generalized Novel Category Discovery (GNCD) setting. The GNCD setting aims to categorize unlabeled training data coming from known and novel classes by leveraging the information of partially labeled known classes. We propose a two-stage Contrastive Affinity Learning method with auxiliary visual Prompts, dubbed PromptCAL, to address this challenging problem. Our approach discovers reliable pairwise sample affinities to learn better semantic clustering of both known and novel classes for the class token and visual prompts. First, we propose a discriminative prompt regularization loss to reinforce semantic discriminativeness of prompt-adapted pre-trained vision transformer for refined affinity relationships. Besides, we propose contrastive affinity learning to calibrate semantic representations based on our iterative semi-supervised affinity graph generation method for semantically-enhanced supervision. Extensive experimental evaluation demonstrates that our PromptCAL method is more effective in discovering novel classes even with limited annotations and surpasses the current state-of-the-art on generic and fine-grained benchmarks (e.g., with nearly 11% gain on CUB-200, and 9% on ImageNet-100) on overall accuracy. Our code is available at https: // github.com/ sheng- eatamath / PromptCAL.
Published: 2023
Full Text: View/download PDF

67. Stylized Adversarial Defense

Author: Naseer, Muzammal, Khan, Salman, Hayat, Munawar, Khan, Fahad, Porikli, Fatih, Naseer, Muzammal, Khan, Salman, Hayat, Munawar, Khan, Fahad, and Porikli, Fatih
Abstract: Deep Convolution Neural Networks (CNNs) can easily be fooled by subtle, imperceptible changes to the input images. To address this vulnerability, adversarial training creates perturbation patterns and includes them in the training set to robustify the model. In contrast to existing adversarial training methods that only use class-boundary information (e.g., using a cross-entropy loss), we propose to exploit additional information from the feature space to craft stronger adversaries that are in turn used to learn a robust model. Specifically, we use the style and content information of the target sample from another class, alongside its class-boundary information to create adversarial perturbations. We apply our proposed multi-task objective in a deeply supervised manner, extracting multi-scale feature knowledge to create maximally separating adversaries. Subsequently, we propose a max-margin adversarial training approach that minimizes the distance between source image and its adversary and maximizes the distance between the adversary and the target image. Our adversarial training approach demonstrates strong robustness compared to state-of-the-art defenses, generalizes well to naturally occurring corruptions and data distributional shifts, and retains the models accuracy on clean examples.
Published: 2023
Full Text: View/download PDF

68. GeoChat: Grounded Large Vision-Language Model for Remote Sensing

Author: Kuckreja, Kartik, Danish, Muhammad Sohail, Naseer, Muzammal, Das, Abhijit, Khan, Salman, Khan, Fahad Shahbaz, Kuckreja, Kartik, Danish, Muhammad Sohail, Naseer, Muzammal, Das, Abhijit, Khan, Salman, and Khan, Fahad Shahbaz
Abstract: Recent advancements in Large Vision-Language Models (VLMs) have shown great promise in natural image domains, allowing users to hold a dialogue about given visual content. However, such general-domain VLMs perform poorly for Remote Sensing (RS) scenarios, leading to inaccurate or fabricated information when presented with RS domain-specific queries. Such a behavior emerges due to the unique challenges introduced by RS imagery. For example, to handle high-resolution RS imagery with diverse scale changes across categories and many small objects, region-level reasoning is necessary alongside holistic scene interpretation. Furthermore, the lack of domain-specific multimodal instruction following data as well as strong backbone models for RS make it hard for the models to align their behavior with user queries. To address these limitations, we propose GeoChat - the first versatile remote sensing VLM that offers multitask conversational capabilities with high-resolution RS images. Specifically, GeoChat can not only answer image-level queries but also accepts region inputs to hold region-specific dialogue. Furthermore, it can visually ground objects in its responses by referring to their spatial coordinates. To address the lack of domain-specific datasets, we generate a novel RS multimodal instruction-following dataset by extending image-text pairs from existing diverse RS datasets. We establish a comprehensive benchmark for RS multitask conversations and compare with a number of baseline methods. GeoChat demonstrates robust zero-shot performance on various RS tasks, e.g., image and region captioning, visual question answering, scene classification, visually grounded conversations and referring detection. Our code is available at https://github.com/mbzuai-oryx/geochat., Comment: 10 pages, 4 figures
Published: 2023

69. Guidance Through Surrogate: Toward a Generic Diagnostic Attack

Author: Naseer, Muzammal, Khan, Salman, Porikli, Fatih, and Khan, Fahad Shahbaz
Abstract: Adversarial training (AT) is an effective approach to making deep neural networks robust against adversarial attacks. Recently, different AT defenses are proposed that not only maintain a high clean accuracy but also show significant robustness against popular and well-studied adversarial attacks, such as projected gradient descent (PGD). High adversarial robustness can also arise if an attack fails to find adversarial gradient directions, a phenomenon known as “gradient masking.” In this work, we analyze the effect of label smoothing on AT as one of the potential causes of gradient masking. We then develop a guided mechanism to avoid local minima during attack optimization, leading to a novel attack dubbed guided projected gradient attack (G-PGA). Our attack approach is based on a “match and deceive” loss that finds optimal adversarial directions through guidance from a surrogate model. Our modified attack does not require random restarts a large number of attack iterations or a search for optimal step size. Furthermore, our proposed G-PGA is generic, thus it can be combined with an ensemble attack strategy as we demonstrate in the case of auto-attack, leading to efficiency and convergence speed improvements. More than an effective attack, G-PGA can be used as a diagnostic tool to reveal elusive robustness due to gradient masking in adversarial defenses.
Published: 2024
Full Text: View/download PDF

70. Self-supervised Video Transformer

Author: Ranasinghe, Kanchana, Naseer, Muzammal, Khan, Salman, Khan, Fahad Shahbaz, and Ryoo, Michael
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we propose self-supervised training for video transformers using unlabeled video data. From a given video, we create local and global spatiotemporal views with varying spatial sizes and frame rates. Our self-supervised objective seeks to match the features of these different views representing the same video, to be invariant to spatiotemporal variations in actions. To the best of our knowledge, the proposed approach is the first to alleviate the dependency on negative samples or dedicated memory banks in Self-supervised Video Transformer (SVT). Further, owing to the flexibility of Transformer models, SVT supports slow-fast video processing within a single architecture using dynamically adjusted positional encoding and supports long-term relationship modeling along spatiotemporal dimensions. Our approach performs well on four action recognition benchmarks (Kinetics-400, UCF-101, HMDB-51, and SSv2) and converges faster with small batch sizes. Code: https://git.io/J1juJ, Accepted to CVPR '22
Published: 2022
Full Text: View/download PDF

71. Transformers in Vision: A Survey

Author: Khan, Salman, Naseer, Muzammal, Hayat, Munawar, Zamir, Syed Waqas, Khan, Fahad, Shah, Mubarak, Khan, Salman, Naseer, Muzammal, Hayat, Munawar, Zamir, Syed Waqas, Khan, Fahad, and Shah, Mubarak
Abstract: Astounding results from Transformer models on natural language tasks have intrigued the vision community to study their application to computer vision problems. Among their salient benefits, Transformers enable modeling long dependencies between input sequence elements and support parallel processing of sequence as compared to recurrent networks, e.g., Long short-term memory. Different from convolutional networks, Transformers require minimal inductive biases for their design and are naturally suited as set-functions. Furthermore, the straightforward design of Transformers allows processing multiple modalities (e.g., images, videos, text, and speech) using similar processing blocks and demonstrates excellent scalability to very large capacity networks and huge datasets. These strengths have led to exciting progress on a number of vision tasks using Transformer networks. This survey aims to provide a comprehensive overview of the Transformer models in the computer vision discipline. We start with an introduction to fundamental concepts behind the success of Transformers, i.e., self-attention, large-scale pre-training, and bidirectional feature encoding. We then cover extensive applications of transformers in vision including popular recognition tasks (e.g., image classification, object detection, action recognition, and segmentation), generative modeling, multi-modal tasks (e.g., visual-question answering, visual reasoning, and visual grounding), video processing (e.g., activity recognition, video forecasting), low-level vision (e.g., image super-resolution, image enhancement, and colorization), and three-dimensional analysis (e.g., point cloud classification and segmentation). We compare the respective advantages and limitations of popular techniques both in terms of architectural design and their experimental value. Finally, we provide an analysis on open research directions and possible future works. We hope this effort will ignite further interest in the communit
Published: 2022
Full Text: View/download PDF

72. ON IMPROVING ADVERSARIAL TRANSFERABILITY OF VISION TRANSFORMERS

Author: Naseer, Muzammal, Ranasinghe, Kanchana, Khan, Salman, Khan, Fahad Shahbaz, Porikli, Fatih, Naseer, Muzammal, Ranasinghe, Kanchana, Khan, Salman, Khan, Fahad Shahbaz, and Porikli, Fatih
Published: 2022

73. Transformers in Vision: A Survey

Author: Khan, Salman, primary, Naseer, Muzammal, additional, Hayat, Munawar, additional, Zamir, Syed Waqas, additional, Khan, Fahad Shahbaz, additional, and Shah, Mubarak, additional
Published: 2022
Full Text: View/download PDF

74. Stylized Adversarial Defense

Author: Naseer, Muzammal, primary, Khan, Salman, additional, Hayat, Munawar, additional, Khan, Fahad Shahbaz, additional, and Porikli, Fatih, additional
Published: 2022
Full Text: View/download PDF

75. Guidance Through Surrogate: Toward a Generic Diagnostic Attack

Author: Naseer, Muzammal, primary, Khan, Salman, additional, Porikli, Fatih, additional, and Khan, Fahad Shahbaz, additional
Published: 2022
Full Text: View/download PDF

76. Orthogonal Projection Loss

Author: Ranasinghe, Kanchana, primary, Naseer, Muzammal, additional, Hayat, Munawar, additional, Khan, Salman, additional, and Khan, Fahad Shahbaz, additional
Published: 2021
Full Text: View/download PDF

77. On Generating Transferable Targeted Perturbations

Author: Naseer, Muzammal, primary, Khan, Salman, additional, Hayat, Munawar, additional, Khan, Fahad Shahbaz, additional, and Porikli, Fatih, additional
Published: 2021
Full Text: View/download PDF

78. A Self-supervised Approach for Adversarial Robustness

Author: Naseer, Muzammal, primary, Khan, Salman, additional, Hayat, Munawar, additional, Khan, Fahad Shahbaz, additional, and Porikli, Fatih, additional
Published: 2020
Full Text: View/download PDF

79. Local Gradients Smoothing: Defense Against Localized Adversarial Attacks

Author: Naseer, Muzammal, primary, Khan, Salman, additional, and Porikli, Fatih, additional
Published: 2019
Full Text: View/download PDF

80. Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey

Author: Naseer, Muzammal, primary, Khan, Salman, additional, and Porikli, Fatih, additional
Published: 2019
Full Text: View/download PDF

81. Stylized Adversarial Defense.

Author: Naseer M, Khan S, Hayat M, Khan FS, and Porikli F
Abstract: Deep Convolution Neural Networks (CNNs) can easily be fooled by subtle, imperceptible changes to the input images. To address this vulnerability, adversarial training creates perturbation patterns and includes them in the training set to robustify the model. In contrast to existing adversarial training methods that only use class-boundary information (e.g., using a cross-entropy loss), we propose to exploit additional information from the feature space to craft stronger adversaries that are in turn used to learn a robust model. Specifically, we use the style and content information of the target sample from another class, alongside its class-boundary information to create adversarial perturbations. We apply our proposed multi-task objective in a deeply supervised manner, extracting multi-scale feature knowledge to create maximally separating adversaries. Subsequently, we propose a max-margin adversarial training approach that minimizes the distance between source image and its adversary and maximizes the distance between the adversary and the target image. Our adversarial training approach demonstrates strong robustness compared to state-of-the-art defenses, generalizes well to naturally occurring corruptions and data distributional shifts, and retains the model's accuracy on clean examples.
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

81 results on '"Naseer, Muzammal"'

51. On Generating Transferable Targeted Perturbations

52. Orthogonal Projection Loss

53. Transformers in Vision: A Survey

54. Self-distilled Vision Transformer for Domain Generalization

55. Stylized Adversarial Defense

56. A Self-supervised Approach for Adversarial Robustness

57. Cross-Domain Transferability of Adversarial Perturbations

58. Self-distilled Vision Transformer for Domain Generalization

59. Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation

60. Task-generalizable Adversarial Attack based on Perceptual Metric

61. Local Gradients Smoothing: Defense against localized adversarial attacks

62. Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey

63. Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting

64. PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery

65. CLIP2Protect: Protecting Facial Privacy Using Text-Guided Makeup via Adversarial Latent Search

66. PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery

67. Stylized Adversarial Defense

68. GeoChat: Grounded Large Vision-Language Model for Remote Sensing

69. Guidance Through Surrogate: Toward a Generic Diagnostic Attack

70. Self-supervised Video Transformer

71. Transformers in Vision: A Survey

72. ON IMPROVING ADVERSARIAL TRANSFERABILITY OF VISION TRANSFORMERS

73. Transformers in Vision: A Survey

74. Stylized Adversarial Defense

75. Guidance Through Surrogate: Toward a Generic Diagnostic Attack

76. Orthogonal Projection Loss

77. On Generating Transferable Targeted Perturbations

78. A Self-supervised Approach for Adversarial Robustness

79. Local Gradients Smoothing: Defense Against Localized Adversarial Attacks

80. Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey

81. Stylized Adversarial Defense.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

81 results on '"Naseer, Muzammal"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources