Author: "Syed Waqas Zamir" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Syed Waqas Zamir"' showing total 63 results

Start Over Author "Syed Waqas Zamir"

63 results on '"Syed Waqas Zamir"'

1. Filling the Gaps in Atrous Convolution: Semantic Segmentation With a Better Context

Author: Liyuan Liu, Yanwei Pang, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, and Ling Shao
Subjects: Image processing, neural networks, semantic segmentation, supervised learning, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: The main challenge for scene parsing arises when complex scenes with highly diverse objects are encountered. The objects not only differ in scale and appearance but also in semantics. Previous works focus on encoding the multi-scale contextual information (via pooling or atrous convolutions) generally on top of compact high-level features (i.e., at a single stage). In this work, we argue that a rich set of cues exist at multiple stages of the network, encapsulating low, mid and high-level scene details. Therefore, an optimal scene parsing model must aggregate multi-scale context at all three levels of the feature hierarchy; a capability that lacks in state-of-the-art scene parsing models. To address this limitation, we introduce a novel architecture with three new blocks that systematically aggregate low, mid and high tier features. The heart of our approach is a high-level feature aggregation module that augments sparsely connected atrous convolution with dense local and layer-wise connections to avoid gridding artifacts. Besides, we employ a novel feature pyramid augmentation and semantic refinement unit to generate low- and mid-level features that are mixed with high-level features at the decoder. We extensively evaluate our proposed approach on the large-scale Cityscapes and ADE2K benchmarks. Our approach surpasses many latest models on both datasets, achieving mean intersection-over-union (mIoU) scores of 80.5% and 44.0% on Cityscapes and ADE20K, respectively.
Published: 2020
Full Text: View/download PDF

2. Gated Multi-Resolution Transfer Network for Burst Restoration and Enhancement.

Author: Nancy Mehta, Akshay Dudhane, Subrahmanyam Murala, Syed Waqas Zamir, Salman H. Khan 0001, and Fahad Shahbaz Khan
Published: 2023
Full Text: View/download PDF

3. Burstormer: Burst Image Restoration and Enhancement Transformer.

Author: Akshay Dudhane, Syed Waqas Zamir, Salman Khan 0001, Fahad Shahbaz Khan, and Ming-Hsuan Yang 0001
Published: 2023
Full Text: View/download PDF

4. NTIRE 2023 Challenge on Image Denoising: Methods and Results.

Author: Yawei Li 0001, Yulun Zhang 0001, Radu Timofte, Luc Van Gool, Zhijun Tu, Kunpeng Du, Hailing Wang, Hanting Chen, Wei Li 0002, Xiaofei Wang, Jie Hu 0021, Yunhe Wang 0001, Xiangyu Kong, Jinlong Wu, Dafeng Zhang, Jianxing Zhang, Shuai Liu 0009, Furui Bai, Chaoyu Feng, Hao Wang 0073, Yuqian Zhang, Guangqi Shao, Xiaotao Wang, Lei Lei, Rongjian Xu, Zhilu Zhang, Yunjin Chen, Dongwei Ren, Wangmeng Zuo, Qi Wu 0017, Mingyan Han, Shen Cheng, Haipeng Li, Ting Jiang, Chengzhi Jiang, Xinpeng Li, Jinting Luo, Wenjie Lin, Lei Yu, Haoqiang Fan, Shuaicheng Liu, Aditya Arora, Syed Waqas Zamir, Javier Vazquez-Corral, Konstantinos G. Derpanis, Michael S. Brown, Hao Li 0058, Zhihao Zhao, Jinshan Pan, Jiangxin Dong, Jinhui Tang 0001, Bo Yang 0047, Jingxiang Chen, Chenghua Li, Xi Zhang, Zhao Zhang 0001, Jiahuan Ren, Zhicheng Ji, Kang Miao, Suiyi Zhao, Huan Zheng, Yanyan Wei, Kangliang Liu, Xiangcheng Du, Sijie Liu, Yingbin Zheng, Xingjiao Wu, Cheng Jin 0001, Rajeev Irny, Sriharsha Koundinya, Vighnesh Kamath, Gaurav Khandelwal, Sunder Ali Khowaja, Jiseok Yoon, Ik Hyun Lee, Shijie Chen, Chengqiang Zhao, Huabin Yang, Zhongjian Zhang, Junjia Huang, and Yanru Zhang
Published: 2023
Full Text: View/download PDF

5. Accelerated MRI Reconstruction via Dynamic Deformable Alignment Based Transformer.

Author: Wafa Al Ghallabi, Akshay Dudhane, Syed Waqas Zamir, Salman H. Khan 0001, and Fahad Shahbaz Khan
Published: 2023
Full Text: View/download PDF

6. Restormer: Efficient Transformer for High-Resolution Image Restoration.

Author: Syed Waqas Zamir, Aditya Arora, Salman Khan 0001, Munawar Hayat, Fahad Shahbaz Khan, and Ming-Hsuan Yang 0001
Published: 2022
Full Text: View/download PDF

7. NTIRE 2022 Burst Super-Resolution Challenge.

Author: Goutam Bhat, Martin Danelljan, Radu Timofte, Yizhen Cao, Yuntian Cao, Meiya Chen, Xihao Chen, Shen Cheng, Akshay Dudhane, Haoqiang Fan, Ruipeng Gang, Jian Gao 0008, Yan Gu, Jie Huang 0017, Liufeng Huang, Youngsu Jo, Sukju Kang, Salman Khan 0001, Fahad Shahbaz Khan, Yuki Kondo, Chenghua Li, Fangya Li, Jinjing Li, Youwei Li, Zechao Li, Chenming Liu, Shuaicheng Liu, Zikun Liu, Zhuoming Liu, Ziwei Luo, Zhengxiong Luo, Nancy Mehta, Subrahmanyam Murala, Yoonchan Nam, Chihiro Nakatani, Pavel Ostyakov, Jinshan Pan, Ge Song, Jian Sun 0001, Long Sun, Jinhui Tang 0001, Norimichi Ukita, Zhihong Wen, Qi Wu 0017, Xiaohe Wu, Zeyu Xiao, Zhiwei Xiong, Rongjian Xu, Ruikang Xu, Youliang Yan, Jialin Yang, Wentao Yang, Zhongbao Yang, Fuma Yasue, Mingde Yao, Lei Yu, Cong Zhang, Syed Waqas Zamir, Jianxing Zhang, Shuohao Zhang, Zhilu Zhang, Qian Zheng, Gaofeng Zhou, Magauiya Zhussip, Xueyi Zou, and Wangmeng Zuo
Published: 2022
Full Text: View/download PDF

8. NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results.

Author: Yawei Li 0001, Kai Zhang 0008, Radu Timofte, Luc Van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu 0001, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen 0006, Haoming Cai, Yu Qiao 0001, Chao Dong 0005, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua 0001, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang 0001, Yuan Xie 0006, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu 0017, Youwei Li, Haoqiang Fan, Jian Sun 0001, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li 0008, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu 0040, Chao Chen, Yidong Cai, Jie Tang 0006, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu 0014, Keyan Wang, Jun Chen 0005, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoglu, Wei Xiong 0008, Chengyi Xiong, Fei Wang, Hao Li 0030, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye 0001, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman H. Khan 0001, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gao, Dengwen Zhou, Qian Ning, Jingzhu Tang, Han Huang, Yufei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li 0005, Jun Liu 0071, Wanjun Wang, Kun Zeng, Hanjiang Lin, Xinyu Chen, and Jinsheng Fang
Published: 2022
Full Text: View/download PDF

9. Adaptive Feature Consolidation Network for Burst Super-Resolution.

Author: Nancy Mehta, Akshay Dudhane, Subrahmanyam Murala, Syed Waqas Zamir, Salman Khan 0001, and Fahad Shahbaz Khan
Published: 2022
Full Text: View/download PDF

10. Burst Image Restoration and Enhancement.

Author: Akshay Dudhane, Syed Waqas Zamir, Salman Khan 0001, Fahad Shahbaz Khan, and Ming-Hsuan Yang 0001
Published: 2022
Full Text: View/download PDF

11. EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications.

Author: Muhammad Maaz 0001, Abdelrahman M. Shaker, Hisham Cholakkal, Salman H. Khan 0001, Syed Waqas Zamir, Rao Muhammad Anwer, and Fahad Shahbaz Khan
Published: 2022
Full Text: View/download PDF

12. PromptIR: Prompting for All-in-One Image Restoration.

Author: Vaishnav Potlapalli, Syed Waqas Zamir, Salman H. Khan 0001, and Fahad Shahbaz Khan
Published: 2023

13. Multi-Stage Progressive Image Restoration.

Author: Syed Waqas Zamir, Aditya Arora, Salman H. Khan 0001, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang 0001, and Ling Shao 0001
Published: 2021
Full Text: View/download PDF

14. CycleISP: Real Image Restoration via Improved Data Synthesis.

Author: Syed Waqas Zamir, Aditya Arora, Salman H. Khan 0001, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang 0001, and Ling Shao 0001
Published: 2020
Full Text: View/download PDF

15. Synthesizing the Unseen for Zero-Shot Object Detection.

Author: Nasir Hayat, Munawar Hayat, Shafin Rahman, Salman H. Khan 0001, Syed Waqas Zamir, and Fahad Shahbaz Khan
Published: 2020
Full Text: View/download PDF

16. Learning Enriched Features for Real Image Restoration and Enhancement.

Author: Syed Waqas Zamir, Aditya Arora, Salman H. Khan 0001, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang 0001, and Ling Shao 0001
Published: 2020
Full Text: View/download PDF

17. AIM 2020 Challenge on Real Image Super-Resolution: Methods and Results.

Author: Pengxu Wei, Hannan Lu, Radu Timofte, Liang Lin, Wangmeng Zuo, Zhihong Pan 0001, Baopu Li, Teng Xi, Yanwen Fan, Gang Zhang, Jingtuo Liu, Junyu Han, Errui Ding, Tangxin Xie, Liang Cao, Yan Zou, Yi Shen, Jialiang Zhang, Yu Jia, Kaihua Cheng, Chenhuan Wu, Yue Lin 0002, Cen Liu, Yunbo Peng, Xueyi Zou, Zhipeng Luo, Yuehan Yao, Zhenyu Xu, Syed Waqas Zamir, Aditya Arora, Salman H. Khan 0001, Munawar Hayat, Fahad Shahbaz Khan, Keon-Hee Ahn, Jun-Hyuk Kim, Jun-Ho Choi, Jong-Seok Lee, Tongtong Zhao, Shanshan Zhao 0003, Yoseob Han, Byung-Hoon Kim, JaeHyun Baek, Haoning Wu 0001, Dejia Xu, Bo Zhou, Wei Guan, Xiaobo Li, Chen Ye 0002, Hao Li 0058, Haoyu Zhong, Yukai Shi, Zhijing Yang, Xiaojun Yang, Xin Li 0082, Xin Jin 0014, Yaojun Wu, Yingxue Pang, Sen Liu 0001, Zhi-Song Liu, Li-Wen Wang, Chu-Tak Li, Marie-Paule Cani, Wan-Chi Siu, Yuanbo Zhou, Rao Muhammad Umer, Christian Micheloni, Xiaofeng Cong, Rajat Gupta, Feras Almasri, Thomas Vandamme, and Olivier Debeir
Published: 2020
Full Text: View/download PDF

18. Gaussian Affinity for Max-Margin Class Imbalanced Learning.

Author: Munawar Hayat, Salman H. Khan 0001, Syed Waqas Zamir, Jianbing Shen, and Ling Shao 0001
Published: 2019
Full Text: View/download PDF

19. NTIRE 2019 Challenge on Image Enhancement: Methods and Results.

Author: Andrey Ignatov, Radu Timofte, Xiaochao Qu, Xingguang Zhou, Ting Liu, Pengfei Wan, Syed Waqas Zamir, Aditya Arora, Salman H. Khan 0001, Fahad Shahbaz Khan, Ling Shao 0001, Dongwon Park, Se Young Chun, Pablo Navarrete Michelini, Hanwen Liu, Dan Zhu, Zhiwei Zhong, Xianming Liu, Junjun Jiang, Debin Zhao, Muhammad Haris 0002, Kazutoshi Akita, Tomoki Yoshida, Greg Shakhnarovich, Norimichi Ukita, Jie Liu 0042, Cheolkon Jung, Raimondo Schettini, Simone Bianco 0001, Claudio Cusano, Flavio Piccoli, Pengju Liu, Kai Zhang 0008, Jingdong Liu, Jiye Liu, Hongzhi Zhang, Wangmeng Zuo, Nelson Chong Ngee Bow, Lai-Kuan Wong, John See, Jinghui Qin, Lishan Huang, Yukai Shi, Pengxu Wei, Wushao Wen, Liang Lin, Zheng Hui, Xiumei Wang, Xinbo Gao 0001, Kanti Kumari, Vikas Kumar Anand, Mahendra Khened, and Ganapathy Krishnamurthi
Published: 2019
Full Text: View/download PDF

20. NTIRE 2019 Challenge on Real Image Denoising: Methods and Results.

Author: Abdelrahman Abdelhamed, Radu Timofte, Michael S. Brown, Songhyun Yu, Bumjun Park, Jechang Jeong, Seung-Won Jung, Dong-Wook Kim, Jae Ryun Chung, Jiaming Liu, Yuzhi Wang, Chi-Hao Wu 0001, Qin Xu, Chuan Wang 0001, Shaofan Cai, Yifan Ding, Haoqiang Fan, Jue Wang 0001, Kai Zhang 0008, Wangmeng Zuo, Magauiya Zhussip, Dongwon Park, Shakarim Soltanayev, Se Young Chun, Zhiwei Xiong, Chang Chen, Muhammad Haris 0002, Kazutoshi Akita, Tomoki Yoshida, Greg Shakhnarovich, Norimichi Ukita, Syed Waqas Zamir, Aditya Arora, Salman H. Khan 0001, Fahad Shahbaz Khan, Ling Shao 0001, Sung-Jea Ko, Dong-Pan Lim, Seung-Wook Kim 0002, Seo-Won Ji, Sang-Won Lee, Wenyi Tang, Yuchen Fan, Yuqian Zhou, Ding Liu 0001, Thomas S. Huang, Deyu Meng, Lei Zhang 0006, Hongwei Yong, Yiyun Zhao, Pengliang Tang, Yue Lu, Raimondo Schettini, Simone Bianco 0001, Simone Zini, Chi Li, Yang Wang 0023, and Zhiguo Cao 0001
Published: 2019
Full Text: View/download PDF

21. iSAID: A Large-scale Dataset for Instance Segmentation in Aerial Images.

Author: Syed Waqas Zamir, Aditya Arora, Akshita Gupta, Salman H. Khan 0001, Guolei Sun, Fahad Shahbaz Khan, Fan Zhu 0001, Ling Shao 0001, Gui-Song Xia, and Xiang Bai
Published: 2019

22. NTIRE 2019 Challenge on Video Deblurring: Methods and Results.

Author: Seungjun Nah, Radu Timofte, Sungyong Baik, Seokil Hong, Gyeongsik Moon, Sanghyun Son 0002, Kyoung Mu Lee, Xintao Wang, Kelvin C. K. Chan, Ke Yu, Chao Dong 0005, Chen Change Loy, Yuchen Fan, Jiahui Yu, Ding Liu 0001, Thomas S. Huang, Hyeonjun Sim, Munchurl Kim, Dongwon Park, Jisoo Kim, Se Young Chun, Muhammad Haris 0002, Greg Shakhnarovich, Norimichi Ukita, Syed Waqas Zamir, Aditya Arora, Salman H. Khan 0001, Fahad Shahbaz Khan, Ling Shao 0001, Rahul Kumar Gupta, Vishal M. Chudasama, Heena Patel, Kishor P. Upla, Hongfei Fan, Guo Li, Yumei Zhang, Xiang Li 0103, Wenjie Zhang, Qingwen He, Kuldeep Purohit, A. N. Rajagopalan 0001, Jeonghun Kim, Mohammad Tofighi, Tiantong Guo, and Vishal Monga
Published: 2019
Full Text: View/download PDF

23. Striking the Right Balance With Uncertainty.

Author: Salman H. Khan 0001, Munawar Hayat, Syed Waqas Zamir, Jianbing Shen, and Ling Shao 0001
Published: 2019
Full Text: View/download PDF

24. NTIRE 2019 Challenge on Real Image Super-Resolution: Methods and Results.

Author: Jianrui Cai, Shuhang Gu, Radu Timofte, Lei Zhang 0006, Xiao Liu 0022, Yukang Ding, Dongliang He, Chao Li 0034, Yi Fu, Shilei Wen, Ruicheng Feng, Jinjin Gu, Yu Qiao 0001, Chao Dong 0005, Dongwon Park, Se Young Chun, Sanghoon Yoon, Junhyung Kwak, Donghee Son, Syed Waqas Zamir, Aditya Arora, Salman H. Khan 0001, Fahad Shahbaz Khan, Ling Shao 0001, Zhengping Wei, Lei Liu, Hong Cai, Darui Li, Fujie Gao, Zheng Hui, Xiumei Wang, Xinbo Gao 0001, Guoan Cheng, Ai Matsune, Qiuyu Li, Leilei Zhu, Huaijuan Zang, Shu Zhan, Yajun Qiu, Ruxin Wang 0002, Jiawei Li, Yongcheng Jing, Mingli Song, Pengju Liu, Kai Zhang 0008, Jingdong Liu, Jiye Liu, Hongzhi Zhang, Wangmeng Zuo, Wenyi Tang, Jing Liu 0031, Youngjung Kim, Changyeop Shin, Minbeom Kim, Sungho Kim, Pablo Navarrete Michelini, Hanwen Liu, Dan Zhu, Xuan Xu, Xin Li 0005, Furui Bai, Xiaopeng Sun, Lin Zha, Yuanfei Huang, Wen Lu, Yanpeng Cao, Du Chen, Zewei He, Anshun Sun, Siliang Tang, Hongfei Fan, Xiang Li 0103, Guo Li, Wenjie Zhang, Yumei Zhang, Qingwen He, Jinghui Qin, Lishan Huang, Yukai Shi, Pengxu Wei, Wushao Wen, Liang Lin, Jun Yu 0001, Guochen Xie, Mengyan Li, Rong Chen, Xiaotong Luo, Chen Hong, Yanyun Qu, Cuihua Li, Zhi-Song Liu, Li-Wen Wang, Chu-Tak Li, Can Zhao 0001, Bowen Li, Chung-Chi Tsai, Shang-Chih Chuang, Joonhee Choi, Joonsoo Kim, Xiaoyun Jiang, Ze Pan, Qunbo Lv, Zheng Tan, and Peidong He
Published: 2019
Full Text: View/download PDF

25. Gamut Reduction Through Local Saturation Reduction.

Author: Syed Waqas Zamir, Javier Vazquez-Corral, and Marcelo Bertalmío
Published: 2017
Full Text: View/download PDF

26. Image processing applications through a variational perceptually-based color correction related to Retinex.

Author: Javier Vazquez-Corral, Syed Waqas Zamir, Adrian Galdran, David Pardo, and Marcelo Bertalmío
Published: 2016
Full Text: View/download PDF

27. Considering saliency in a perception inspired gamut reduction algorithm.

Author: Javier Vazquez-Corral, Syed Waqas Zamir, and Marcelo Bertalmío
Published: 2014
Full Text: View/download PDF

28. EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications

Author: Muhammad Maaz, Abdelrahman Shaker, Hisham Cholakkal, Salman Khan, Syed Waqas Zamir, Rao Muhammad Anwer, and Fahad Shahbaz Khan
Published: 2023

29. Gamut Mapping through Perceptually-Based Contrast Reduction.

Author: Syed Waqas Zamir, Javier Vazquez-Corral, and Marcelo Bertalmío
Published: 2013
Full Text: View/download PDF

30. NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

Author: Yawei Li, Kai Zhang, Radu Timofte, Luc Van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wul, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoglu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gao, Dengwen Zhou, Qian Ning, Jingzhu Tang, Han Huang, Yufei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Kun Zeng, Hanjiang Lin, Xinyu Chen, and Jinsheng Fang
Published: 2022

31. Learning Enriched Features for Fast Image Restoration and Enhancement

Author: Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao
Subjects: FOS: Computer and information sciences, Computational Theory and Mathematics, Artificial Intelligence, Computer Vision and Pattern Recognition (cs.CV), Applied Mathematics, Image and Video Processing (eess.IV), FOS: Electrical engineering, electronic engineering, information engineering, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Computer Science - Computer Vision and Pattern Recognition, Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing, Software
Abstract: Given a degraded input image, image restoration aims to recover the missing high-quality image content. Numerous applications demand effective image restoration, e.g., computational photography, surveillance, autonomous vehicles, and remote sensing. Significant advances in image restoration have been made in recent years, dominated by convolutional neural networks (CNNs). The widely-used CNN-based methods typically operate either on full-resolution or on progressively low-resolution representations. In the former case, spatial details are preserved but the contextual information cannot be precisely encoded. In the latter case, generated outputs are semantically reliable but spatially less accurate. This paper presents a new architecture with a holistic goal of maintaining spatially-precise high-resolution representations through the entire network, and receiving complementary contextual information from the low-resolution representations. The core of our approach is a multi-scale residual block containing the following key elements: (a) parallel multi-resolution convolution streams for extracting multi-scale features, (b) information exchange across the multi-resolution streams, (c) non-local attention mechanism for capturing contextual information, and (d) attention based multi-scale feature aggregation. Our approach learns an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details. Extensive experiments on six real image benchmark datasets demonstrate that our method, named as MIRNet-v2 , achieves state-of-the-art results for a variety of image processing tasks, including defocus deblurring, image denoising, super-resolution, and image enhancement. The source code and pre-trained models are available at https://github.com/swz30/MIRNetv2, This article supersedes arXiv:2003.06792. Accepted for publication in TPAMI
Published: 2022

32. Filling the Gaps in Atrous Convolution: Semantic Segmentation With a Better Context

Author: Salman Khan, Liyuan Liu, Ling Shao, Yanwei Pang, Syed Waqas Zamir, and Fahad Shahbaz Khan
Subjects: General Computer Science, Computer science, 02 engineering and technology, computer.software_genre, supervised learning, Convolution, Image processing, 020204 information systems, Pyramid, 0202 electrical engineering, electronic engineering, information engineering, General Materials Science, Segmentation, Context model, Parsing, business.industry, General Engineering, Pattern recognition, Image segmentation, neural networks, semantic segmentation, Kernel (image processing), 020201 artificial intelligence & image processing, Artificial intelligence, lcsh:Electrical engineering. Electronics. Nuclear engineering, business, computer, lcsh:TK1-9971
Abstract: The main challenge for scene parsing arises when complex scenes with highly diverse objects are encountered. The objects not only differ in scale and appearance but also in semantics. Previous works focus on encoding the multi-scale contextual information (via pooling or atrous convolutions) generally on top of compact high-level features (i.e., at a single stage). In this work, we argue that a rich set of cues exist at multiple stages of the network, encapsulating low, mid and high-level scene details. Therefore, an optimal scene parsing model must aggregate multi-scale context at all three levels of the feature hierarchy; a capability that lacks in state-of-the-art scene parsing models. To address this limitation, we introduce a novel architecture with three new blocks that systematically aggregate low, mid and high tier features. The heart of our approach is a high-level feature aggregation module that augments sparsely connected atrous convolution with dense local and layer-wise connections to avoid gridding artifacts. Besides, we employ a novel feature pyramid augmentation and semantic refinement unit to generate low- and mid-level features that are mixed with high-level features at the decoder. We extensively evaluate our proposed approach on the large-scale Cityscapes and ADE2K benchmarks. Our approach surpasses many latest models on both datasets, achieving mean intersection-over-union (mIoU) scores of 80.5% and 44.0% on Cityscapes and ADE20K, respectively.
Published: 2020

33. Transformers in Medical Imaging: A Survey

Author: Fahad Shamshad, Salman Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, Fahad Shahbaz Khan, and Huazhu Fu
Subjects: FOS: Computer and information sciences, Radiological and Ultrasound Technology, Computer Vision and Pattern Recognition (cs.CV), Image and Video Processing (eess.IV), Computer Science - Computer Vision and Pattern Recognition, FOS: Electrical engineering, electronic engineering, information engineering, Health Informatics, Radiology, Nuclear Medicine and imaging, Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing, Computer Graphics and Computer-Aided Design
Abstract: Following unprecedented success on the natural language tasks, Transformers have been successfully applied to several computer vision problems, achieving state-of-the-art results and prompting researchers to reconsider the supremacy of convolutional neural networks (CNNs) as {de facto} operators. Capitalizing on these advances in computer vision, the medical imaging field has also witnessed growing interest for Transformers that can capture global context compared to CNNs with local receptive fields. Inspired from this transition, in this survey, we attempt to provide a comprehensive review of the applications of Transformers in medical imaging covering various aspects, ranging from recently proposed architectural designs to unsolved issues. Specifically, we survey the use of Transformers in medical image segmentation, detection, classification, reconstruction, synthesis, registration, clinical report generation, and other tasks. In particular, for each of these applications, we develop taxonomy, identify application-specific challenges as well as provide insights to solve them, and highlight recent trends. Further, we provide a critical discussion of the field's current state as a whole, including the identification of key challenges, open problems, and outlining promising future directions. We hope this survey will ignite further interest in the community and provide researchers with an up-to-date reference regarding applications of Transformer models in medical imaging. Finally, to cope with the rapid development in this field, we intend to regularly update the relevant latest papers and their open-source implementations at \url{https://github.com/fahadshamshad/awesome-transformers-in-medical-imaging}., Comment: 41 pages, \url{https://github.com/fahadshamshad/awesome-transformers-in-medical-imaging}
Published: 2022
Full Text: View/download PDF

34. Restormer: Efficient Transformer for High-Resolution Image Restoration

Author: Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Ming-Hsuan Yang
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science::Computer Vision and Pattern Recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Computer Science - Computer Vision and Pattern Recognition
Abstract: Since convolutional neural networks (CNNs) perform well at learning generalizable image priors from large-scale data, these models have been extensively applied to image restoration and related tasks. Recently, another class of neural architectures, Transformers, have shown significant performance gains on natural language and high-level vision tasks. While the Transformer model mitigates the shortcomings of CNNs (i.e., limited receptive field and inadaptability to input content), its computational complexity grows quadratically with the spatial resolution, therefore making it infeasible to apply to most image restoration tasks involving high-resolution images. In this work, we propose an efficient Transformer model by making several key designs in the building blocks (multi-head attention and feed-forward network) such that it can capture long-range pixel interactions, while still remaining applicable to large images. Our model, named Restoration Transformer (Restormer), achieves state-of-the-art results on several image restoration tasks, including image deraining, single-image motion deblurring, defocus deblurring (single-image and dual-pixel data), and image denoising (Gaussian grayscale/color denoising, and real image denoising). The source code and pre-trained models are available at https://github.com/swz30/Restormer., Accepted at CVPR 2022. #CVPR2022
Published: 2021

35. Burst Image Restoration and Enhancement

Author: Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, and Ming-Hsuan Yang
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Computer Science - Computer Vision and Pattern Recognition
Abstract: Modern handheld devices can acquire burst image sequence in a quick succession. However, the individual acquired frames suffer from multiple degradations and are misaligned due to camera shake and object motions. The goal of Burst Image Restoration is to effectively combine complimentary cues across multiple burst frames to generate high-quality outputs. Towards this goal, we develop a novel approach by solely focusing on the effective information exchange between burst frames, such that the degradations get filtered out while the actual scene details are preserved and enhanced. Our central idea is to create a set of pseudo-burst features that combine complementary information from all the input burst frames to seamlessly exchange information. However, the pseudo-burst cannot be successfully created unless the individual burst frames are properly aligned to discount inter-frame movements. Therefore, our approach initially extracts pre-processed features from each burst frame and matches them using an edge-boosting burst alignment module. The pseudo-burst features are then created and enriched using multi-scale contextual information. Our final step is to adaptively aggregate information from the pseudo-burst features to progressively increase resolution in multiple stages while merging the pseudo-burst features. In comparison to existing works that usually follow a late fusion scheme with single-stage upsampling, our approach performs favorably, delivering state-of-the-art performance on burst superresolution, burst low-light image enhancement, and burst denoising tasks. The source code and pre-trained models are available at \url{https://github.com/akshaydudhane16/BIPNet}., Accepted at CVPR 2022 [Oral]
Published: 2021

36. NTIRE 2021 Challenge for Defocus Deblurring Using Dual-pixel Images: Methods and Results

Author: Salman Khan, Ling Shao, Sabari Nathan, Xiaoying Bai, Fahad Shahbaz Khan, Xiaobing Wang, Nisarg Shah, Venkatakrishnan Srinija, Chenghua Li, Tu Vo, Abdullah Abuolaim, Xingyu Hu, Yunan Zhu, Chi Zhang, Zhiwei Xiong, Ruikang Xu, Munawar Hayat, Venkatakrishnan Srivatsav, Zhiwei Zhong, Ruipeng Gang, Chaoyu Feng, Thangavelu Ragavendran, Michael S. Brown, Aditya Arora, Lei Lei, Pengwei Liang, Shuai Liu, Syed Waqas Zamir, Zeyu Xiao, Yiqun Chen, Radu Timofte, Si Miao, Dong Liu, Yiheng Yao, and Dafeng Zhang
Subjects: Signal processing, Deblurring, Atmospheric measurements, Pixel, business.industry, Computer science, Pattern recognition (psychology), ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Computer vision, Artificial intelligence, Depth of field, business, Dual (category theory)
Abstract: This paper provides a review of the NTIRE 2021 challenge targeting defocus deblurring using dual-pixel (DP) data. The goal of this single-track challenge was to reduce spatially varying defocus blur present in images captured with a shallow depth of field. The images used in this challenge were obtained using a DP sensor that provided a pair of DP views per captured image. Submitted solutions were evaluated using conventional signal processing metrics, namely peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM). Out of 185 registered participants, nine teams provided methods and competed in the final stage. The paper describes the methods proposed by the participating teams and their results. The winning teams represent the state-of-the-art in terms of defocus de-blurring using DP images.
Published: 2021

37. NTIRE 2021 NonHomogeneous Dehazing Challenge Report

Author: Jian Cheng, Florin-Alexandru Vasluianu, Jiang Yang, Jerome Chang, Joseph Zacharias, Xiaotong Luo, Sreeni K G, Minyi Zhao, Zexi Huang, Jun Luo, Xiyao Wang, Yi Xu, Aditya Arora, Zhenyu Xu, Qixin Yan, Akhil K A, Fahad Shahbaz Khan, Zilong Huang, Tianyi Chen, Kiran B. Raja, Codruta Orniana Ancuti, Quan Xiao, Jun Chen, Tiantong Guo, Jindong Li, Pengliang Sun, Hongyuan Jing, Huan Liu, Shuigeng Zhou, Yiwen Zhang, Vishal Chudasama, Salman Khan, Chen Gao, Keyan Wang, Kele Xu, Kishor P. Upla, Lehan Yang, Junjun Zheng, Zhiwei Zhu, Zhipeng Luo, Yanting Huang, Qingchao Su, Yankun Yu, Tao Wang, Jiahui Fu, Si Liu, Minghan Fu, Kalpesh Prajapati, Xiaotong Ruan, Wenqi Ren, Akshay Dudhane, Xiaochun Cao, Jing Liu, Yunfeng Wang, Zhuoran Zheng, Chenghua Li, Wentao Jiang, Anjali Sarvaiya, Heena Patel, Jun-Cheng Chen, Lihua Han, Eunsung Jo, Xinjian Zhang, Christoph Busch, Xuetong Niu, Wenjin Yang, Shuxin Chen, Jichang Guo, Ling Shao, Hejun Lv, Haichuan Zhang, Quanxing Zha, Chang-Sung Sung, Mianjie Chen, Yanyun Qu, Munawar Hayat, Chongyi Li, Geethu M M, Guowen Huang, Raghavendra Ramachandra, Baofeng Zhang, Syed Waqas Zamir, Cosmin Ancuti, Sai Wang, Yiqun Chen, Jae-Young Sim, Sida Zheng, Radu Timofte, Jeena R S, Vishal Monga, Yiran Fu, Yudong Wang, Jin Lin, Cong Leng, Haoqiang Wu, and Chippy M Manu
Subjects: Haze, business.industry, Computer science, Pattern recognition (psychology), Computer vision, Artificial intelligence, business
Abstract: This work reviews the results of the NTIRE 2021 Challenge on Non-Homogeneous Dehazing. The proposed techniques and their results have been evaluated on a novel dataset that extends the NH-Haze datset. It consists of additional 35 pairs of real haze free and nonhomogeneous hazy images recorded outdoor. The nonhomogeneous haze has been introduced in the outdoor scenes by using a a professional setup that imitates the real conditions of haze scenes. 327 participants registered in the challenge and 23 teams competed in the final testing phase. The proposed solutions gauge the state-of-the-art in image dehazing.
Published: 2021

38. Gamut extension for cinema: psychophysical evaluation of the state of the art and a new algorithm.

Author: Syed Waqas Zamir, Javier Vazquez-Corral, and Marcelo Bertalmío
Published: 2015
Full Text: View/download PDF

39. Transformers in Vision: A Survey

Author: Salman Khan, Muzammal Naseer, Munawar Hayat, Syed Waqas Zamir, Fahad Shahbaz Khan, and Mubarak Shah
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial Intelligence (cs.AI), General Computer Science, Computer Science - Artificial Intelligence, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Theoretical Computer Science, Machine Learning (cs.LG)
Abstract: Astounding results from Transformer models on natural language tasks have intrigued the vision community to study their application to computer vision problems. Among their salient benefits, Transformers enable modeling long dependencies between input sequence elements and support parallel processing of sequence as compared to recurrent networks e.g., Long short-term memory (LSTM). Different from convolutional networks, Transformers require minimal inductive biases for their design and are naturally suited as set-functions. Furthermore, the straightforward design of Transformers allows processing multiple modalities (e.g., images, videos, text and speech) using similar processing blocks and demonstrates excellent scalability to very large capacity networks and huge datasets. These strengths have led to exciting progress on a number of vision tasks using Transformer networks. This survey aims to provide a comprehensive overview of the Transformer models in the computer vision discipline. We start with an introduction to fundamental concepts behind the success of Transformers i.e., self-attention, large-scale pre-training, and bidirectional encoding. We then cover extensive applications of transformers in vision including popular recognition tasks (e.g., image classification, object detection, action recognition, and segmentation), generative modeling, multi-modal tasks (e.g., visual-question answering, visual reasoning, and visual grounding), video processing (e.g., activity recognition, video forecasting), low-level vision (e.g., image super-resolution, image enhancement, and colorization) and 3D analysis (e.g., point cloud classification and segmentation). We compare the respective advantages and limitations of popular techniques both in terms of architectural design and their experimental value. Finally, we provide an analysis on open research directions and possible future works., Comment: 30 pages (Accepted in ACM Computing Surveys December 2021)
Published: 2021
Full Text: View/download PDF

40. Multi-Stage Progressive Image Restoration

Author: Munawar Hayat, Aditya Arora, Fahad Shahbaz Khan, Salman Khan, Ling Shao, Ming-Hsuan Yang, and Syed Waqas Zamir
Subjects: FOS: Computer and information sciences, Deblurring, Source code, Computer science, business.industry, Process (engineering), media_common.quotation_subject, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, 020206 networking & telecommunications, 02 engineering and technology, Machine learning, computer.software_genre, Feature (computer vision), Pattern recognition (psychology), 0202 electrical engineering, electronic engineering, information engineering, Key (cryptography), 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Image restoration, Information exchange, media_common
Abstract: Image restoration tasks demand a complex balance between spatial details and high-level contextualized information while recovering images. In this paper, we propose a novel synergistic design that can optimally balance these competing goals. Our main proposal is a multi-stage architecture, that progressively learns restoration functions for the degraded inputs, thereby breaking down the overall recovery process into more manageable steps. Specifically, our model first learns the contextualized features using encoder-decoder architectures and later combines them with a high-resolution branch that retains local information. At each stage, we introduce a novel per-pixel adaptive design that leverages in-situ supervised attention to reweight the local features. A key ingredient in such a multi-stage architecture is the information exchange between different stages. To this end, we propose a two-faceted approach where the information is not only exchanged sequentially from early to late stages, but lateral connections between feature processing blocks also exist to avoid any loss of information. The resulting tightly interlinked multi-stage architecture, named as MPRNet, delivers strong performance gains on ten datasets across a range of tasks including image deraining, deblurring, and denoising. The source code and pre-trained models are available at https://github.com/swz30/MPRNet., Comment: Accepted at CVPR 2021
Published: 2021
Full Text: View/download PDF

41. Synthesizing the Unseen for Zero-Shot Object Detection

Author: Munawar Hayat, Salman H. Khan, Fahad Shahbaz Khan, Syed Waqas Zamir, Nasir Hayat, and Shafin Rahman
Subjects: Class (computer programming), business.industry, Computer science, Inference, Pattern recognition, 02 engineering and technology, Semantic domain, Pascal (programming language), 010501 environmental sciences, Semantics, 01 natural sciences, Object detection, Generative model, Bounding overwatch, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, 0105 earth and related environmental sciences, computer.programming_language
Abstract: The existing zero-shot detection approaches project visual features to the semantic domain for seen objects, hoping to map unseen objects to their corresponding semantics during inference. However, since the unseen objects are never visualized during training, the detection model is skewed towards seen content, thereby labeling unseen as background or a seen class. In this work, we propose to synthesize visual features for unseen classes, so that the model learns both seen and unseen objects in the visual domain. Consequently, the major challenge becomes, how to accurately synthesize unseen objects merely using their class semantics? Towards this ambitious goal, we propose a novel generative model that uses class-semantics to not only generate the features but also to discriminatively separate them. Further, using a unified model, we ensure the synthesized features have high diversity that represents the intra-class differences and variable localization precision in the detected bounding boxes. We test our approach on three object detection benchmarks, PASCAL VOC, MSCOCO, and ILSVRC detection, under both conventional and generalized settings, showing impressive gains over the state-of-the-art methods. Our codes are available at https://github.com/nasir6/zero_shot_detection.
Published: 2021

42. Learning Enriched Features for Real Image Restoration and Enhancement

Author: Ling Shao, Syed Waqas Zamir, Ming-Hsuan Yang, Salman Khan, Fahad Shahbaz Khan, Munawar Hayat, and Aditya Arora
Subjects: Source code, Channel (digital image), Computer science, business.industry, media_common.quotation_subject, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 020207 software engineering, Pattern recognition, 02 engineering and technology, Real image, Convolutional neural network, Computational photography, 0202 electrical engineering, electronic engineering, information engineering, Benchmark (computing), 020201 artificial intelligence & image processing, Artificial intelligence, business, Image restoration, Block (data storage), media_common
Abstract: With the goal of recovering high-quality image content from its degraded version, image restoration enjoys numerous applications, such as in surveillance, computational photography and medical imaging. Recently, convolutional neural networks (CNNs) have achieved dramatic improvements over conventional approaches for image restoration task. Existing CNN-based methods typically operate either on full-resolution or on progressively low-resolution representations. In the former case, spatially precise but contextually less robust results are achieved, while in the latter case, semantically reliable but spatially less accurate outputs are generated. In this paper, we present an architecture with the collective goals of maintaining spatially-precise high-resolution representations through the entire network and receiving strong contextual information from the low-resolution representations. The core of our approach is a multi-scale residual block containing several key elements: (a) parallel multi-resolution convolution streams for extracting multi-scale features, (b) information exchange across the multi-resolution streams, (c) spatial and channel attention mechanisms for capturing contextual information, and (d) attention based multi-scale feature aggregation. In a nutshell, our approach learns an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details. Extensive experiments on five real image benchmark datasets demonstrate that our method, named as MIRNet, achieves state-of-the-art results for image denoising, super-resolution, and image enhancement. The source code and pre-trained models are available at https://github.com/swz30/MIRNet.
Published: 2020

43. CycleISP: Real Image Restoration via Improved Data Synthesis

Author: Salman H. Khan, Ling Shao, Aditya Arora, Munawar Hayat, Syed Waqas Zamir, Fahad Shahbaz Khan, and Ming-Hsuan Yang
Subjects: FOS: Computer and information sciences, Computer science, sRGB, Noise reduction, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 02 engineering and technology, Convolutional neural network, symbols.namesake, 0202 electrical engineering, electronic engineering, information engineering, Image noise, FOS: Electrical engineering, electronic engineering, information engineering, Computer vision, business.industry, Image and Video Processing (eess.IV), 020206 networking & telecommunications, Electrical Engineering and Systems Science - Image and Video Processing, Real image, Additive white Gaussian noise, Computer Science::Computer Vision and Pattern Recognition, Benchmark (computing), symbols, 020201 artificial intelligence & image processing, Artificial intelligence, business
Abstract: The availability of large-scale datasets has helped unleash the true potential of deep convolutional neural networks (CNNs). However, for the single-image denoising problem, capturing a real dataset is an unacceptably expensive and cumbersome procedure. Consequently, image denoising algorithms are mostly developed and evaluated on synthetic data that is usually generated with a widespread assumption of additive white Gaussian noise (AWGN). While the CNNs achieve impressive results on these synthetic datasets, they do not perform well when applied on real camera images, as reported in recent benchmark datasets. This is mainly because the AWGN is not adequate for modeling the real camera noise which is signal-dependent and heavily transformed by the camera imaging pipeline. In this paper, we present a framework that models camera imaging pipeline in forward and reverse directions. It allows us to produce any number of realistic image pairs for denoising both in RAW and sRGB spaces. By training a new image denoising network on realistic synthetic data, we achieve the state-of-the-art performance on real camera benchmark datasets. The parameters in our model are ~5 times lesser than the previous best method for RAW denoising. Furthermore, we demonstrate that the proposed framework generalizes beyond image denoising problem e.g., for color matching in stereoscopic cinema. The source code and pre-trained models are available at https://github.com/swz30/CycleISP., Comment: CVPR 2020 (Oral)
Published: 2020
Full Text: View/download PDF

44. Vision Models for Wide Color Gamut Imaging in Cinema

Author: Marcelo Bertalmío, Javier Vazquez-Corral, Syed Waqas Zamir, European Commission, and Ministerio de Ciencia, Innovación y Universidades (España)
Subjects: Computer science, Color reproduction, Gamut mappingfor cinema, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Vision models for color and contrast, 02 engineering and technology, Display device, Reduction (complexity), Gamut, Wide gamut imaging, Artificial Intelligence, 0202 electrical engineering, electronic engineering, information engineering, Computer vision, ComputingMethodologies_COMPUTERGRAPHICS, business.industry, Applied Mathematics, Palette (computing), Gamut mapping for cinema, Gamut mapping algorithms, Cinematography, Vision science, Computational Theory and Mathematics, 020201 artificial intelligence & image processing, Computer Vision and Pattern Recognition, Artificial intelligence, business, Software
Abstract: 14 pags., 14 figs., Gamut mapping is the problem of transforming the colors of image or video content so as to fully exploit the color palette of the display device where the content will be shown, while preserving the artistic intent of the original content’s creator. In particular, in the cinema industry, the rapid advancement in display technologies has created a pressing need to develop automatic and fast gamut mapping algorithms. In this article, we propose a novel framework that is based on vision science models, performs both gamut reduction and gamut extension, is of low computational complexity, produces results that are free from artifacts and outperforms state-of-the-art methods according to psychophysical tests. Our experiments also highlight the limitations of existing objective metrics for the gamut mapping problem., This work has received funding from the European Union’s Horizon 2020 research and innovation programme under Grant agreement number 761544 (project HDR4EU) and under Grant agreement number 780470 (project SAUCE), and by the Spanish government and FEDER Fund, grant ref. PGC2018-099651-B-I00 (MCIU/AEI/FEDER, UE). The work of J. Vazquez-Corral was supported by the Spanish government under Grant IJCI-2014-19516.
Published: 2019

45. Gaussian Affinity for Max-Margin Class Imbalanced Learning

Author: Jianbing Shen, Ling Shao, Munawar Hayat, Syed Waqas Zamir, and Salman Khan
Subjects: Artificial neural network, business.industry, Computer science, Euclidean space, Feature vector, Gaussian, Pattern recognition, 02 engineering and technology, 010501 environmental sciences, 01 natural sciences, symbols.namesake, ComputingMethodologies_PATTERNRECOGNITION, Robustness (computer science), 0202 electrical engineering, electronic engineering, information engineering, symbols, 020201 artificial intelligence & image processing, Artificial intelligence, Differentiable function, business, Cluster analysis, Classifier (UML), 0105 earth and related environmental sciences
Abstract: Real-world object classes appear in imbalanced ratios. This poses a significant challenge for classifiers which get biased towards frequent classes. We hypothesize that improving the generalization capability of a classifier should improve learning on imbalanced datasets. Here, we introduce the first hybrid loss function that jointly performs classification and clustering in a single formulation. Our approach is based on an `affinity measure' in Euclidean space that leads to the following benefits: (1) direct enforcement of maximum margin constraints on classification boundaries, (2) a tractable way to ensure uniformly spaced and equidistant cluster centers, (3) flexibility to learn multiple class prototypes to support diversity and discriminability in feature space. Our extensive experiments demonstrate the significant performance improvements on visual classification and verification tasks on multiple imbalanced datasets. The proposed loss can easily be plugged in any deep architecture as a differentiable block and demonstrates robustness against different levels of data imbalance and corrupted labels.
Published: 2019

46. NTIRE 2019 Challenge on Real Image Super-Resolution: Methods and Results

Author: Furui Bai, Jiawei Li, Sun Anshun, Fujie Gao, Yajun Qiu, Lei Liu, Xiumei Wang, Jiye Liu, Jing Liu, Lei Zhang, Jinghui Qin, Sanghoon Yoon, Guoan Cheng, Ruicheng Feng, Zheng Hui, Lin Zha, Li Guo, Dongliang He, Chu-Tak Li, Cuihua Li, Guochen Xie, Jun Yu, Rong Chen, Bowen Li, Shuhang Gu, Wenyi Tang, Shu Zhan, Zhang Yumei, Xiaoyun Jiang, Yanyun Qu, Qingwen He, Pengxu Wei, Ze Pan, Sungho Kim, Pablo Navarrete Michelini, Zhengping Wei, Xuan Xu, Darui Li, Peidong He, Minbeom Kim, Fan Hongfei, Joon-Soo Kim, Liang Lin, Changyeop Shin, Leilei Zhu, Xiang Li, Wangmeng Zuo, Jianrui Cai, Ling Shao, Donghee Son, Qunbo Lv, Yongcheng Jing, Can Zhao, Yi Fu, Yu Qiao, Chung-Chi Tsai, Shilei Wen, Dongwon Park, Xin Li, Zhi-Song Liu, Li-Wen Wang, Yuanfei Huang, Hongzhi Zhang, Jingdong Liu, Xiaotong Luo, Chen Hong, Fahad Shahbaz Khan, Zewei He, Chao Dong, Pengju Liu, Se Young Chun, Zheng Tan, Yanpeng Cao, Joon-Hee Choi, Chao Li, Xiaopeng Sun, Syed Waqas Zamir, Kai Zhang, Hong Cai, Youngjung Kim, Qiuyu Li, Radu Timofte, Ai Matsune, Yukai Shi, Xiao Liu, Wen Lu, Salman H. Khan, Mengyan Li, Wushao Wen, Huaijuan Zang, Mingli Song, Junhyung Kwak, Shang-Chih Chuang, Hanwen Liu, Xinbo Gao, Du Chen, Ding Yukang, Siliang Tang, Aditya Arora, Zhang Wenjie, Lishan Huang, Jinjin Gu, Ruxin wang, and Dan Zhu
Subjects: business.industry, Computer science, Track (disk drive), ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 020206 networking & telecommunications, 02 engineering and technology, Real image, Scale factor, Superresolution, Image (mathematics), 0202 electrical engineering, electronic engineering, information engineering, Focal length, 020201 artificial intelligence & image processing, Computer vision, Artificial intelligence, business, Focus (optics), Image resolution, Image restoration
Abstract: This paper reviewed the 3rd NTIRE challenge on single-image super-resolution (restoration of rich details in a low-resolution image) with a focus on proposed solutions and results. The challenge had 1 track, which was aimed at the real-world single image super-resolution problem with an unknown scaling factor. Participants were mapping low-resolution images captured by a DSLR camera with a shorter focal length to their high-resolution images captured at a longer focal length. With this challenge, we introduced a novel real-world super-resolution dataset (RealSR). The track had 403 registered participants, and 36 teams competed in the final testing phase. They gauge the state-of-the-art in real-world single image super-resolution.
Published: 2019

47. Learning Digital Camera Pipeline for Extreme Low-Light Imaging

Author: Fahad Shahbaz Khan, Aditya Arora, Salman Khan, Ling Shao, and Syed Waqas Zamir
Subjects: FOS: Computer and information sciences, 0209 industrial biotechnology, business.product_category, Computer science, Computer Vision and Pattern Recognition (cs.CV), Cognitive Neuroscience, media_common.quotation_subject, sRGB, Pipeline (computing), ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Computer Science - Computer Vision and Pattern Recognition, 02 engineering and technology, 020901 industrial engineering & automation, Artificial Intelligence, Perception, 0202 electrical engineering, electronic engineering, information engineering, Quality (business), Computer vision, Visual artifact, media_common, Digital camera, Artificial neural network, business.industry, Computer Science Applications, Computer Science::Computer Vision and Pattern Recognition, Metric (mathematics), 020201 artificial intelligence & image processing, Artificial intelligence, business
Abstract: In low-light conditions, a conventional camera imaging pipeline produces sub-optimal images that are usually dark and noisy due to a low photon count and low signal-to-noise ratio (SNR). We present a data-driven approach that learns the desired properties of well-exposed images and reflects them in images that are captured in extremely low ambient light environments, thereby significantly improving the visual quality of these low-light images. The recent works on this problem only consider a pixel-level loss metric that ignores perceptual quality and thus generate outputs susceptible to visual artifacts. To address this problem, we propose a new loss function that exploits the characteristics of both pixel-wise and perceptual metrics, enabling our deep neural network to learn the camera processing pipeline to transform the short-exposure, low-light RAW sensor data to well-exposed sRGB images. The results show that our method outperforms the state-of-the-art according to psychophysical tests as well as pixel-wise standard metrics and recent learning-based perceptual image quality measures. In essence, the proposed model can potentially replace the conventional digital camera pipeline for the specific case of extreme low-light imaging.
Published: 2019

48. Striking the Right Balance with Uncertainty

Author: Syed Waqas Zamir, Ling Shao, Salman Khan, Jianbing Shen, and Munawar Hayat
Subjects: FOS: Computer and information sciences, Covariance matrix, business.industry, Generalization, Computer science, Deep learning, Gaussian, Feature vector, Computer Vision and Pattern Recognition (cs.CV), Bayesian probability, Computer Science - Computer Vision and Pattern Recognition, 020206 networking & telecommunications, Pattern recognition, Sample (statistics), 02 engineering and technology, Class (biology), symbols.namesake, 0202 electrical engineering, electronic engineering, information engineering, symbols, 020201 artificial intelligence & image processing, Artificial intelligence, business, Representation (mathematics)
Abstract: Learning unbiased models on imbalanced datasets is a significant challenge. Rare classes tend to get a concentrated representation in the classification space which hampers the generalization of learned boundaries to new test examples. In this paper, we demonstrate that the Bayesian uncertainty estimates directly correlate with the rarity of classes and the difficulty level of individual samples. Subsequently, we present a novel framework for uncertainty based class imbalance learning that follows two key insights: First, classification boundaries should be extended further away from a more uncertain (rare) class to avoid overfitting and enhance its generalization. Second, each sample should be modeled as a multi-variate Gaussian distribution with a mean vector and a covariance matrix defined by the sample's uncertainty. The learned boundaries should respect not only the individual samples but also their distribution in the feature space. Our proposed approach efficiently utilizes sample and class uncertainty information to learn robust features and more generalizable classifiers. We systematically study the class imbalance problem and derive a novel loss formulation for max-margin learning based on Bayesian uncertainty measure. The proposed method shows significant performance improvements on six benchmark datasets for face verification, attribute prediction, digit/object classification and skin lesion detection., CVPR 2019
Published: 2019

49. NTIRE 2019 challenge on image enhancement: Methods and results

Author: Ling Shao, Liang Lin, Flavio Piccoli, Xingguang Zhou, Dongwon Park, Syed Waqas Zamir, Lai-Kuan Wong, Greg Shakhnarovich, Cheolkon Jung, Hongzhi Zhang, Andrey Ignatov, Xiaochao Qu, Pengxu Wei, Zhiwei Zhong, Zheng Hui, Kazutoshi Akita, Jinghui Qin, Xinbo Gao, Pablo Navarrete Michelini, Wushao Wen, Jingdong Liu, Radu Timofte, Jie Liu, Jiye Liu, Salman Khan, Norimichi Ukita, Hanwen Liu, Wangmeng Zuo, Muhammad Haris, Yukai Shi, Debin Zhao, Fahad Shahbaz Khan, Pengfei Wan, Ganapathy Krishnamurthi, Xianming Liu, Se Young Chun, Simone Bianco, Tomoki Yoshida, Ting Liu, Xiumei Wang, Kai Zhang, Junjun Jiang, Claudio Cusano, John See, Nelson Chong Ngee Bow, Lishan Huang, Pengju Liu, Raimondo Schettini, Mahendra Khened, Kanti Kumari, Aditya Arora, Vikas Kumar Anand, Dan Zhu, Ignatov, A, Timofte, R, Qu, X, Zhou, X, Liu, T, Wan, P, Zamir, S, Arora, A, Khan, S, Khan, F, Shao, L, Park, D, Chun, S, Michelini, P, Liu, H, Zhu, D, Zhong, Z, Liu, X, Jiang, J, Zhao, D, Haris, M, Akita, K, Yoshida, T, Shakhnarovich, G, Ukita, N, Liu, J, Jung, C, Schettini, R, Bianco, S, Cusano, C, Piccoli, F, Liu, P, Zhang, K, Zhang, H, Zuo, W, Bow, N, Wong, L, See, J, Qin, J, Huang, L, Shi, Y, Wei, P, Wen, W, Lin, L, Hui, Z, Wang, X, Gao, X, Kumari, K, Anand, V, Khened, M, and Krishnamurthi, G
Subjects: 0209 industrial biotechnology, Computer science, Image quality, Structural similarity, business.industry, media_common.quotation_subject, image enhancement, image quality, tone adjustment, cameras, smartphones, task analysis, visualization, computer vision, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 02 engineering and technology, Visualization, Image (mathematics), 020901 industrial engineering & automation, Perception, Metric (mathematics), 0202 electrical engineering, electronic engineering, information engineering, Contrast (vision), 020201 artificial intelligence & image processing, Computer vision, Artificial intelligence, business, Focus (optics), Image resolution, media_common
Abstract: This paper reviews the first NTIRE challenge on perceptual image enhancement with the focus on proposed solutions and results. The participating teams were solving a real-world photo enhancement problem, where the goal was to map low-quality photos from the iPhone 3GS device to the same photos captured with Canon 70D DSLR camera. The considered problem embraced a number of computer vision subtasks, such as image denoising, image resolution and sharpness enhancement, image color/contrast/exposure adjustment, etc. The target metric used in this challenge combined PSNR and SSIM scores with solutions' perceptual results measured in the user study. The proposed solutions significantly improved baseline results, defining the state-of-the-art for practical image enhancement.
Published: 2019

50. NTIRE 2019 challenge on real image denoising: Methods and results

Author: Kazutoshi Akita, Thomas S. Huang, Simone Zini, Raimondo Schettini, Jae-Ryun Chung, Bumjun Park, Chuan Wang, Sang-Won Lee, Seung-Won Jung, Simone Bianco, Lei Zhang, Yiyun Zhao, Yuchen Fan, Yifan Ding, Greg Shakhnarovich, Se Young Chun, Hongwei Yong, Ling Shao, Deyu Meng, Wangmeng Zuo, Chi Li, Salman Khan, Tomoki Yoshida, Chang Chen, Ding Liu, Dongwon Park, Wenyi Tang, Zhiwei Xiong, Syed Waqas Zamir, Yuqian Zhou, Norimichi Ukita, Haoqiang Fan, Seung-Wook Kim, Jue Wang, Zhiguo Cao, Yuzhi Wang, Radu Timofte, Dong-Wook Kim, Sung-Jea Ko, Fahad Shahbaz Khan, Magauiya Zhussip, Dong-Pan Lim, Seo-Won Ji, Yang Wang, Muhammad Haris, Aditya Arora, Michael S. Brown, Shakarim Soltanayev, Jiaming Liu, Qin Xu, Abdelrahman Abdelhamed, Shaofan Cai, Kai Zhang, Jechang Jeong, Chi-Hao Wu, Songhyun Yu, Yue Lu, Pengliang Tang, Abdelhamed, A, Timofte, R, Brown, M, Yu, S, Park, B, Jeong, J, Jung, S, Kim, D, Chung, J, Liu, J, Wang, Y, Wu, C, Xu, Q, Wang, C, Cai, S, Ding, Y, Fan, H, Wang, J, Zhang, K, Zuo, W, Zhussip, M, Park, D, Soltanayev, S, Chun, S, Xiong, Z, Chen, C, Haris, M, Akita, K, Yoshida, T, Shakhnarovich, G, Ukita, N, Zamir, S, Arora, A, Khan, S, Khan, F, Shao, L, Ko, S, Lim, D, Kim, S, Ji, S, Lee, S, Tang, W, Fan, Y, Zhou, Y, Liu, D, Huang, T, Meng, D, Zhang, L, Yong, H, Zhao, Y, Tang, P, Lu, Y, Schettini, R, Bianco, S, Zini, S, Li, C, and Cao, Z
Subjects: Noise measurement, Computer science, business.industry, Noise reduction, sRGB, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 0211 other engineering and technologies, INF/01 - INFORMATICA, 02 engineering and technology, Color space, Real image, Image denoising, 0202 electrical engineering, electronic engineering, information engineering, RGB color model, 020201 artificial intelligence & image processing, Computer vision, Artificial intelligence, Focus (optics), business, 021101 geological & geomatics engineering
Abstract: This paper reviews the NTIRE 2019 challenge on real image denoising with focus on the proposed methods and their results. The challenge has two tracks for quantitatively evaluating image denoising performance in (1) the Bayer-pattern raw-RGB and (2) the standard RGB (sRGB) color spaces. The tracks had 216 and 220 registered participants, respectively. A total of 15 teams, proposing 17 methods, competed in the final phase of the challenge. The proposed methods by the 15 teams represent the current state-of-the-art performance in image denoising targeting real noisy images.
Published: 2019

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

63 results on '"Syed Waqas Zamir"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources