Author: "Deng, Xueqing" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Deng, Xueqing"' showing total 209 results


1. Randomized Autoregressive Visual Generation

Author: Yu, Qihang, He, Ju, Deng, Xueqing, Shen, Xiaohui, and Chen, Liang-Chieh
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper presents Randomized AutoRegressive modeling (RAR) for visual generation, which sets a new state-of-the-art performance on the image generation task while maintaining full compatibility with language modeling frameworks. The proposed RAR is simple: during a standard autoregressive training process with a next-token prediction objective, the input sequence, typically ordered in raster form, is randomly permuted into different factorization orders with a probability r, where r starts at 1 and linearly decays to 0 over the course of training. This annealing training strategy enables the model to learn to maximize the expected likelihood over all factorization orders and thus effectively improve the model's capability of modeling bidirectional contexts. Importantly, RAR preserves the integrity of the autoregressive modeling framework, ensuring full compatibility with language modeling while significantly improving performance in image generation. On the ImageNet-256 benchmark, RAR achieves an FID score of 1.48, not only surpassing prior state-of-the-art autoregressive image generators but also outperforming leading diffusion-based and masked transformer-based methods. Code and models will be made available at https://github.com/bytedance/1d-tokenizer, Comment: simple method improving autoregressive image generator to SOTA performance; Project page at https://yucornetto.github.io/projects/rar.html
Published: 2024
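
The annealing schedule described in the abstract above can be sketched in a few lines. This is only an illustration built from the abstract's wording (r starts at 1 and decays linearly to 0). The function names, the step-based notion of training progress, and the use of a uniform random shuffle are assumptions, not the authors' implementation.

```python
import random


def permutation_probability(step: int, total_steps: int) -> float:
    """Hypothetical schedule: r = 1.0 at step 0, decaying linearly to 0.0."""
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    progress = min(max(step / total_steps, 0.0), 1.0)
    return 1.0 - progress


def maybe_permute(tokens, step, total_steps, rng):
    """With probability r, return the tokens in a random order.

    Otherwise the raster order is kept. The order is returned too, since a
    model trained this way would need to know each token's original position.
    """
    r = permutation_probability(step, total_steps)
    order = list(range(len(tokens)))
    if rng.random() < r:
        rng.shuffle(order)  # assumed: uniform random factorization order
    return [tokens[i] for i in order], order


if __name__ == "__main__":
    rng = random.Random(0)
    raster = list(range(8))
    for step in (0, 50, 100):
        permuted, _ = maybe_permute(raster, step, 100, rng)
        print(step, permutation_probability(step, 100), permuted)
```

Early in training almost every sequence is permuted, and by the final step every sequence is in raster order, which is what lets the result behave like a standard autoregressive model at the end.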

2. MaskBit: Embedding-free Image Generation via Bit Tokens

Author: Weber, Mark, Yu, Lijun, Yu, Qihang, Deng, Xueqing, Shen, Xiaohui, Cremers, Daniel, and Chen, Liang-Chieh
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Masked transformer models for class-conditional image generation have become a compelling alternative to diffusion models. Typically comprising two stages - an initial VQGAN model for transitioning between latent space and image space, and a subsequent Transformer model for image generation within latent space - these frameworks offer promising avenues for image synthesis. In this study, we present two primary contributions: Firstly, an empirical and systematic examination of VQGANs, leading to a modernized VQGAN. Secondly, a novel embedding-free generation network operating directly on bit tokens - a binary quantized representation of tokens with rich semantics. The first contribution furnishes a transparent, reproducible, and high-performing VQGAN model, enhancing accessibility and matching the performance of current state-of-the-art methods while revealing previously undisclosed details. The second contribution demonstrates that embedding-free image generation using bit tokens achieves a new state-of-the-art FID of 1.52 on the ImageNet 256x256 benchmark, with a compact generator model of a mere 305M parameters., Comment: Project page: https://weber-mark.github.io/projects/maskbit.html
Published: 2024

3. An Image is Worth 32 Tokens for Reconstruction and Generation

Author: Yu, Qihang, Weber, Mark, Deng, Xueqing, Shen, Xiaohui, Cremers, Daniel, and Chen, Liang-Chieh
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent advancements in generative models have highlighted the crucial role of image tokenization in the efficient synthesis of high-resolution images. Tokenization, which transforms images into latent representations, reduces computational demands compared to directly processing pixels and enhances the effectiveness and efficiency of the generation process. Prior methods, such as VQGAN, typically utilize 2D latent grids with fixed downsampling factors. However, these 2D tokenizations face challenges in managing the inherent redundancies present in images, where adjacent regions frequently display similarities. To overcome this issue, we introduce Transformer-based 1-Dimensional Tokenizer (TiTok), an innovative approach that tokenizes images into 1D latent sequences. TiTok provides a more compact latent representation, yielding substantially more efficient and effective representations than conventional techniques. For example, a 256 x 256 x 3 image can be reduced to just 32 discrete tokens, a significant reduction from the 256 or 1024 tokens obtained by prior methods. Despite its compact nature, TiTok achieves competitive performance to state-of-the-art approaches. Specifically, using the same generator framework, TiTok attains 1.97 gFID, outperforming the MaskGIT baseline significantly by 4.21 on the ImageNet 256 x 256 benchmark. The advantages of TiTok become even more significant when it comes to higher resolution. On the ImageNet 512 x 512 benchmark, TiTok not only outperforms the state-of-the-art diffusion model DiT-XL/2 (gFID 2.74 vs. 3.04), but also reduces the image tokens by 64x, leading to a 410x faster generation process. Our best-performing variant significantly surpasses DiT-XL/2 (gFID 2.13 vs. 3.04) while still generating high-quality samples 74x faster., Comment: A compact 1D Image Tokenization method, leading to SOTA generation performance while being substantially faster. Project page at https://yucornetto.github.io/projects/titok.html
Published: 2024
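
The token counts quoted in the abstract above follow from simple grid arithmetic. The sketch below assumes the "256 or 1024 tokens" of prior methods correspond to 2D grids with downsampling factors of 16 and 8. The abstract does not state those factors, and the helper name is hypothetical.

```python
def grid_token_count(image_size: int, downsample_factor: int) -> int:
    """Number of tokens in a square 2D latent grid for a square image."""
    if image_size % downsample_factor != 0:
        raise ValueError("image_size must be divisible by downsample_factor")
    side = image_size // downsample_factor
    return side * side


TITOK_TOKENS = 32  # 1D token count quoted in the abstract for a 256 x 256 image

if __name__ == "__main__":
    for factor in (16, 8):  # assumed downsampling factors of prior 2D tokenizers
        grid = grid_token_count(256, factor)
        ratio = grid // TITOK_TOKENS
        print(f"factor {factor}: {grid} grid tokens, {ratio}x more than {TITOK_TOKENS}")
```

Under these assumptions a factor of 16 gives 256 tokens and a factor of 8 gives 1024, matching the two figures the abstract attributes to prior methods.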

4. CorrespondentDream: Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences

Author: Kim, Seungwook, Li, Kejie, Deng, Xueqing, Shi, Yichun, Cho, Minsu, and Wang, Peng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Leveraging multi-view diffusion models as priors for 3D optimization has alleviated the problem of 3D consistency, e.g., the Janus face problem or the content drift problem, in zero-shot text-to-3D models. However, the 3D geometric fidelity of the output remains an unresolved issue; although the rendered 2D views are realistic, the underlying geometry may contain errors such as unreasonable concavities. In this work, we propose CorrespondentDream, an effective method to leverage annotation-free, cross-view correspondences yielded from the diffusion U-Net to provide an additional 3D prior to the NeRF optimization process. We find that these correspondences are strongly consistent with human perception, and by adopting them in our loss design, we are able to produce NeRF models with geometries that are more coherent with common sense, e.g., smoother object surfaces, yielding higher 3D fidelity. We demonstrate the efficacy of our approach through various comparative qualitative results and a solid user study., Comment: 25 pages, 22 figures, accepted to CVPR 2024
Published: 2024

5. COCONut: Modernizing COCO Segmentation

Author: Deng, Xueqing, Yu, Qihang, Wang, Peng, Shen, Xiaohui, and Chen, Liang-Chieh
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In recent decades, the vision community has witnessed remarkable progress in visual recognition, partially owing to advancements in dataset benchmarks. Notably, the established COCO benchmark has propelled the development of modern detection and segmentation systems. However, the COCO segmentation benchmark has seen comparatively slow improvement over the last decade. Originally equipped with coarse polygon annotations for thing instances, it gradually incorporated coarse superpixel annotations for stuff regions, which were subsequently heuristically amalgamated to yield panoptic segmentation annotations. These annotations, executed by different groups of raters, have resulted not only in coarse segmentation masks but also in inconsistencies between segmentation types. In this study, we undertake a comprehensive reevaluation of the COCO segmentation annotations. By enhancing the annotation quality and expanding the dataset to encompass 383K images with more than 5.18M panoptic masks, we introduce COCONut, the COCO Next Universal segmenTation dataset. COCONut harmonizes segmentation annotations across semantic, instance, and panoptic segmentation with meticulously crafted high-quality masks, and establishes a robust benchmark for all segmentation tasks. To our knowledge, COCONut stands as the inaugural large-scale universal segmentation dataset, verified by human raters. We anticipate that the release of COCONut will significantly contribute to the community's ability to assess the progress of novel neural networks., Comment: Accepted at CVPR2024, data available at https://xdeng7.github.io/coconut.github.io/
Published: 2024

6. A Simple Video Segmenter by Tracking Objects Along Axial Trajectories

Author: He, Ju, Yu, Qihang, Shin, Inkyu, Deng, Xueqing, Yuille, Alan, Shen, Xiaohui, and Chen, Liang-Chieh
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Video segmentation requires consistently segmenting and tracking objects over time. Due to the quadratic dependency on input size, directly applying self-attention to video segmentation with high-resolution input features poses significant challenges, often leading to insufficient GPU memory capacity. Consequently, modern video segmenters either extend an image segmenter without incorporating any temporal attention or resort to window space-time attention in a naive manner. In this work, we present Axial-VS, a general and simple framework that enhances video segmenters by tracking objects along axial trajectories. The framework tackles video segmentation through two sub-tasks: short-term within-clip segmentation and long-term cross-clip tracking. In the first step, Axial-VS augments an off-the-shelf clip-level video segmenter with the proposed axial-trajectory attention, sequentially tracking objects along the height- and width-trajectories within a clip, thereby enhancing temporal consistency by capturing motion trajectories. The axial decomposition significantly reduces the computational complexity for dense features, and outperforms the window space-time attention in segmentation quality. In the second step, we further employ axial-trajectory attention to the object queries in clip-level segmenters, which are learned to encode object information, thereby aiding object tracking across different clips and achieving consistent segmentation throughout the video. Without bells and whistles, Axial-VS showcases state-of-the-art results on video segmentation benchmarks, emphasizing its effectiveness in addressing the limitations of modern clip-level video segmenters. Code and models are available at https://github.com/TACJu/Axial-VS., Comment: The paper and model names have been updated to better reflect the methodological contributions
Published: 2023

7. Selective Feature Adapter for Dense Vision Transformers

Author: Deng, Xueqing, Fan, Qi, Jin, Xiaojie, Yang, Linjie, and Wang, Peng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Fine-tuning pre-trained transformer models, e.g., Swin Transformer, is successful in numerous downstream dense prediction vision tasks. However, one major issue is the cost/storage of their huge number of parameters, which becomes increasingly challenging to handle with the growing number of vision tasks. In this paper, we propose an effective approach to alleviate the issue, namely selective feature adapter (SFA). It achieves state-of-the-art (SoTA) performance under any given budget of trainable parameters, and demonstrates comparable or better performance than fully fine-tuned models across various dense tasks. Specifically, SFA consists of external adapters and internal adapters which are sequentially operated over a transformer model. For external adapters, we properly select the places and amount of additional multilayer perceptron (MLP). For internal adapters, we transform a few task-important parameters inside the transformer, which are automatically discovered through a simple yet effective lottery ticket algorithm. Our experiments show that the dual adapter module, a.k.a. SFA, is essential to achieve the best trade-off on dense vision tasks, such as segmentation, detection and depth-estimation, outperforming other adapters with a single module.
Published: 2023

8. Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP

Author: Yu, Qihang, He, Ju, Deng, Xueqing, Shen, Xiaohui, and Chen, Liang-Chieh
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Open-vocabulary segmentation is a challenging task requiring segmenting and recognizing objects from an open set of categories. One way to address this challenge is to leverage multi-modal models, such as CLIP, to provide image and text features in a shared embedding space, which bridges the gap between closed-vocabulary and open-vocabulary recognition. Hence, existing methods often adopt a two-stage framework to tackle the problem, where the inputs first go through a mask generator and then through the CLIP model along with the predicted masks. This process involves extracting features from images multiple times, which can be ineffective and inefficient. By contrast, we propose to build everything into a single-stage framework using a shared Frozen Convolutional CLIP backbone, which not only significantly simplifies the current two-stage pipeline, but also remarkably yields a better accuracy-cost trade-off. The proposed FC-CLIP benefits from the following observations: the frozen CLIP backbone maintains the ability of open-vocabulary classification and can also serve as a strong mask generator, and the convolutional CLIP generalizes well to a larger input resolution than the one used during contrastive image-text pretraining. When training on COCO panoptic data only and testing in a zero-shot manner, FC-CLIP achieves 26.8 PQ, 16.8 AP, and 34.1 mIoU on ADE20K, 18.2 PQ, 27.9 mIoU on Mapillary Vistas, 44.0 PQ, 26.8 AP, 56.2 mIoU on Cityscapes, outperforming the prior art by +4.2 PQ, +2.4 AP, +4.2 mIoU on ADE20K, +4.0 PQ on Mapillary Vistas and +20.1 PQ on Cityscapes, respectively. Additionally, the training and testing time of FC-CLIP is 7.5x and 6.6x significantly faster than the same prior art, while using 5.9x fewer parameters. FC-CLIP also sets a new state-of-the-art performance across various open-vocabulary semantic segmentation datasets. Code at https://github.com/bytedance/fc-clip, Comment: NeurIPS 2023 camera ready. Code and model available at https://github.com/bytedance/fc-clip
Published: 2023

9. MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval

Author: Jin, Xiaojie, Zhang, Bowen, Gong, Weibo, Xu, Kai, Deng, XueQing, Wang, Peng, Zhang, Zhao, Shen, Xiaohui, and Feng, Jiashi
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: State-of-the-art video-text retrieval (VTR) methods typically involve fully fine-tuning a pre-trained model (e.g. CLIP) on specific datasets. However, this can result in significant storage costs in practical applications as a separate model per task must be stored. To address this issue, we present our pioneering work that enables parameter-efficient VTR using a pre-trained model, with only a small number of tunable parameters during training. Towards this goal, we propose a new method dubbed Multimodal Video Adapter (MV-Adapter) for efficiently transferring the knowledge in the pre-trained CLIP from image-text to video-text. Specifically, MV-Adapter utilizes bottleneck structures in both video and text branches, along with two novel components. The first is a Temporal Adaptation Module that is incorporated in the video branch to introduce global and local temporal contexts. We also train weight calibrations to adjust to dynamic variations across frames. The second is Cross Modality Tying that generates weights for video/text branches through sharing cross modality factors, for better alignment between modalities. Thanks to the above innovations, MV-Adapter can achieve comparable or better performance than standard full fine-tuning with negligible parameter overhead. Notably, MV-Adapter consistently outperforms various competing methods in V2T/T2V tasks with large margins on five widely used VTR benchmarks (MSR-VTT, MSVD, LSMDC, DiDemo, and ActivityNet).
Published: 2023

10. GeoAI at ACM SIGSPATIAL: The New Frontier of Geospatial Artificial Intelligence Research

Author: Lunga, Dalton, Hu, Yingjie, Newsam, Shawn, Gao, Song, Martins, Bruno, Yang, Lexie, and Deng, Xueqing
Subjects: Computer Science - Artificial Intelligence
Abstract: Geospatial Artificial Intelligence (GeoAI) is an interdisciplinary field enjoying tremendous adoption. However, the efficient design and implementation of GeoAI systems face many open challenges. This is mainly due to the lack of standardized approaches to artificial intelligence tool development, inadequate platforms, and a lack of multidisciplinary engagements, which all motivate domain experts to seek a shared stage with scientists and engineers to solve problems of significant impact on society. Since its inception in 2017, the GeoAI series of workshops has been co-located with the Association for Computing Machinery International Conference on Advances in Geographic Information Systems. The workshop series has fostered a nexus for geoscientists, computer scientists, engineers, entrepreneurs, and decision-makers, from academia, industry, and government to engage in artificial intelligence, spatiotemporal data computing, and geospatial data science research, motivated by various challenges. In this article, we revisit and discuss the state of GeoAI open research directions, the recent developments, and an emerging agenda calling for a continued cross-disciplinary community engagement., Comment: 12 pages, 1 figure, 1 table
Published: 2022

11. DistPro: Searching A Fast Knowledge Distillation Process via Meta Optimization

Author: Deng, Xueqing, Sun, Dawei, Newsam, Shawn, and Wang, Peng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent knowledge distillation (KD) studies show that different manually designed schemes impact the learned results significantly. Yet, in KD, automatically searching for an optimal distillation scheme has not yet been well explored. In this paper, we propose DistPro, a novel framework which searches for an optimal KD process via differentiable meta-learning. Specifically, given a pair of student and teacher networks, DistPro first sets up a rich set of KD connections from the transmitting layers of the teacher to the receiving layers of the student, and in the meanwhile, various transforms are also proposed for comparing feature maps along its pathway for the distillation. Then, each combination of a connection and a transform choice (pathway) is associated with a stochastic weighting process which indicates its importance at every step during the distillation. In the searching stage, the process can be effectively learned through our proposed bi-level meta-optimization strategy. In the distillation stage, DistPro adopts the learned processes for knowledge distillation, which significantly improves the student accuracy especially when faster training is required. Lastly, we find the learned processes can be generalized between similar tasks and networks. In our experiments, DistPro produces state-of-the-art (SoTA) accuracy under varying numbers of learning epochs on popular datasets, i.e. CIFAR100 and ImageNet, which demonstrates the effectiveness of our framework., Comment: 14 pages, 5 figures
Published: 2022

12. NightLab: A Dual-level Architecture with Hardness Detection for Segmentation at Night

Author: Deng, Xueqing, Wang, Peng, Lian, Xiaochen, and Newsam, Shawn
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The semantic segmentation of nighttime scenes is a challenging problem that is key to impactful applications like self-driving cars. Yet, it has received little attention compared to its daytime counterpart. In this paper, we propose NightLab, a novel nighttime segmentation framework that leverages multiple deep learning models imbued with night-aware features to yield State-of-The-Art (SoTA) performance on multiple night segmentation benchmarks. Notably, NightLab contains models at two levels of granularity, i.e. image and regional, and each level is composed of light adaptation and segmentation modules. Given a nighttime image, the image level model provides an initial segmentation estimate while, in parallel, a hardness detection module identifies regions and their surrounding context that need further analysis. A regional level model focuses on these difficult regions to provide a significantly improved segmentation. All the models in NightLab are trained end-to-end using a set of proposed night-aware losses without handcrafted heuristics. Extensive experiments on the NightCity and BDD100K datasets show NightLab achieves SoTA performance compared to concurrent methods., Comment: 8 pages, 6 figures, accepted at CVPR 2022
Published: 2022

13. AutoAdapt: Automated Segmentation Network Search for Unsupervised Domain Adaptation

Author: Deng, Xueqing, Zhu, Yi, Tian, Yuxin, and Newsam, Shawn
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Neural network-based semantic segmentation has achieved remarkable results when large amounts of annotated data are available, that is, in the supervised case. However, such data is expensive to collect and so methods have been developed to adapt models trained on related, often synthetic data for which labels are readily available. Current adaptation approaches do not consider the dependence of the generalization/transferability of these models on network architecture. In this paper, we perform neural architecture search (NAS) to provide architecture-level perspective and analysis for domain adaptation. We identify the optimization gap that exists when searching architectures for unsupervised domain adaptation which makes this NAS problem uniquely difficult. We propose bridging this gap by using maximum mean discrepancy and regional weighted entropy to estimate the accuracy metric. Experimental results on several widely adopted benchmarks show that our proposed AutoAdapt framework indeed discovers architectures that improve the performance of a number of existing adaptation techniques., Comment: short version has been accepted at 1st NAS workshop co-organized with CVPR 2021
Published: 2021

14. Unveiling causal connections: Long-term particulate matter exposure and type 2 diabetes mellitus mortality in Southern China

Author: Guo, Tong, Cheng, Xi, Wei, Jing, Chen, Shirui, Zhang, Yuqin, Lin, Shao, Deng, Xinlei, Qu, Yanji, Lin, Ziqiang, Chen, Shimin, Li, Zhiqiang, Sun, Jie, Chen, Xudan, Chen, Zhibing, Sun, Xurui, Chen, Dan, Ruan, Xingling, Tuohetasen, Shaniduhaxi, Li, Xinyue, Zhang, Man, Sun, Yongqing, Zhu, Shuming, Deng, Xueqing, Hao, Yuantao, Jing, Qinlong, and Zhang, Wangjian
Published: 2024

15. Scale Aware Adaptation for Land-Cover Classification in Remote Sensing Imagery

Author: Deng, Xueqing, Zhu, Yi, Tian, Yuxin, and Newsam, Shawn
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Land-cover classification using remote sensing imagery is an important Earth observation task. Recently, land cover classification has benefited from the development of fully connected neural networks for semantic segmentation. The benchmark datasets available for training deep segmentation models in remote sensing imagery tend to be small, however, often consisting of only a handful of images from a single location with a single scale. This limits the models' ability to generalize to other datasets. Domain adaptation has been proposed to improve the models' generalization but we find these approaches are not effective for dealing with the scale variation commonly found between remote sensing image collections. We therefore propose a scale aware adversarial learning framework to perform joint cross-location and cross-scale land-cover classification. The framework has a dual discriminator architecture with a standard feature discriminator as well as a novel scale discriminator. We also introduce a scale attention module which produces scale-enhanced features. Experimental results show that the proposed framework outperforms state-of-the-art domain adaptation methods by a large margin., Comment: The open-sourced codes are available on Github: https://github.com/xdeng7/scale-aware_da
Published: 2020

16. Residential greenness and chronic obstructive pulmonary disease in a large cohort in southern China: Potential causal links, risk trajectories, and mediation pathways

Author: Wu, Wenjing, Chen, Dan, Ruan, Xingling, Wu, Gonghua, Deng, Xinlei, Lawrence, Wayne, Lin, Xiao, Li, Zhiqiang, Wang, Ying, Lin, Ziqiang, Zhu, Shuming, Deng, Xueqing, Lin, Qiaoxuan, Hao, Chun, Du, Zhicheng, Wei, Jing, Zhang, Wangjian, and Hao, Yuantao
Published: 2024

17. Generalizing Deep Models for Overhead Image Segmentation Through Getis-Ord Gi* Pooling

Author: Deng, Xueqing, Zhu, Yi, Tian, Yuxin, and Newsam, Shawn
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: That most deep learning models are purely data driven is both a strength and a weakness. Given sufficient training data, the optimal model for a particular problem can be learned. However, this is usually not the case and so instead the model is either learned from scratch from a limited amount of training data or pre-trained on a different problem and then fine-tuned. Both of these situations are potentially suboptimal and limit the generalizability of the model. Inspired by this, we investigate methods to inform or guide deep learning models for geospatial image analysis to increase their performance when a limited amount of training data is available or when they are applied to scenarios other than those they were trained on. In particular, we exploit the fact that there are certain fundamental rules as to how things are distributed on the surface of the Earth and these rules do not vary substantially between locations. Based on this, we develop a novel feature pooling method for convolutional neural networks using Getis-Ord Gi* analysis from geostatistics. Experimental results show our proposed pooling function has significantly better generalization performance compared to a standard data-driven approach when applied to overhead image segmentation.
Published: 2019

18. Using Conditional Generative Adversarial Networks to Generate Ground-Level Views From Overhead Imagery

Author: Deng, Xueqing, Zhu, Yi, and Newsam, Shawn
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper develops a deep-learning framework to synthesize a ground-level view of a location given an overhead image. We propose a novel conditional generative adversarial network (cGAN) in which the trained generator generates realistic looking and representative ground-level images using overhead imagery as auxiliary information. The generator is an encoder-decoder network which allows us to compare low- and high-level features as well as their concatenation for encoding the overhead imagery. We also demonstrate how our framework can be used to perform land cover classification by modifying the trained cGAN to extract features from overhead imagery. This is interesting because, although we are using this modified cGAN as a feature extractor for overhead imagery, it incorporates knowledge of how locations look from the ground., Comment: 5 pages. arXiv admin note: text overlap with arXiv:1806.05129
Published: 2019

19. Unilateral biportal endoscopic laminectomy with lateral mass screw fixation for treating cervical spinal stenosis

Author: Zhu, Chengyue, Deng, Xueqing, Pan, Hao, and Zhang, Wei
Published: 2022

20. Region Convolutional Features for Multi-Label Remote Sensing Image Retrieval

Author: Zhou, Weixun, Deng, Xueqing, and Shao, Zhenfeng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Conventional remote sensing image retrieval (RSIR) systems usually perform single-label retrieval where each image is annotated by a single label representing the most significant semantic content of the image. This assumption, however, ignores the complexity of remote sensing images, where an image might have multiple classes (i.e., multiple labels), thus resulting in worse retrieval performance. We therefore propose a novel multi-label RSIR approach with fully convolutional networks (FCN). In our approach, we first train an FCN model using a pixel-wise labeled dataset, and the trained FCN is then used to predict the segmentation maps of each image in the considered archive. We finally extract region convolutional features of each image based on its segmentation map. The region features can be either used to perform region-based retrieval or further post-processed to obtain a feature vector for similarity measure. The experimental results show that our approach achieves state-of-the-art performance in contrast to conventional single-label and recent multi-label RSIR approaches., Comment: 8 pages
Published: 2018

21. What Is It Like Down There? Generating Dense Ground-Level Views and Image Features From Overhead Imagery Using Conditional Generative Adversarial Networks

Author: Deng, Xueqing, Zhu, Yi, and Newsam, Shawn
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper investigates conditional generative adversarial networks (cGANs) to overcome a fundamental limitation of using geotagged media for geographic discovery, namely its sparse and uneven spatial distribution. We train a cGAN to generate ground-level views of a location given overhead imagery. We show the "fake" ground-level images are natural looking and are structurally similar to the real images. More significantly, we show the generated images are representative of the locations and that the representations learned by the cGANs are informative. In particular, we show that dense feature maps generated using our framework are more effective for land-cover classification than approaches which spatially interpolate features extracted from sparse ground-level images. To our knowledge, ours is the first work to use cGANs to generate ground-level views given overhead imagery and to explore the benefits of the learned representations., Comment: 10 pages, 5 figures, camera-ready version of ACM SIGSPATIAL 2018 (ORAL)
Published: 2018

22. Spatial Morphing Kernel Regression For Feature Interpolation

Author: Deng, Xueqing, Zhu, Yi, and Newsam, Shawn
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In recent years, geotagged social media has become popular as a novel source for geographic knowledge discovery. Ground-level images and videos provide a different perspective than overhead imagery and can be applied to a range of applications such as land use mapping, activity detection, pollution mapping, etc. The sparse and uneven distribution of this data presents a problem, however, for generating dense maps. We therefore investigate the problem of spatially interpolating the high-dimensional features extracted from sparse social media to enable dense labeling using standard classifiers. Further, we show how prior knowledge about region boundaries can be used to improve the interpolation through spatial morphing kernel regression. We show that an interpolate-then-classify framework can produce dense maps from sparse observations but that care must be taken in choosing the interpolation method. We also show that the spatial morphing kernel improves the results., Comment: accepted by ICIP 2018
Published: 2018

23. Fine-Grained Land Use Classification at the City Scale Using Ground-Level Images

Author: Zhu, Yi, Deng, Xueqing, and Newsam, Shawn
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Information Retrieval, Computer Science - Multimedia
Abstract: We perform fine-grained land use mapping at the city scale using ground-level images. Mapping land use is considerably more difficult than mapping land cover and is generally not possible using overhead imagery as it requires close-up views and seeing inside buildings. We postulate that the growing collections of georeferenced, ground-level images suggest an alternate approach to this geographic knowledge discovery problem. We develop a general framework that uses Flickr images to map 45 different land-use classes for the City of San Francisco. Individual images are classified using a novel convolutional neural network containing two streams, one for recognizing objects and another for recognizing scenes. This network is trained in an end-to-end manner directly on the labeled training images. We propose several strategies to overcome the noisiness of our user-generated data including search-based training set augmentation and online adaptive training. We derive a ground truth map of San Francisco in order to evaluate our method. We demonstrate the effectiveness of our approach through geo-visualization and quantitative analysis. Our framework achieves over 29% recall at the individual land parcel level which represents a strong baseline for the challenging 45-way land use classification problem especially given the noisiness of the image data.
Published: 2018

24. Quantitative Comparison of Open-Source Data for Fine-Grain Mapping of Land Use

Author: Deng, Xueqing and Newsam, Shawn
Subjects: Computer Science - Computers and Society
Abstract: This paper performs a quantitative comparison of open-source data available on the Internet for the fine-grain mapping of land use. Three points of interest (POI) data sources--Google Places, Bing Maps, and the Yellow Pages--and one volunteered geographic information data source--Open Street Map (OSM)--are compared with each other at the parcel level for San Francisco with respect to a proposed fine-grain land-use taxonomy. The sources are also compared to coarse-grain authoritative data which we consider to be the ground truth. Results show limited agreement among the data sources as well as limited accuracy with respect to the authoritative data even at coarse class granularity. We conclude that POI and OSM data do not appear to be sufficient alone for fine-grain land-use mapping., Comment: ACM SIGSPATIAL 2017 Workshop on Urban GIS
Published: 2017

25. α-Klotho: the hidden link between dietary inflammatory index and accelerated ageing.

Author: Li, Ruiqiang, Zhou, Baijing, Deng, Xueqing, Tian, Wenbo, Huang, Yingyue, Wang, Jiao, and Xu, Lin
Subjects: RISK assessment, PHENOMENOLOGICAL biology, MULTIVARIATE analysis, SURVEYS, GLUCURONIDASE, AGING, INFLAMMATION, FACTOR analysis, CONFIDENCE intervals, CELL receptors, DIET, ALGORITHMS, REGRESSION analysis, PHENOTYPES, BLOOD, DISEASE risk factors
Abstract: Recent studies suggest an association between greater dietary inflammatory index (DII) and higher biological ageing. As α-Klotho has been considered as a longevity protein, we examined whether α-Klotho plays a role in the association between DII and ageing. We included 3054 participants from the National Health and Nutrition Examination Survey. The associations of DII with biological and phenotypic age were assessed by multivariable linear regression, and the mediating role of α-Klotho was evaluated by mediation analyses. Participants' mean age was 58·0 years (sd 11·0), with a median DII score of 1·85 and interquartile range from 0·44 to 2·79. After adjusting for age, sex, race/ethnicity, BMI, education, marital status, poverty income ratio, serum cotinine, alcohol, physical activity, a higher DII was associated with both older biological age and phenotypic age, with per DII score increment being associated with a 1·01-year increase in biological age (1·01 (95 % CI: 1·005, 1·02)) and 1·01-year increase in phenotypic age (1·01 (1·001, 1·02)). Negative associations of DII with α-Klotho (β = –1·01 pg/ml, 95 % CI: –1·02, –1·006) and α-Klotho with biological age (β = –1·07 years, 95 % CI: –1·13, –1·02) and phenotypic age (β = –1·03 years, 95 % CI: –1·05, –1·01) were found. Furthermore, α-Klotho mediated 10·13 % (P < 0·001) and 9·61 % (P < 0·001) of the association of DII with biological and phenotypic age, respectively. Higher DII was associated with older biological and phenotypic age, and the potential detrimental effects could be partly mediated through α-Klotho. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

26. DistPro: Searching a Fast Knowledge Distillation Process via Meta Optimization

Author: Deng, Xueqing, Sun, Dawei, Newsam, Shawn, and Wang, Peng
Published: 2022
Full Text: View/download PDF

27. Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences

Author: Kim, Seungwook, Li, Kejie, Deng, Xueqing, Shi, Yichun, Cho, Minsu, and Wang, Peng
Abstract: Leveraging multi-view diffusion models as priors for 3D optimization has alleviated the problem of 3D consistency, e.g., the Janus face problem or the content drift problem, in zero-shot text-to-3D models. However, the 3D geometric fidelity of the output remains an unresolved issue; although the rendered 2D views are realistic, the underlying geometry may contain errors such as unreasonable concavities. In this work, we propose CorrespondentDream, an effective method to leverage annotation-free, cross-view correspondences yielded from the diffusion U-Net to provide an additional 3D prior to the NeRF optimization process. We find that these correspondences are strongly consistent with human perception, and by adopting them in our loss design, we are able to produce NeRF models with geometries that are more coherent with common sense, e.g., smoother object surfaces, yielding higher 3D fidelity. We demonstrate the efficacy of our approach through various comparative qualitative results and a solid user study., Comment: 25 pages, 22 figures, accepted to CVPR 2024
Published: 2024

28. A novel pellets/thermosensitive hydrogel depot with low burst release for long-term continuous drug release: Preparation, characterization, in vitro and in vivo studies

Author: Deng, Xueqing, Liu, Yangjia, Qin, Jianxiu, Ye, Tiantian, and Wang, Shujun
Published: 2020
Full Text: View/download PDF

29. Exploring the dynamics of RNA molecules with multiscale Gaussian network model

Author: Wang, Shihao, Gong, Weikang, Deng, Xueqing, Liu, Yang, and Li, Chunhua
Published: 2020
Full Text: View/download PDF

30. Contribution of diversity of social participation on the mental health of humanitarian migrants during resettlement

Author: Jiang, Weiqing, Yang, Yuwei, He, Yitong, Liu, Qianyu, Deng, Xueqing, Hua, Yilin, Hayixibayi, Alimila, Ni, Yanyan, and Guo, Lan
Published: 2024
Full Text: View/download PDF

31. Organising a juvenile ratio monitoring programme for 10 key waterbird species in the Yangtze River floodplain: analysis and proposals

Author: Wang, Yuxi, Damba, Iderbat, Zhao, Qingshan, Xie, Yanbo, Deng, Xueqing, Ga, Rdi, Liu, Guanhua, Xu, Zhiwen, Li, Yue, Gao, Dali, Xu, Wenbin, Chen, Guoxun, and Cao, Lei
Published: 2021
Full Text: View/download PDF

32. Effects of online family health education on myopia prevention in children by parental myopia: a randomized clinical trial.

Author: Zhang, Jiayu, Wang, Lili, Guo, Lan, Guo, Yangfeng, Zhao, Feng, Hu, Yin, Li, Qian, Du, Xueying, Deng, Xueqing, Deng, Nali, Yang, Xiao, and Lu, Ciyong
Subjects: MYOPIA, FAMILY health, HEALTH education, CLINICAL trials, VISION disorders, CHILD development
Abstract: Online family health education may be effective for myopia prevention in children, and the effects may be different between children with myopic and non-myopic parents. Myopia is a common cause of vision loss. The aims of this study were to evaluate the effects of online family health education on preventing the development of myopia in children, and to estimate whether the effects vary according to parental myopia. A cluster randomised trial including grade 1 and grade 2 children from 12 primary schools was conducted in Guangzhou, China. Weekly online family health education messages were sent to parents in the intervention group. Data collection included eye examinations of children and questionnaires completed by parents. Among the 3123 children included at baseline (1703 boys [54.5%]; mean [SD] age, 6.83 [0.73] years), 2376 completed the follow-up after 3 years. The differences in the incidence of myopia and myopic shift between the study groups were not significant in total. However, the 3-year cumulative incidence rate of myopia in the intervention group (125 of 445 [28.1%]) was significantly lower than that in the control group (225 of 603 [37.3%]; difference, 9.2% [95% CI, −14.9% to −3.5%]; P = 0.001) among children with non-myopic parents. In parallel, among children with non-myopic parents, the mean myopic change in SER was less for the intervention group than for the control group (−1.10 D vs. −1.24 D; difference, 0.13 D [95% CI, 0.03 to 0.23 D]; P = 0.01). Compared with children with myopic parents, online family health education was more effective in children with non-myopic parents. The incidence of myopia and myopic shift in refraction have been reduced in children with non-myopic parents. Further studies are needed to assess these differences by parental myopia. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

33. The gut-joint axis mediates the TNF-induced RA process and PBMT therapeutic effects through the metabolites of gut microbiota

Author: Meng, Qingtai, Lin, Monan, Song, Wuqi, Wu, Jiahui, Cao, Guoding, Huang, Ping, Su, Zaiyu, Gu, Wei, Deng, Xueqing, Xu, Peng, Yang, Yi, Li, Hui, Liu, Hailiang, and Zhang, Fengmin
Published: 2023
Full Text: View/download PDF

34. Association between childhood maltreatment and non-medical prescription opioid use among Chinese senior high school students: The moderating role of gender

Author: Lei, Yiling, Xi, Chuhao, Li, Pengsheng, Luo, Min, Wang, Wanxin, Pan, Siyuan, Gao, Xue, Xu, Yan, Huang, Guoliang, Deng, Xueqing, Guo, Lan, and Lu, Ciyong
Published: 2018
Full Text: View/download PDF

35. Sexual attraction and the nonmedical use of opioids and sedative drugs among Chinese adolescents

Author: Li, Pengsheng, Huang, Yeen, Guo, Lan, Wang, Wanxin, Xi, Chuhao, Lei, Yiling, Luo, Min, Pan, Siyuan, Deng, Xueqing, Zhang, Wei-Hong, and Lu, Ciyong
Published: 2018
Full Text: View/download PDF

36. Association between body weight status and suicidal ideation among Chinese adolescents: the moderating role of the child’s sex

Author: Guo, Lan, Xu, Yan, Huang, Guoliang, Gao, Xue, Deng, Xueqing, Luo, Min, Xi, Chuhao, Zhang, Wei-Hong, and Lu, Ciyong
Published: 2019
Full Text: View/download PDF

37. Generalization and Adaptation of Deep Learning Models for Semantic Segmentation

Author: Deng, Xueqing
Subjects: Computer science, deep learning, domain adaptation, generalization, semantic segmentation
Abstract: Thanks to the development of deep neural networks, a number of computer vision tasks have achieved great success. However, the focus has been mostly limited to benchmarks with regular scenes in a supervised training fashion. A deep learning model trained with perfect and ideal benchmark datasets can have difficulty when applied to real-world scenes where the data are captured under different settings, for example. This indicates the model has poor generalization capability. Problems also occur when a benchmark model is applied to a different real-world application than it was designed for and where the input data varies. Therefore, this dissertation seeks to improve model generalization and adaptation for the computer vision problem of semantic segmentation particularly for real-world applications.
Published: 2021

38. Analyses on clustering of the conserved residues at protein-RNA interfaces and its application in binding site identification

Author: Yang, Zhen, Deng, Xueqing, Liu, Yang, Gong, Weikang, and Li, Chunhua
Published: 2020
Full Text: View/download PDF

39. MaXTron: Mask Transformer with Trajectory Attention for Video Panoptic Segmentation

Author: He, Ju, Yu, Qihang, Shin, Inkyu, Deng, Xueqing, Shen, Xiaohui, Yuille, Alan, and Chen, Liang-Chieh
Abstract: Video panoptic segmentation requires consistently segmenting (for both 'thing' and 'stuff' classes) and tracking objects in a video over time. In this work, we present MaXTron, a general framework that exploits Mask XFormer with Trajectory Attention to tackle the task. MaXTron enriches an off-the-shelf mask transformer by leveraging trajectory attention. The deployed mask transformer takes as input a short clip consisting of only a few frames and predicts the clip-level segmentation. To enhance the temporal consistency, MaXTron employs within-clip and cross-clip tracking modules, efficiently utilizing trajectory attention. Originally designed for video classification, trajectory attention learns to model the temporal correspondences between neighboring frames and aggregates information along the estimated motion paths. However, it is nontrivial to directly extend trajectory attention to the per-pixel dense prediction tasks due to its quadratic dependency on input size. To alleviate the issue, we propose to adapt the trajectory attention for both the dense pixel features and object queries, aiming to improve the short-term and long-term tracking results, respectively. Particularly, in our within-clip tracking module, we propose axial-trajectory attention that effectively computes the trajectory attention for tracking dense pixels sequentially along the height- and width-axes. The axial decomposition significantly reduces the computational complexity for dense pixel features. In our cross-clip tracking module, since the object queries in mask transformer are learned to encode the object information, we are able to capture the long-term temporal connections by applying trajectory attention to object queries, which learns to track each object across different clips. Without bells and whistles, MaXTron demonstrates state-of-the-art performances on video segmentation benchmarks., Comment: Code at https://github.com/TACJu/MaXTron
Published: 2023

40. Bullying behaviors among Chinese school-aged youth: A prevalence and Correlates Study in Guangdong Province

Author: Wu, Jie, He, Yuan, Lu, Ciyong, Deng, Xueqing, Gao, Xue, Guo, Lan, Wu, Hong, Chan, Fanfan, and Zhou, Ying
Published: 2015
Full Text: View/download PDF

41. Report of the 5th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery (GeoAI 2022)

Author: Martins, Bruno, Lunga, Dalton, Gao, Song, Newsam, Shawn, Yang, Lexie, Deng, Xueqing, and Mai, Gengchen
Published: 2022
Full Text: View/download PDF

42. NightLab: A Dual-level Architecture with Hardness Detection for Segmentation at Night

Author: Deng, Xueqing, Wang, Peng, Lian, Xiaochen, and Newsam, Shawn
Published: 2022
Full Text: View/download PDF

43. Associations between childhood maltreatment and non‐medical use of prescription drugs among Chinese adolescents

Author: Guo, Lan, Xu, Yan, Deng, Jianxiong, Gao, Xue, Huang, Guoliang, Huang, Jinghui, Deng, Xueqing, Zhang, Wei‐Hong, and Lu, Ciyong
Published: 2017
Full Text: View/download PDF

44. Relationship between Arterial Stiffness and Renal Function Determined by Chronic Kidney Disease Epidemiology Collaboration (CKD-EPI) and Modification of Diet in Renal Disease (MDRD) Equations in a Chinese Cohort Undergoing Health Examination

Author: Tang, Biwen, Tu, Weichao, Zhao, Jiehui, Deng, Xueqing, Tan, Isabella, Butlin, Mark, Avolio, Alberto, and Zuo, Junli
Published: 2022
Full Text: View/download PDF

45. Alcohol use and alcohol-related problems among adolescents in China: A large-scale cross-sectional study

Author: Guo, Lan, Deng, Jianxiong, He, Yuan, Deng, Xueqing, Huang, Jinghui, Huang, Guoliang, Gao, Xue, Zhang, Wei-Hong, and Lu, Ciyong
Published: 2016
Full Text: View/download PDF

46. GeoAI at ACM SIGSPATIAL

Author: Lunga, Dalton, Hu, Yingjie, Newsam, Shawn, Gao, Song, Martins, Bruno, Yang, Lexie, and Deng, Xueqing
Published: 2021
Full Text: View/download PDF

47. Dynamics of binding interactions of TDP‐43 and RNA: An equally weighted multiscale elastic network model study

Author: Deng, Xueqing, Wang, Shihao, Han, Zhongjie, Gong, Weikang, Liu, Yang, and Li, Chunhua
Published: 2021
Full Text: View/download PDF

48. Additional file 1 of Organising a juvenile ratio monitoring programme for 10 key waterbird species in the Yangtze River floodplain: analysis and proposals

Author: Wang, Yuxi, Damba, Iderbat, Zhao, Qingshan, Xie, Yanbo, Deng, Xueqing, Ga, Rdi, Liu, Guanhua, Xu, Zhiwen, Li, Yue, Gao, Dali, Xu, Wenbin, Chen, Guoxun, and Cao, Lei
Abstract: Additional file 1: Table S1. The fieldwork that generated the data analysed here came from 17 lakes and 501 points in the YRF from 2016 to 2019. Table S2. Photographic information relating to the 10 waterbird species involved in the juvenile ratio survey of Yangtze River, illustrating the salient plumage differences and other features that are characteristic of adult and young birds of all species (Brazil 2009). Table S3. Autumn and spring migration schedules of eight large waterbird species in China based on published tracking studies. Table S4. The 1% biogeographical flyway population levels for each of the 10 large-bodied waterbirds wintering in the Yangtze River Floodplain analysed here. Table S5. The sample sizes and juvenile ratio from each of the 10 large waterbird species surveyed in each year from 2016 to 2019 (see "Methods" Data analysis). Table S6. The percentage of each large waterbird species counted in a given year at six sites of major flyway importance in the Yangtze River Floodplain. Table S7. The error rate between predicted juvenile ratio and YRF sampled juvenile ratio of 10 large-bodied waterbirds in Yangtze River Floodplain from 2016 to 2019. Table S8. The error rate between predicted juvenile ratio and Yangtze River Floodplain sampled juvenile ratio of 10 large-bodied waterbird species in sites of major flyway importance from 2016 to 2019. Table S9. The error rate between sampled juvenile ratio in November and sampled juvenile ratio in December of 10 large-bodied waterbirds in the same lake from 2016 to 2019. Table S10. The theoretical error, real error and T2_Weight results of 10 large-bodied waterbird species in the Yangtze River Floodplain based on statistical independence test analysis. Table S11. The sqrt_sum_Weight results of 10 large-bodied waterbird in YRF from 2016 to 2019 based on statistical independence test analysis. Table S12. Cross-comparison table showing juvenile ratios generated (j/N, where j is the number of juveniles in sample N) from each annual sample (sample size N) for each of the 10 large-bodied waterbird species in each year, 2016 to 2019.
Published: 2021
Full Text: View/download PDF

49. Generalizing Deep Models for Overhead Image Segmentation Through Getis-Ord Gi* Pooling

Author: Deng, Xueqing, Tian, Yuxin, and Newsam, Shawn
Subjects: Computing methodologies → Neural networks, convolutional neural networks, Remote sensing, pooling function, semantic segmentation, generalization
Abstract: That most deep learning models are purely data driven is both a strength and a weakness. Given sufficient training data, the optimal model for a particular problem can be learned. However, this is usually not the case and so instead the model is either learned from scratch from a limited amount of training data or pre-trained on a different problem and then fine-tuned. Both of these situations are potentially suboptimal and limit the generalizability of the model. Inspired by this, we investigate methods to inform or guide deep learning models for geospatial image analysis to increase their performance when a limited amount of training data is available or when they are applied to scenarios other than which they were trained on. In particular, we exploit the fact that there are certain fundamental rules as to how things are distributed on the surface of the Earth and these rules do not vary substantially between locations. Based on this, we develop a novel feature pooling method for convolutional neural networks using Getis-Ord Gi* analysis from geostatistics. Experimental results show our proposed pooling function has significantly better generalization performance compared to a standard data-driven approach when applied to overhead image segmentation.
Published: 2020
Full Text: View/download PDF

50. A syndemic of psychosocial problems places the MSM (men who have sex with men) population at greater risk of HIV infection.

Author: Wu, Jie, Lu, Ciyong, Deng, Xueqing, Wang, Hui, and Hong, Lingyao
Subjects: Medicine, Science
Abstract: Background: The MSM (Men who have sex with men) population suffers from very high rates of concurrent psychosocial problems. Together, these problems comprise a syndemic that increases the risk of HIV infection for this community. The precise mechanisms through which this syndemic can raise the likelihood of HIV infection warrant further exploration. Methodology/principal findings: A total of 522 MSM were enrolled via a multiframe sampling approach and were asked to report psychosocial problems, risky sexual behaviors and HIV test results. A count of psychosocial health problems was calculated to test the additive relationship of these factors on HIV risk. Adjusting analysis and restriction analysis were used to determine a proposed intermediate pathway. Psychosocial health problems are highly concurrent and intercorrelated among urban MSM. Greater numbers of health problems are significantly and positively associated with HIV infection, which is mediated, at least partially, by risky sexual behaviors. Conclusions/significance: MSM experience concurrent psychosocial health problems that correlate with HIV infection in this community. We recommend the development of coping strategies for this population to deal with these psychosocial problems, both in prevention research and health policy.
Published: 2012
Full Text: View/download PDF
