Author: "Xu, Xingqian" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Xu, Xingqian"' showing total 47 results

Start Over Author "Xu, Xingqian"

47 results on '"Xu, Xingqian"'

1. GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models

Author: D'Incà, Moreno, Peruzzo, Elia, Mancini, Massimiliano, Xu, Xingqian, Shi, Humphrey, and Sebe, Nicu
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent progress in Text-to-Image (T2I) generative models has enabled high-quality image generation. As performance and accessibility increase, these models are gaining significant attraction and popularity: ensuring their fairness and safety is a priority to prevent the dissemination and perpetuation of biases. However, existing studies in bias detection focus on closed sets of predefined biases (e.g., gender, ethnicity). In this paper, we propose a general framework to identify, quantify, and explain biases in an open set setting, i.e. without requiring a predefined set. This pipeline leverages a Large Language Model (LLM) to propose biases starting from a set of captions. Next, these captions are used by the target generative model for generating a set of images. Finally, Vision Question Answering (VQA) is leveraged for bias evaluation. We show two variations of this framework: OpenBias and GradBias. OpenBias detects and quantifies biases, while GradBias determines the contribution of individual prompt words on biases. OpenBias effectively detects both well-known and novel biases related to people, objects, and animals and highly aligns with existing closed-set bias detection methods and human judgment. GradBias shows that neutral words can significantly influence biases and it outperforms several baselines, including state-of-the-art foundation models. Code available here: https://github.com/Moreno98/GradBias., Comment: Under review. Code: https://github.com/Moreno98/GradBias
Published: 2024

2. UVMap-ID: A Controllable and Personalized UV Map Generative Model

Author: Wang, Weijie, Zhang, Jichao, Liu, Chang, Li, Xia, Xu, Xingqian, Shi, Humphrey, Sebe, Nicu, and Lepri, Bruno
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recently, diffusion models have made significant strides in synthesizing realistic 2D human images based on provided text prompts. Building upon this, researchers have extended 2D text-to-image diffusion models into the 3D domain for generating human textures (UV Maps). However, some important problems about UV Map Generative models are still not solved, i.e., how to generate personalized texture maps for any given face image, and how to define and evaluate the quality of these generated texture maps. To solve the above problems, we introduce a novel method, UVMap-ID, which is a controllable and personalized UV Map generative model. Unlike traditional large-scale training methods in 2D, we propose to fine-tune a pre-trained text-to-image diffusion model which is integrated with a face fusion module for achieving ID-driven customized generation. To support the finetuning strategy, we introduce a small-scale attribute-balanced training dataset, including high-quality textures with labeled text and Face ID. Additionally, we introduce some metrics to evaluate the multiple aspects of the textures. Finally, both quantitative and qualitative analyses demonstrate the effectiveness of our method in controllable and personalized UV Map generation. Code is publicly available via https://github.com/twowwj/UVMap-ID., Comment: Accepted to ACMMM2024
Published: 2024

3. OpenBias: Open-set Bias Detection in Text-to-Image Generative Models

Author: D'Incà, Moreno, Peruzzo, Elia, Mancini, Massimiliano, Xu, Dejia, Goel, Vidit, Xu, Xingqian, Wang, Zhangyang, Shi, Humphrey, and Sebe, Nicu
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Text-to-image generative models are becoming increasingly popular and accessible to the general public. As these models see large-scale deployments, it is necessary to deeply investigate their safety and fairness to not disseminate and perpetuate any kind of biases. However, existing works focus on detecting closed sets of biases defined a priori, limiting the studies to well-known concepts. In this paper, we tackle the challenge of open-set bias detection in text-to-image generative models presenting OpenBias, a new pipeline that identifies and quantifies the severity of biases agnostically, without access to any precompiled set. OpenBias has three stages. In the first phase, we leverage a Large Language Model (LLM) to propose biases given a set of captions. Secondly, the target generative model produces images using the same set of captions. Lastly, a Vision Question Answering model recognizes the presence and extent of the previously proposed biases. We study the behavior of Stable Diffusion 1.5, 2, and XL emphasizing new biases, never investigated before. Via quantitative experiments, we demonstrate that OpenBias agrees with current closed-set bias detection methods and human judgement., Comment: CVPR 2024 Highlight - Code: https://github.com/Picsart-AI-Research/OpenBias
Published: 2024

4. VASE: Object-Centric Appearance and Shape Manipulation of Real Videos

Author: Peruzzo, Elia, Goel, Vidit, Xu, Dejia, Xu, Xingqian, Jiang, Yifan, Wang, Zhangyang, Shi, Humphrey, and Sebe, Nicu
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recently, several works tackled the video editing task fostered by the success of large-scale text-to-image generative models. However, most of these methods holistically edit the frame using the text, exploiting the prior given by foundation diffusion models and focusing on improving the temporal consistency across frames. In this work, we introduce a framework that is object-centric and is designed to control both the object's appearance and, notably, to execute precise and explicit structural modifications on the object. We build our framework on a pre-trained image-conditioned diffusion model, integrate layers to handle the temporal dimension, and propose training strategies and architectural modifications to enable shape control. We evaluate our method on the image-driven video editing task showing similar performance to the state-of-the-art, and showcasing novel shape-editing capabilities. Further details, code and examples are available on our project page: https://helia95.github.io/vase-website/, Comment: Project Page https://helia95.github.io/vase-website/
Published: 2024

5. Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

Author: Guo, Jiayi, Xu, Xingqian, Pu, Yifan, Ni, Zanlin, Wang, Chaofei, Vasu, Manushree, Song, Shiji, Huang, Gao, and Shi, Humphrey
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recently, diffusion models have made remarkable progress in text-to-image (T2I) generation, synthesizing images with high fidelity and diverse contents. Despite this advancement, latent space smoothness within diffusion models remains largely unexplored. Smooth latent spaces ensure that a perturbation on an input latent corresponds to a steady change in the output image. This property proves beneficial in downstream tasks, including image interpolation, inversion, and editing. In this work, we expose the non-smoothness of diffusion latent spaces by observing noticeable visual fluctuations resulting from minor latent variations. To tackle this issue, we propose Smooth Diffusion, a new category of diffusion models that can be simultaneously high-performing and smooth. Specifically, we introduce Step-wise Variation Regularization to enforce the proportion between the variations of an arbitrary input latent and that of the output image is a constant at any diffusion training step. In addition, we devise an interpolation standard deviation (ISTD) metric to effectively assess the latent space smoothness of a diffusion model. Extensive quantitative and qualitative experiments demonstrate that Smooth Diffusion stands out as a more desirable solution not only in T2I generation but also across various downstream tasks. Smooth Diffusion is implemented as a plug-and-play Smooth-LoRA to work with various community models. Code is available at https://github.com/SHI-Labs/Smooth-Diffusion., Comment: GitHub: https://github.com/SHI-Labs/Smooth-Diffusion
Published: 2023

6. Study on the differences between Hoek–Brown parameters and equivalent Mohr–Coulomb parameters in the calculation slope critical acceleration and permanent displacement

Author: Li, Cheng, Zhao, Xi, Xu, Xingqian, and Qu, Xin
Published: 2024
Full Text: View/download PDF

7. Interactive Neural Painting

Author: Peruzzo, Elia, Menapace, Willi, Goel, Vidit, Arrigoni, Federica, Tang, Hao, Xu, Xingqian, Chopikyan, Arman, Orlov, Nikita, Hu, Yuxiao, Shi, Humphrey, Sebe, Nicu, and Ricci, Elisa
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In the last few years, Neural Painting (NP) techniques became capable of producing extremely realistic artworks. This paper advances the state of the art in this emerging research domain by proposing the first approach for Interactive NP. Considering a setting where a user looks at a scene and tries to reproduce it on a painting, our objective is to develop a computational framework to assist the users creativity by suggesting the next strokes to paint, that can be possibly used to complete the artwork. To accomplish such a task, we propose I-Paint, a novel method based on a conditional transformer Variational AutoEncoder (VAE) architecture with a two-stage decoder. To evaluate the proposed approach and stimulate research in this area, we also introduce two novel datasets. Our experiments show that our approach provides good stroke suggestions and compares favorably to the state of the art. Additional details, code and examples are available at https://helia95.github.io/inp-website., Comment: This is a preprint version of the paper to appear at Computer Vision and Image Understanding (CVIU). The final journal version will be available at https://www.sciencedirect.com/science/article/pii/S1077314223001583
Published: 2023
Full Text: View/download PDF

8. Reference-based Painterly Inpainting via Diffusion: Crossing the Wild Reference Domain Gap

Author: Xu, Dejia, Xu, Xingqian, Cong, Wenyan, Shi, Humphrey, and Wang, Zhangyang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Have you ever imagined how it would look if we placed new objects into paintings? For example, what would it look like if we placed a basketball into Claude Monet's ``Water Lilies, Evening Effect''? We propose Reference-based Painterly Inpainting, a novel task that crosses the wild reference domain gap and implants novel objects into artworks. Although previous works have examined reference-based inpainting, they are not designed for large domain discrepancies between the target and the reference, such as inpainting an artistic image using a photorealistic reference. This paper proposes a novel diffusion framework, dubbed RefPaint, to ``inpaint more wildly'' by taking such references with large domain gaps. Built with an image-conditioned diffusion model, we introduce a ladder-side branch and a masked fusion mechanism to work with the inpainting mask. By decomposing the CLIP image embeddings at inference time, one can manipulate the strength of semantic and style information with ease. Experiments demonstrate that our proposed RefPaint framework produces significantly better results than existing methods. Our method enables creative painterly image inpainting with reference objects that would otherwise be difficult to achieve. Project page: https://vita-group.github.io/RefPaint/
Published: 2023

9. Prompt-Free Diffusion: Taking 'Text' out of Text-to-Image Diffusion Models

Author: Xu, Xingqian, Guo, Jiayi, Wang, Zhangyang, Huang, Gao, Essa, Irfan, and Shi, Humphrey
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Text-to-image (T2I) research has grown explosively in the past year, owing to the large-scale pre-trained diffusion models and many emerging personalization and editing approaches. Yet, one pain point persists: the text prompt engineering, and searching high-quality text prompts for customized results is more art than science. Moreover, as commonly argued: "an image is worth a thousand words" - the attempt to describe a desired image with texts often ends up being ambiguous and cannot comprehensively cover delicate visual details, hence necessitating more additional controls from the visual domain. In this paper, we take a bold step forward: taking "Text" out of a pre-trained T2I diffusion model, to reduce the burdensome prompt engineering efforts for users. Our proposed framework, Prompt-Free Diffusion, relies on only visual inputs to generate new images: it takes a reference image as "context", an optional image structural conditioning, and an initial noise, with absolutely no text prompt. The core architecture behind the scene is Semantic Context Encoder (SeeCoder), substituting the commonly used CLIP-based or LLM-based text encoder. The reusability of SeeCoder also makes it a convenient drop-in component: one can also pre-train a SeeCoder in one T2I model and reuse it for another. Through extensive experiments, Prompt-Free Diffusion is experimentally found to (i) outperform prior exemplar-based image synthesis approaches; (ii) perform on par with state-of-the-art T2I models using prompts following the best practice; and (iii) be naturally extensible to other downstream applications such as anime figure generation and virtual try-on, with promising quality. Our code and models are open-sourced at https://github.com/SHI-Labs/Prompt-Free-Diffusion., Comment: Code, models and demos can be found through: https://github.com/SHI-Labs/Prompt-Free-Diffusion
Published: 2023

10. Zero-shot Generative Model Adaptation via Image-specific Prompt Learning

Author: Guo, Jiayi, Wang, Chaofei, Wu, You, Zhang, Eric, Wang, Kai, Xu, Xingqian, Song, Shiji, Shi, Humphrey, and Huang, Gao
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recently, CLIP-guided image synthesis has shown appealing performance on adapting a pre-trained source-domain generator to an unseen target domain. It does not require any target-domain samples but only the textual domain labels. The training is highly efficient, e.g., a few minutes. However, existing methods still have some limitations in the quality of generated images and may suffer from the mode collapse issue. A key reason is that a fixed adaptation direction is applied for all cross-domain image pairs, which leads to identical supervision signals. To address this issue, we propose an Image-specific Prompt Learning (IPL) method, which learns specific prompt vectors for each source-domain image. This produces a more precise adaptation direction for every cross-domain image pair, endowing the target-domain generator with greatly enhanced flexibility. Qualitative and quantitative evaluations on various domains demonstrate that IPL effectively improves the quality and diversity of synthesized images and alleviates the mode collapse. Moreover, IPL is independent of the structure of the generative model, such as generative adversarial networks or diffusion models. Code is available at https://github.com/Picsart-AI-Research/IPL-Zero-Shot-Generative-Model-Adaptation., Comment: Accepted by CVPR 2023. GitHub: https://github.com/Picsart-AI-Research/IPL-Zero-Shot-Generative-Model-Adaptation
Published: 2023

11. Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models

Author: Zhang, Eric, Wang, Kai, Xu, Xingqian, Wang, Zhangyang, and Shi, Humphrey
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: The unlearning problem of deep learning models, once primarily an academic concern, has become a prevalent issue in the industry. The significant advances in text-to-image generation techniques have prompted global discussions on privacy, copyright, and safety, as numerous unauthorized personal IDs, content, artistic creations, and potentially harmful materials have been learned by these models and later utilized to generate and distribute uncontrolled content. To address this challenge, we propose \textbf{Forget-Me-Not}, an efficient and low-cost solution designed to safely remove specified IDs, objects, or styles from a well-configured text-to-image model in as little as 30 seconds, without impairing its ability to generate other content. Alongside our method, we introduce the \textbf{Memorization Score (M-Score)} and \textbf{ConceptBench} to measure the models' capacity to generate general concepts, grouped into three primary categories: ID, object, and style. Using M-Score and ConceptBench, we demonstrate that Forget-Me-Not can effectively eliminate targeted concepts while maintaining the model's performance on other concepts. Furthermore, Forget-Me-Not offers two practical extensions: a) removal of potentially harmful or NSFW content, and b) enhancement of model accuracy, inclusion and diversity through \textbf{concept correction and disentanglement}. It can also be adapted as a lightweight model patch for Stable Diffusion, allowing for concept manipulation and convenient distribution. To encourage future research in this critical area and promote the development of safe and inclusive generative models, we will open-source our code and ConceptBench at \href{https://github.com/SHI-Labs/Forget-Me-Not}{https://github.com/SHI-Labs/Forget-Me-Not}.
Published: 2023

12. PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor

Author: Goel, Vidit, Peruzzo, Elia, Jiang, Yifan, Xu, Dejia, Xu, Xingqian, Sebe, Nicu, Darrell, Trevor, Wang, Zhangyang, and Shi, Humphrey
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Generative image editing has recently witnessed extremely fast-paced growth. Some works use high-level conditioning such as text, while others use low-level conditioning. Nevertheless, most of them lack fine-grained control over the properties of the different objects present in the image, i.e. object-level image editing. In this work, we tackle the task by perceiving the images as an amalgamation of various objects and aim to control the properties of each object in a fine-grained manner. Out of these properties, we identify structure and appearance as the most intuitive to understand and useful for editing purposes. We propose PAIR Diffusion, a generic framework that can enable a diffusion model to control the structure and appearance properties of each object in the image. We show that having control over the properties of each object in an image leads to comprehensive editing capabilities. Our framework allows for various object-level editing operations on real images such as reference image-based appearance editing, free-form shape editing, adding objects, and variations. Thanks to our design, we do not require any inversion step. Additionally, we propose multimodal classifier-free guidance which enables editing images using both reference images and text when using our approach with foundational diffusion models. We validate the above claims by extensively evaluating our framework on both unconditional and foundational diffusion models. Please refer to https://vidit98.github.io/publication/conference-paper/pair_diff.html for code and model release., Comment: Accepted in CVPR 2024, Project page https://vidit98.github.io/publication/conference-paper/pair_diff.html
Published: 2023

13. Versatile Diffusion: Text, Images and Variations All in One Diffusion Model

Author: Xu, Xingqian, Wang, Zhangyang, Zhang, Eric, Wang, Kai, and Shi, Humphrey
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent advances in diffusion models have set an impressive milestone in many generation tasks, and trending works such as DALL-E2, Imagen, and Stable Diffusion have attracted great interest. Despite the rapid landscape changes, recent new approaches focus on extensions and performance rather than capacity, thus requiring separate models for separate tasks. In this work, we expand the existing single-flow diffusion pipeline into a multi-task multimodal network, dubbed Versatile Diffusion (VD), that handles multiple flows of text-to-image, image-to-text, and variations in one unified model. The pipeline design of VD instantiates a unified multi-flow diffusion framework, consisting of sharable and swappable layer modules that enable the crossmodal generality beyond images and text. Through extensive experiments, we demonstrate that VD successfully achieves the following: a) VD outperforms the baseline approaches and handles all its base tasks with competitive quality; b) VD enables novel extensions such as disentanglement of style and semantics, dual- and multi-context blending, etc.; c) The success of our multi-flow multimodal framework over images and text may inspire further diffusion-based universal AI research. Our code and models are open-sourced at https://github.com/SHI-Labs/Versatile-Diffusion., Comment: ICCV 2023; Github link: https://github.com/SHI-Labs/Versatile-Diffusion
Published: 2022

14. StyleNAT: Giving Each Head a New Perspective

Author: Walton, Steven, Hassani, Ali, Xu, Xingqian, Wang, Zhangyang, and Shi, Humphrey
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Image generation has been a long sought-after but challenging task, and performing the generation task in an efficient manner is similarly difficult. Often researchers attempt to create a "one size fits all" generator, where there are few differences in the parameter space for drastically different datasets. Herein, we present a new transformer-based framework, dubbed StyleNAT, targeting high-quality image generation with superior efficiency and flexibility. At the core of our model, is a carefully designed framework that partitions attention heads to capture local and global information, which is achieved through using Neighborhood Attention (NA). With different heads able to pay attention to varying receptive fields, the model is able to better combine this information, and adapt, in a highly flexible manner, to the data at hand. StyleNAT attains a new SOTA FID score on FFHQ-256 with 2.046, beating prior arts with convolutional models such as StyleGAN-XL and transformers such as HIT and StyleSwin, and a new transformer SOTA on FFHQ-1024 with an FID score of 4.174. These results show a 6.4% improvement on FFHQ-256 scores when compared to StyleGAN-XL with a 28% reduction in the number of parameters and 56% improvement in sampling throughput. Code and models will be open-sourced at https://github.com/SHI-Labs/StyleNAT., Comment: Code at https://github.com/SHI-Labs/StyleNAT
Published: 2022

15. Image Completion with Heterogeneously Filtered Spectral Hints

Author: Xu, Xingqian, Navasardyan, Shant, Tadevosyan, Vahram, Sargsyan, Andranik, Mu, Yadong, and Shi, Humphrey
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Image completion with large-scale free-form missing regions is one of the most challenging tasks for the computer vision community. While researchers pursue better solutions, drawbacks such as pattern unawareness, blurry textures, and structure distortion remain noticeable, and thus leave space for improvement. To overcome these challenges, we propose a new StyleGAN-based image completion network, Spectral Hint GAN (SH-GAN), inside which a carefully designed spectral processing module, Spectral Hint Unit, is introduced. We also propose two novel 2D spectral processing strategies, Heterogeneous Filtering and Gaussian Split that well-fit modern deep learning models and may further be extended to other tasks. From our inclusive experiments, we demonstrate that our model can reach FID scores of 3.4134 and 7.0277 on the benchmark datasets FFHQ and Places2, and therefore outperforms prior works and reaches a new state-of-the-art. We also prove the effectiveness of our design via ablation studies, from which one may notice that the aforementioned challenges, i.e. pattern unawareness, blurry textures, and structure distortion, can be noticeably resolved. Our code will be open-sourced at: https://github.com/SHI-Labs/SH-GAN., Comment: wacv23
Published: 2022

16. Towards Layer-wise Image Vectorization

Author: Ma, Xu, Zhou, Yuqian, Xu, Xingqian, Sun, Bin, Filev, Valerii, Orlov, Nikita, Fu, Yun, and Shi, Humphrey
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Image rasterization is a mature technique in computer graphics, while image vectorization, the reverse path of rasterization, remains a major challenge. Recent advanced deep learning-based models achieve vectorization and semantic interpolation of vector graphs and demonstrate a better topology of generating new figures. However, deep models cannot be easily generalized to out-of-domain testing data. The generated SVGs also contain complex and redundant shapes that are not quite convenient for further editing. Specifically, the crucial layer-wise topology and fundamental semantics in images are still not well understood and thus not fully explored. In this work, we propose Layer-wise Image Vectorization, namely LIVE, to convert raster images to SVGs and simultaneously maintain its image topology. LIVE can generate compact SVG forms with layer-wise structures that are semantically consistent with human perspective. We progressively add new bezier paths and optimize these paths with the layer-wise framework, newly designed loss functions, and component-wise path initialization technique. Our experiments demonstrate that LIVE presents more plausible vectorized forms than prior works and can be generalized to new images. With the help of this newly learned topology, LIVE initiates human editable SVGs for both designers and other downstream applications. Codes are made available at https://github.com/Picsart-AI-Research/LIVE-Layerwise-Image-Vectorization., Comment: Accepted as Oral Presentation at CVPR 2022
Published: 2022

17. VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution

Author: Chen, Zeyuan, Chen, Yinbo, Liu, Jingwen, Xu, Xingqian, Goel, Vidit, Wang, Zhangyang, Shi, Humphrey, and Wang, Xiaolong
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Videos typically record the streaming and continuous visual data as discrete consecutive frames. Since the storage cost is expensive for videos of high fidelity, most of them are stored in a relatively low resolution and frame rate. Recent works of Space-Time Video Super-Resolution (STVSR) are developed to incorporate temporal interpolation and spatial super-resolution in a unified framework. However, most of them only support a fixed up-sampling scale, which limits their flexibility and applications. In this work, instead of following the discrete representations, we propose Video Implicit Neural Representation (VideoINR), and we show its applications for STVSR. The learned implicit neural representation can be decoded to videos of arbitrary spatial resolution and frame rate. We show that VideoINR achieves competitive performances with state-of-the-art STVSR methods on common up-sampling scales and significantly outperforms prior works on continuous and out-of-training-distribution scales. Our project page is at http://zeyuan-chen.com/VideoINR/ ., Comment: Accepted to CVPR 2022. Project page: http://zeyuan-chen.com/VideoINR/
Published: 2022

18. Research Progress and Prospect of Mechanical Effects and Model Construction of Root-soil Complex

Author: XIE Xiangrong, CHEN Zhengfa, ZHU Zhenyan, XU Xingqian, YAN Kai, LI Bo, DUAN Qingsong, LI Shufang, and ZHANG Chuan
Subjects: root-soil complex, soil mechanics effect, hydraulic effect, mechanical model, soil consolidation and water conservation, ecological restoration, Environmental sciences, GE1-350, Agriculture
Abstract: [Objective] In order to investigate the mechanism of mechanical effect of root-soil complex and the application of modeling method. [Methods] The concept and connotation of root-soil complex, the principle of mechanical effect and mechanical model of root-soil complex, advantages and disadvantages as well as the scope of application were summarized and analyzed by using literature analysis method and comparative analysis method. [Results] (1) The root-soil complex was a composite whole of mechanical coupling effect between root system and soil body, and the root system was intertwined in soil playing a reinforced role. (2) The mechanical relationship between root system and soil body was essentially the result of soil mechanical, hydraulic and composite mechanical properties of root-soil complex. The soil mechanical and hydraulic properties focused on the influence of roots on the soil and the influence of water on the soil and root system respectively, and the composite mechanical properties focused on the direct influence of soil properties on the properties and structure of plant roots, so that the mechanical relationship between roots and soil was in a dynamic balance through the three. (3) The study of the composite mechanical model of the root-soil complex was slightly less than that of the soil mechanical and hydraulic model, which were based on quantitative parameters and measure the soil consolidation effect by comparing parameters. However, the composite mechanical properties involves both soil and hydraulic properties, which was a comprehensive consideration and should be the key research direction in the future. [Conclusion] In-depth research was needed in the future to investigate the effects of freeze-thaw cycle, dry-wet alternation and dry-hot cycle on root and soil interaction in different regions, the shear strength of multi-type plant mixtures, the influence mechanism of chemical and microbial effects on soil-water propertiess and the construction of composite models need to be further studied. This study can provide important theoretical value and engineering reference for vegetation restoration, soil and water conservation and sustainable development in ecologically fragile areas.
Published: 2024
Full Text: View/download PDF

19. Study on soil dielectric constant models: A review

Author: XU Xingqian, WANG Haijun, QU Xin, PENG Guangcan, and ZHAO Xi
Subjects: soil, water content, electromagnetic wave, dielectric constant, model, Agriculture (General), S1-972, Irrigation engineering. Reclamation of wasteland. Drainage, TC801-978
Abstract: 【Objective】 Soil dielectric constant measures the ability of a soil to transmit electric fields. It is the ratio of the capacitance of a soil sample to the capacitance of air-filled space of the same volume. In this paper, we systematically analyze the factors that influence the dielectric constant, offering an updated perspective on soil dielectric constant modeling. 【Method】 This study systematically identifies the primary factors influencing soil dielectric constant based on soil dielectric theory. Additionally, it categorizes and summarizes existing soil dielectric constant models, facilitating a comparative analysis of their strengths, weaknesses, applications, and future development. 【Result】 The primary influencing factors governing the dielectric constant varied with soil types, with notable dependence on testing frequency. However, the dielectric constant remains a valuable indicator of soil water content. The soil dielectric constant models were broadly categorized into four types: theoretical, semi-empirical, empirical, and boundary models. 【Conclusion】 Presently, there is a dearth of research focusing on the dielectric properties and model development specific to regional soils. Enhancing model accuracy necessitates incorporating the influence of soil-phase composition, mineral composition, and microstructure on the dielectric constant. This refinement will broaden the application studies for assessing and analyzing soil physical and chemical properties based on dielectric constant testing.
Published: 2024
Full Text: View/download PDF

20. UltraSR: Spatial Encoding is a Missing Key for Implicit Image Function-based Arbitrary-Scale Super-Resolution

Author: Xu, Xingqian, Wang, Zhangyang, and Shi, Humphrey
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The recent success of NeRF and other related implicit neural representation methods has opened a new path for continuous image representation, where pixel values no longer need to be looked up from stored discrete 2D arrays but can be inferred from neural network models on a continuous spatial domain. Although the recent work LIIF has demonstrated that such novel approaches can achieve good performance on the arbitrary-scale super-resolution task, their upscaled images frequently show structural distortion due to the inaccurate prediction of high-frequency textures. In this work, we propose UltraSR, a simple yet effective new network design based on implicit image functions in which we deeply integrated spatial coordinates and periodic encoding with the implicit neural representation. Through extensive experiments and ablation studies, we show that spatial encoding is a missing key toward the next-stage high-performing implicit image function. Our UltraSR sets new state-of-the-art performance on the DIV2K benchmark under all super-resolution scales compared to previous state-of-the-art methods. UltraSR also achieves superior performance on other standard benchmark datasets in which it outperforms prior works in almost all experiments.
Published: 2021

21. Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach

Author: Xu, Xingqian, Zhang, Zhifei, Wang, Zhaowen, Price, Brian, Wang, Zhonghao, and Shi, Humphrey
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Text segmentation is a prerequisite in many real-world text-related tasks, e.g., text style transfer, and scene text removal. However, facing the lack of high-quality datasets and dedicated investigations, this critical prerequisite has been left as an assumption in many works, and has been largely overlooked by current research. To bridge this gap, we proposed TextSeg, a large-scale fine-annotated text dataset with six types of annotations: word- and character-wise bounding polygons, masks and transcriptions. We also introduce Text Refinement Network (TexRNet), a novel text segmentation approach that adapts to the unique properties of text, e.g. non-convex boundary, diverse texture, etc., which often impose burdens on traditional segmentation models. In our TexRNet, we propose text specific network designs to address such challenges, including key features pooling and attention-based similarity checking. We also introduce trimap and discriminator losses that show significant improvement on text segmentation. Extensive experiments are carried out on both our TextSeg dataset and other existing datasets. We demonstrate that TexRNet consistently improves text segmentation performance by nearly 2% compared to other state-of-the-art segmentation methods. Our dataset and code will be made available at https://github.com/SHI-Labs/Rethinking-Text-Segmentation.
Published: 2020

22. The 1st Agriculture-Vision Challenge: Methods and Results

Author: Chiu, Mang Tik, Xu, Xingqian, Wang, Kai, Hobbs, Jennifer, Hovakimyan, Naira, Huang, Thomas S., Shi, Honghui, Wei, Yunchao, Huang, Zilong, Schwing, Alexander, Brunner, Robert, Dozier, Ivan, Dozier, Wyatt, Ghandilyan, Karen, Wilson, David, Park, Hyunseong, Kim, Junhee, Kim, Sungho, Liu, Qinghui, Kampffmeyer, Michael C., Jenssen, Robert, Salberg, Arnt B., Barbosa, Alexandre, Trevisan, Rodrigo, Zhao, Bingchen, Yu, Shaozuo, Yang, Siwei, Wang, Yin, Sheng, Hao, Chen, Xiao, Su, Jingyi, Rajagopal, Ram, Ng, Andrew, Huynh, Van Thong, Kim, Soo-Hyung, Na, In-Seop, Baid, Ujjwal, Innani, Shubham, Dutande, Prasad, Baheti, Bhakti, Talbar, Sanjay, and Tang, Jianyu
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: The first Agriculture-Vision Challenge aims to encourage research in developing novel and effective algorithms for agricultural pattern recognition from aerial images, especially for the semantic segmentation task associated with our challenge dataset. Around 57 participating teams from various countries compete to achieve state-of-the-art in aerial agriculture semantic segmentation. The Agriculture-Vision Challenge Dataset was employed, which comprises of 21,061 aerial and multi-spectral farmland images. This paper provides a summary of notable methods and results in the challenge. Our submission server and leaderboard will continue to open for researchers that are interested in this challenge dataset and task; the link can be found here., Comment: CVPR 2020 Workshop
Published: 2020

23. Deep Affinity Net: Instance Segmentation via Affinity

Author: Xu, Xingqian, Chiu, Mang Tik, Huang, Thomas S., and Shi, Honghui
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Most of the modern instance segmentation approaches fall into two categories: region-based approaches in which object bounding boxes are detected first and later used in cropping and segmenting instances; and keypoint-based approaches in which individual instances are represented by a set of keypoints followed by a dense pixel clustering around those keypoints. Despite the maturity of these two paradigms, we would like to report an alternative affinity-based paradigm where instances are segmented based on densely predicted affinities and graph partitioning algorithms. Such affinity-based approaches indicate that high-level graph features other than regions or keypoints can be directly applied in the instance segmentation task. In this work, we propose Deep Affinity Net, an effective affinity-based approach accompanied with a new graph partitioning algorithm Cascade-GAEC. Without bells and whistles, our end-to-end model results in 32.4% AP on Cityscapes val and 27.5% AP on test. It achieves the best single-shot result as well as the fastest running time among all affinity-based models. It also outperforms the region-based method Mask R-CNN.
Published: 2020

24. Agriculture-Vision: A Large Aerial Image Database for Agricultural Pattern Analysis

Author: Chiu, Mang Tik, Xu, Xingqian, Wei, Yunchao, Huang, Zilong, Schwing, Alexander, Brunner, Robert, Khachatrian, Hrant, Karapetyan, Hovnatan, Dozier, Ivan, Rose, Greg, Wilson, David, Tudor, Adrian, Hovakimyan, Naira, Huang, Thomas S., and Shi, Honghui
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computers and Society, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: The success of deep learning in visual recognition tasks has driven advancements in multiple fields of research. Particularly, increasing attention has been drawn towards its application in agriculture. Nevertheless, while visual pattern recognition on farmlands carries enormous economic values, little progress has been made to merge computer vision and crop sciences due to the lack of suitable agricultural image datasets. Meanwhile, problems in agriculture also pose new challenges in computer vision. For example, semantic segmentation of aerial farmland images requires inference over extremely large-size images with extreme annotation sparsity. These challenges are not present in most of the common object datasets, and we show that they are more challenging than many other aerial image datasets. To encourage research in computer vision for agriculture, we present Agriculture-Vision: a large-scale aerial farmland image dataset for semantic segmentation of agricultural patterns. We collected 94,986 high-quality aerial images from 3,432 farmlands across the US, where each image consists of RGB and Near-infrared (NIR) channels with resolution as high as 10 cm per pixel. We annotate nine types of field anomaly patterns that are most important to farmers. As a pilot study of aerial agricultural semantic segmentation, we perform comprehensive experiments using popular semantic segmentation models; we also propose an effective model designed for aerial agricultural pattern recognition. Our experiments demonstrate several challenges Agriculture-Vision poses to both the computer vision and agriculture communities. Future versions of this dataset will include even more aerial images, anomaly patterns and image channels. More information at https://www.agriculture-vision.com., Comment: CVPR 2020
Published: 2020

25. The relationship between crust-lithosphere structures and seismicity on the southeastern edge of the Tibetan Plateau

Author: Xu, Xingqian, Su, Lijun, Liu, Junzhe, Zhou, Wanhuan, Gong, Aimin, and Qu, Xin
Published: 2020
Full Text: View/download PDF

26. Correction to: An optimal method for searching failure surfaces of hard thin-layered anaclinal rock slopes with cross joints

Author: Su, Lijun, Qu, Xin, Zhang, Chonglei, Iqbal, Javed, Wang, Shanyong, Xu, Xingqian, and Diao, Fangfang
Published: 2021
Full Text: View/download PDF

27. An optimal method for searching failure surfaces of hard thin-layered anaclinal rock slopes with cross joints

Author: Su, Lijun, Qu, Xin, Zhang, Chonglei, Iqbal, Javed, Wang, Shanyong, Xu, Xingqian, and Diao, Fangfang
Published: 2021
Full Text: View/download PDF

28. Interactive Neural Painting

Author: Peruzzo, Elia, primary, Menapace, Willi, additional, Goel, Vidit, additional, Arrigoni, Federica, additional, Tang, Hao, additional, Xu, Xingqian, additional, Chopikyan, Arman, additional, Orlov, Nikita, additional, Hu, Yuxiao, additional, Shi, Humphrey, additional, Sebe, Nicu, additional, and Ricci, Elisa, additional
Published: 2023
Full Text: View/download PDF

29. Effects of flow regimes on the interaction between granular flow and flexible barrier

Author: Xiao, Siyou, primary, Xu, Xingqian, additional, Wang, Haijun, additional, Li, Dianxin, additional, Wei, Zhongju, additional, and Zhang, Tengyuan, additional
Published: 2023
Full Text: View/download PDF

30. A Review on Anti-Dip Bedding Rock Slopes Subjected to Flexural Toppling

Author: Qu, Xin, primary, Diao, Fangfang, additional, Xu, Xingqian, additional, and Li, Cheng, additional
Published: 2023
Full Text: View/download PDF

31. Study on Resistivity Characteristics and Evaluation Model of Cadmium Contaminated Laterite.

Author: CHEN Xiaoshuang, XU Xingqian, ZHAO Xi, QU Xin, WANG Haijun, and PENG Guangcan
Subjects: LATERITE, STANDARD deviations, HEAVY metal toxicology, ANALYSIS of heavy metals, CADMIUM, HEAVY metals
Abstract: To investigate the influence characteristic of cadmium on the resistivity of laterite, the laterite samples with different water content, dry density and cadmium content were tested to analyze the relationships between the influence factors and the resistivity of laterite using the two-electrode method, and finally the resistivity evaluation model of the cadmium contaminated laterite was proposed. The results showed that the resistivity of cadmium-contaminated laterite decreased with the increasing of dry density, water content and temperature. The resistivity sharply decreased in the range of dry density less than 1.30 g⋅cm-3, then gradually decreased and further stabilized. On the condition of different cadmium content, the resistivity gradually decreased with the increasing water content, and the most significant decrease was observed when the dry density was 1.20 g⋅cm-3. Under the same dry density and water content, the effect of temperature above 0 °C on resistivity was not significant. The increasing cadmium gave rise to the decreasing resistivity gradually, and the change in resistivity was significant when the cadmium content was less than 100 mg⋅cm-1. Considering the characteristics of dry density, water content and temperature comprehensively, the resistivity evaluation model of cadmium-contaminated laterite was established by the introduction of the volume water content, which had high fitting accuracy (R²=0.939 9), and the measured resistivity values were in good agreement with the model calculated resistivity values. The average mean absolute percentage error and root mean square error were 4.77% and 0.07, respectively. It could provided a theoretical model for the rapid electrical detection of heavy metal cadmium pollution in laterite areas, and it was beneficial for the quality evaluation of regional laterite cultivated land as a convenient analytical method. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

32. Zero-Shot Generative Model Adaptation via Image-Specific Prompt Learning

Author: Guo, Jiayi, primary, Wang, Chaofei, additional, Wu, You, additional, Zhang, Eric, additional, Wang, Kai, additional, Xu, Xingqian, additional, Song, Shiji, additional, Shi, Humphrey, additional, and Huang, Gao, additional
Published: 2023
Full Text: View/download PDF

33. A resistivity-based experimental compaction model for Yunnan laterite

Author: PENG, Guangcan, primary, XU, Xingqian, additional, ZHAO, Xi, additional, QU, Xin, additional, and WANG, Haijun, additional
Published: 2023
Full Text: View/download PDF

34. Analysis of the Deformation Characteristics and Formation Mechanism of Reservoir Landslides under Rainfall Conditions

Author: DOU Sijun, XU Xingqian, LI Jiguo, LING Huikun, and ZHOU Shasha
Subjects: rainfall, landslide, deformation and failure, formation mechanism, River, lake, and water-supply engineering (General), TC401-506
Abstract: Hexi reservoir landslide is located in Tianyuan Town,Changning County,Baoshan City,Yunnan Province.There are obvious tensile cracks,fault,bank slope retaining wall and cut-off ditch crack deformation at its rear edge,which bring a serious threat to the safety of spillways and dam body.Based on the basic morphological features and geological environment conditions of the landslide,combined with the geological survey data,this paper analyzes the deformation characteristics and induced factors of the landslide and judges the landslide as a large-scale thick-layer traction-pushing bedding landslide,simulates the change of slope pore water pressure and safety coefficient under the conditions of different rainfall intensity and reservoir water level change by Geo-Studio numerical software,and analyzes & evaluates its stability.The results show that:Under continuous rainfall,the reservoir water level rises,the front edge of the slope body softens,the pressure of pore water in the slope body increases,the natural discharge effect produces,and the sliding force increases,which aggravate the deformation,failure and destabilization of slope body.In the process of the sudden drop of reservoir water level,the backpressure effect of the reservoir water is weakened,and the water penetration of the pre-slope body produces the dynamic water pressure,which is more likely to lead to the occurrence of such landslides.The steep slope terrain and bedding structure of downward rock formation in the landslide area create favorable conditions for the development of the landslide,and thecontinuous rainfall is the main factor of the landslide.The joint cutting slope and backpressure anti-slip pile is an effective means to prevent and control the landslide.The results can serve as theoretical reference for the analysis,judgment and timely prevention of similar landslides in daily operation and management of reservoir.
Published: 2020
Full Text: View/download PDF

35. Image Completion with Heterogeneously Filtered Spectral Hints

Author: Xu, Xingqian, primary, Navasardyan, Shant, additional, Tadevosyan, Vahram, additional, Sargsyan, Andranik, additional, Mu, Yadong, additional, and Shi, Humphrey, additional
Published: 2023
Full Text: View/download PDF

36. Study on the dielectric properties and dielectric constant model of laterite

Author: Xu, Xingqian, primary, Wang, Haijun, additional, Qu, Xin, additional, Li, Cheng, additional, Cai, Bo, additional, and Peng, Guangcan, additional
Published: 2022
Full Text: View/download PDF

37. Lithospheric structure and crust–mantle decoupling in the southeast edge of the Tibetan Plateau

Author: Hu, Jiafu, Yang, Haiyan, Xu, Xingqian, Wen, Limin, and Li, Guangquan
Published: 2012
Full Text: View/download PDF

38. Towards Layer-wise Image Vectorization

Author: Ma, Xu, primary, Zhou, Yuqian, additional, Xu, Xingqian, additional, Sun, Bin, additional, Filev, Valerii, additional, Orlov, Nikita, additional, Fu, Yun, additional, and Shi, Humphrey, additional
Published: 2022
Full Text: View/download PDF

39. VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution

Author: Chen, Zeyuan, primary, Chen, Yinbo, additional, Liu, Jingwen, additional, Xu, Xingqian, additional, Goel, Vidit, additional, Wang, Zhangyang, additional, Shi, Humphrey, additional, and Wang, Xiaolong, additional
Published: 2022
Full Text: View/download PDF

40. Modelling of Critical Acceleration for Regional Seismic Landslide Hazard Assessments by Finite Element Limit Analysis

Author: Li, Cheng, primary, Wei, Shuhe, additional, Xu, Xingqian, additional, and Qu, Xin, additional
Published: 2022
Full Text: View/download PDF

41. Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach

Author: Xu, Xingqian, primary, Zhang, Zhifei, additional, Wang, Zhaowen, additional, Price, Brian, additional, Wang, Zhonghao, additional, and Shi, Humphrey, additional
Published: 2021
Full Text: View/download PDF

42. Study on the Mechanical Effect and Constitutive Model of Montmorillonite under the Action of Acid Rain: A Case Study on Montmorillonite-Quartz Remolded Soil

Author: Li, Li, primary, Liu, Jian, additional, and Xu, Xingqian, additional
Published: 2021
Full Text: View/download PDF

43. Bending of Nonconforming Thin Plates Based on the Mixed-Order Manifold Method with Background Cells for Integration

Author: Qu, Xin, primary, Su, Lijun, additional, Liu, Zhijun, additional, Xu, Xingqian, additional, Diao, Fangfang, additional, and Li, Wei, additional
Published: 2020
Full Text: View/download PDF

44. Bending of nonconforming thin plates based on the first-order manifold method

Author: Qu, Xin, primary, Diao, Fangfang, additional, Xu, Xingqian, additional, and Li, Wei, additional
Published: 2020
Full Text: View/download PDF

45. The 1st Agriculture-Vision Challenge: Methods and Results

Author: Chiu, Mang Tik, primary, Xu, Xingqian, additional, Wang, Kai, additional, Hobbs, Jennifer, additional, Hovakimyan, Naira, additional, Huang, Thomas S., additional, Shi, Honghui, additional, Wei, Yunchao, additional, Huang, Zilong, additional, Schwing, Alexander, additional, Brunner, Robert, additional, Dozier, Ivan, additional, Dozier, Wyatt, additional, Ghandilyan, Karen, additional, Wilson, David, additional, Park, Hyunseong, additional, Kim, Junhee, additional, Kim, Sungho, additional, Liu, Qinghui, additional, Kampffmeyer, Michael C., additional, Jenssen, Robert, additional, Salberg, Arnt B., additional, Barbosa, Alexandre, additional, Trevisan, Rodrigo, additional, Zhao, Bingchen, additional, Yu, Shaozuo, additional, Yang, Siwei, additional, Wang, Yin, additional, Sheng, Hao, additional, Chen, Xiao, additional, Su, Jingyi, additional, Rajagopal, Ram, additional, Ng, Andrew, additional, Huynh, Van Thong, additional, Kim, Soo-Hyung, additional, Na, In-Seop, additional, Baid, Ujjwal, additional, Innani, Shubham, additional, Dutande, Prasad, additional, Baheti, Bhakti, additional, Talbar, Sanjay, additional, and Tang, Jianyu, additional
Published: 2020
Full Text: View/download PDF

46. Agriculture-Vision: A Large Aerial Image Database for Agricultural Pattern Analysis

Author: Chiu, Mang Tik, primary, Xu, Xingqian, additional, Wei, Yunchao, additional, Huang, Zilong, additional, Schwing, Alexander G., additional, Brunner, Robert, additional, Khachatrian, Hrant, additional, Karapetyan, Hovnatan, additional, Dozier, Ivan, additional, Rose, Greg, additional, Wilson, David, additional, Tudor, Adrian, additional, Hovakimyan, Naira, additional, Huang, Thomas S., additional, and Shi, Honghui, additional
Published: 2020
Full Text: View/download PDF

47. S receiver function analysis of the crustal and lithospheric structures beneath eastern Tibet

Author: Hu, Jiafu, Xu, Xingqian, Yang, Haiyan, Wen, Limin, and Li, Guangquan
Published: 2011
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

47 results on '"Xu, Xingqian"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources