Author: "Liu Xiaoming" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Liu Xiaoming"' showing total 8,052 results

Start Over Author "Liu Xiaoming"

8,052 results on '"Liu Xiaoming"'

1. DCF–VQA: Counterfactual Structure Based on Multi–Feature Enhancement

Author: Yang Guan, Ji Cheng, Liu Xiaoming, Zhang Ziming, and Wang Chen
Subjects: visual question answering, multi-feature enhancement, counterfactual, discrete cosine transform, Mathematics, QA1-939, Electronic computers. Computer science, QA75.5-76.95
Abstract: Visual question answering (VQA) is a pivotal topic at the intersection of computer vision and natural language processing. This paper addresses the challenges of linguistic bias and bias fusion within invalid regions encountered in existing VQA models due to insufficient representation of multi-modal features. To overcome those issues, we propose a multi-feature enhancement scheme. This scheme involves the fusion of one or more features with the original ones, incorporating discrete cosine transform (DCT) features into the counterfactual reasoning framework. This approach harnesses finegrained information and spatial relationships within images and questions, enabling a more refined understanding of the indirect relationship between images and questions. Consequently, it effectively mitigates linguistic bias and bias fusion within invalid regions in the model. Extensive experiments are conducted on multiple datasets, including VQA2 and VQA-CP2, employing various baseline models and fusion techniques, resulting in promising and robust performance.
Published: 2024
Full Text: View/download PDF

2. Molecular characterization, tissue expression, and antiviral activities of Bama minipig interferon-α subtypes

Author: Aziz Ullah Noor, Lu Huipeng, Zhanyu Du, Song Chengyi, Zhou Xiaohui, Liu Xiaoming, Suliman khan, Huaichang Sun, and Abdelouahab Bellou
Subjects: Porcine interferon-α subtypes, Tissue expression analysis, Gene cloning, Antiviral activity, Chinese Bama minipigs, Science (General), Q1-390, Social sciences (General), H1-99
Abstract: Interferons play a major role in innate immunity and disease resistance. Porcine interferon alpha has 17 subtypes, and their gene sequences, tissue expression profiles, and antiviral activities have been primarily studied in domestic pigs but not in minipigs. Bama minipigs are genetically stable disease-resistant and making them as laboratory animal models for bioscience studies. To define the potential mechanism for disease resistance, in this study, we cloned 17 subtypes of Porcine interferon alpha genes in Bama minipigs using high fidelity polymerase chain reaction and subsequent sequencing. Sequence alignment showed that the 17 porcine interferon alpha subtypes were 98%–100 % homologous in those of domestic pigs. However, significantly different tissue expression profiles of PoIFN-α subtypes were found in the two pig species using real-time quantitative RT-PCR. Among the 10 different Bama minipig tissues tested, significant expression of multi-subtype porcine interferon alpha was detected in the lymph nodes and spleen, whereas no or low expression of fewer subtypes was detected in the heart, lung, brain, and small intestine. Sequence analysis revealed that the porcine interferon alpha promoters were almost similar between the two pig species. A cytopathic effect inhibition assay showed that the recombinant 17 porcine interferon alpha subtypes purified from mammalian cells had significantly different antiviral profile against vesicular stomatitis virus, porcine pseudorabies virus and porcine reproductive and respiratory syndrome virus compared with those in domestic pigs. Our findings provide evidence that porcine interferon alpha subtypes are highly conserved between Bama minipigs and domestic pigs but show varied tissue expression pattern and antiviral capabilities, which may contribute to their differences in disease resistance.
Published: 2024
Full Text: View/download PDF

3. Translocator protein activates autophagy in diabetic neuropathic pain rats via regulation of the Keap1/Nrf2/HO-1 signaling

Author: GAO Nan, HAO Gem, MA Bingjie, JIN Tian, MA Ke, and LIU Xiaoming
Subjects: translocator protein (tspo), neuropathic pain (np), diabetes, autophagy, kelch-like ech-associated protein 1 (keap1)/nuclear factor erythroid-derived-2-like 2 (nrf2)/heme oxygenase-1 (ho-1) signaling (keap1/nrf2/ho-1 signaling), Medicine
Abstract: Objective·To study the effects of translocator protein (TSPO) agonist Ro5-4864 on autophagy and Kelch-like ECH-associated protein 1 (Keap1)/nuclear factor erythroid-derived-2-like 2 (Nrf2)/heme oxygenase-1 (HO-1) signaling in diabetic neuropathic pain (DNP) rats.Methods·Type 2 diabetic rats were established by high-fat diet and streptozotocin (STZ), and DNP rats were filtered by behavioral assessment. Twenty-four rats were randomly assigned to the Sham group, DNP group, TSPO agonist Ro5-4864 group (Ro group), and TSPO agonist Ro5-4864 combined with Nrf2 inhibitor ML385 group (Ro+ML385 group). Up-Down method was used to measure paw 50% mechanical withdrawal threshold (50% PMWT) of the rats before high-fat diet (baseline), and on Day 3, 7, 14, 21 and 28 after STZ. Sciatic nerves were collected on the last day to analyze the effects of Ro5-4864 on autophagy related proteins and Keap1/Nrf2/HO-1 signaling related proteins of DNP rats by using immunofluorescence and Western blotting.Results·The 50% PMWT in the DNP group decreased from D3 to D28 (P=0.000 at all timing), and the expression of Bcl-2 interacting coiled-coil protein 1 (Beclin-1), microtubule-associated protein light chain 3-Ⅱ (LC3-Ⅱ), HO-1, and nuclear Nrf2 (P=0.000) were significantly reduced in the sciatic nerves of DNP rats (all P=0.000), compared with those in the sham group, but p62 was significantly increased (P=0.000). Administration of Ro5-4864 attenuated these changes in the rats of the Ro group. There was a gradual increase in the 50% PMWT, compared with that of the rats in the DNP group (D14 P=0.039, both D21 and D28 P=0.000), and the impairment of autophagy and the Keap1/Nrf2/HO-1 signaling was repaired, which was demonstrated by increases of Beclin-1, LC3-Ⅱ, HO-1, and nuclear Nrf2 protein contents (all P=0.000) and a decrease in p62 content (P=0.001). However, the beneficial effects of Ro5-4864 were totally abrogated by ML385 in rats of the Ro+ML385 group.Conclusion·TSPO alleviates DNP in rats, of which the mechanism involves activation of autophagy via upregulation of the Keap1/Nrf2/HO-1 signaling in sciatic nerves. This study provides a new strategy for the treatment of DNP.
Published: 2023
Full Text: View/download PDF

4. Lncrna FEZf1-as1 negatively regulates ETNK1 to promote malignant progression of renal cell carcinoma

Author: Lou Jiangyong, Liu Xiaoming, Fan Xiaodong, Xu Xiaoming, Wang Zhichao, and Wang Liqun
Subjects: lncfezf1-as1, etnk1, renal cell carcinoma, malignant progression, Biochemistry, QD415-436
Abstract: Background: To explore the role of LncFEZF1-AS1 in renal cell carcinoma (RCC) tissues and cells, and the possible molecular mechanism. Methods: Expressions of LncFEZF1-AS1 in RCC tissues and adjacent ones were detected. The association of LncFEZF1-AS1 level with clinical data of RCC patients was also analyzed. Besides, the differential expressions of LncFEZF1-AS1 in a variety of RCC cell lines were also determined. Then the LncFEZF1-AS1 knockdown model was constructed in RCC cell line to further determine the influences of LncFEZF1-AS1 on the proliferative ability and migration of RCC cells through CCK8 and Transwell experiments. Furthermore, luciferase reporter gene experiment were used to validate the combination of LncFEZF1-AS1 to ETNK1. Results: Results suggested that expression of LncFEZF1-AS1 was noticeably higher in RCC tumor tissues and the RCC cells. Clinical pathological data analysis also suggested that high LncFEZF1-AS1 expression was in correlation with the pathological stage and the incidence of distant metastasis in RCC patients, and the poor overall survival rate. In vitro experiments demonstrated that knocking down of LncFEZF1-AS1 markedly repressed the proliferation and migration of RCC cell lines. Bioinformatics suggested that LncFEZF1-AS1 can interact with the downstream target gene ETNK1, which was confirmed by the luciferase reporter gene experiments. Western Blot results revealed that knocking down of LncFEZF1-AS1 markedly enhanced ETNK1. qRT-PCR analysis indicated that ETNK1 level was under-expressed in RCC tissues and in negative correlation with LncFEZF1-AS1. Further experiments suggested that knockdown of ETNK1 partially reversed the inhibitory effects of LncFEZF1-AS1 silencing on the proliferative and migrative abilities of RCC cells. Conclusions: LncFEZF1-AS1 could facilitation the proliferative and migration of RCC cells by regulating the expression of ETNK1. Therefore, FEZF1-AS1 might function as a cancer-promoting factor and possible new therapeutic target for RCC.
Published: 2023

5. Comprehensive Optimization of Access Point Selection for Offshore Wind Farm Integrated With Voltage Source Converter High Voltage Direct Current

Author: LIU Xiaoming, TAN Zukuang, YUAN Zhenhua, and LIU Yutian
Subjects: offshore wind power, voltage source converter high voltage direct current (vsc-hvdc), access point selection, fuzzy analytic hierarchy process, security accommodation, Applications of electric power, TK4001-4102, Production of electric energy or power. Powerplants. Central stations, TK1001-1841, Science
Abstract: With the large-scale development of offshore wind power, the suitable access point to power grid for offshore wind farm is more beneficial for voltage source converter high voltage direct current (VSC-HVDC) power transmission. Aiming at large scale offshore wind farm integrated by the VSC-HVDC, the evaluation indexes of wind power accommodation capability, grid voltage stability, grid-connected point vulnerability and construction cost were proposed by analyzing the main influencing factors of grid-connected. Based on the theory of information entropy and fuzzy analytic hierarchy process, a combination weighting model was established to determine the index weights, , and a comprehensive optimization method of access point selection was proposed. Simulation results of eastern Shandong power grid of China integrated with offshore wind farm demonstrate that the proposed method enhances the capability of wind power accommodation and improves the security and stability of power grid.
Published: 2022
Full Text: View/download PDF

6. Insights into the efficient degradation of metformin - An emerging antidiabetic medicine by UV/Sulfite process

Author: Gu Yurong, Liu Xiaoming, Han Qi, and Wang Feng
Subjects: Environmental sciences, GE1-350
Abstract: The growing utilization of metformin (MET) in diabetes treatment has resulted in its occurrence in wastewater treatment plants, where conventional techniques were proved inadequate in eliminating it. The present study evaluated the potential of UV/sulfite system in degrading MET and analyzed the influence of common factors (i.e UV intensity, dosage of reagent, the pH value of reaction solution) on the target contaminant removal. In comparison with both direct UV photolysis and merely sulfite reduction, the UV/sulfite process had a remarkable enhancement in MET removal, with 96.54% of MET (initial concentration of 15 mg/L) being degraded within 30 minutes. A strong linear relationship (R2 > 0.99) was observed between MET degradation kinetics and UV intensity. The increase of sulfite dosage and solution pH could promote MET degradation to a certain extent in the studied system. Additionally, the hydrated electron (eaq-) was found played the principle role in MET removal through scavenging reactions.
Published: 2024
Full Text: View/download PDF

7. On Learning Multi-Modal Forgery Representation for Diffusion Generated Video Detection

Author: Song, Xiufeng, Guo, Xiao, Zhang, Jiache, Li, Qirui, Bai, Lei, Liu, Xiaoming, Zhai, Guangtao, and Liu, Xiaohong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Large numbers of synthesized videos from diffusion models pose threats to information security and authenticity, leading to an increasing demand for generated content detection. However, existing video-level detection algorithms primarily focus on detecting facial forgeries and often fail to identify diffusion-generated content with a diverse range of semantics. To advance the field of video forensics, we propose an innovative algorithm named Multi-Modal Detection(MM-Det) for detecting diffusion-generated videos. MM-Det utilizes the profound perceptual and comprehensive abilities of Large Multi-modal Models (LMMs) by generating a Multi-Modal Forgery Representation (MMFR) from LMM's multi-modal space, enhancing its ability to detect unseen forgery content. Besides, MM-Det leverages an In-and-Across Frame Attention (IAFA) mechanism for feature augmentation in the spatio-temporal domain. A dynamic fusion strategy helps refine forgery representations for the fusion. Moreover, we construct a comprehensive diffusion video dataset, called Diffusion Video Forensics (DVF), across a wide range of forgery videos. MM-Det achieves state-of-the-art performance in DVF, demonstrating the effectiveness of our algorithm. Both source code and DVF are available at https://github.com/SparkleXFantasy/MM-Det., Comment: 10 pages, 9 figures
Published: 2024

8. Language-guided Hierarchical Fine-grained Image Forgery Detection and Localization

Author: Guo, Xiao, Liu, Xiaohong, Masi, Iacopo, and Liu, Xiaoming
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Differences in forgery attributes of images generated in CNN-synthesized and image-editing domains are large, and such differences make a unified image forgery detection and localization (IFDL) challenging. To this end, we present a hierarchical fine-grained formulation for IFDL representation learning. Specifically, we first represent forgery attributes of a manipulated image with multiple labels at different levels. Then, we perform fine-grained classification at these levels using the hierarchical dependency between them. As a result, the algorithm is encouraged to learn both comprehensive features and the inherent hierarchical nature of different forgery attributes. In this work, we propose a Language-guided Hierarchical Fine-grained IFDL, denoted as HiFi-Net++. Specifically, HiFi-Net++ contains four components: a multi-branch feature extractor, a language-guided forgery localization enhancer, as well as classification and localization modules. Each branch of the multi-branch feature extractor learns to classify forgery attributes at one level, while localization and classification modules segment pixel-level forgery regions and detect image-level forgery, respectively. Also, the language-guided forgery localization enhancer (LFLE), containing image and text encoders learned by contrastive language-image pre-training (CLIP), is used to further enrich the IFDL representation. LFLE takes specifically designed texts and the given image as multi-modal inputs and then generates the visual embedding and manipulation score maps, which are used to further improve HiFi-Net++ manipulation localization performance. Lastly, we construct a hierarchical fine-grained dataset to facilitate our study. We demonstrate the effectiveness of our method on $8$ by using different benchmarks for both tasks of IFDL and forgery attribute classification. Our source code and dataset are available., Comment: Accepted by IJCV2024. arXiv admin note: substantial text overlap with arXiv:2303.17111
Published: 2024

9. Proactive Schemes: A Survey of Adversarial Attacks for Social Good

Author: Asnani, Vishal, Yin, Xi, and Liu, Xiaoming
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Adversarial attacks in computer vision exploit the vulnerabilities of machine learning models by introducing subtle perturbations to input data, often leading to incorrect predictions or classifications. These attacks have evolved in sophistication with the advent of deep learning, presenting significant challenges in critical applications, which can be harmful for society. However, there is also a rich line of research from a transformative perspective that leverages adversarial techniques for social good. Specifically, we examine the rise of proactive schemes-methods that encrypt input data using additional signals termed templates, to enhance the performance of deep learning models. By embedding these imperceptible templates into digital media, proactive schemes are applied across various applications, from simple image enhancements to complicated deep learning frameworks to aid performance, as compared to the passive schemes, which don't change the input data distribution for their framework. The survey delves into the methodologies behind these proactive schemes, the encryption and learning processes, and their application to modern computer vision and natural language processing applications. Additionally, it discusses the challenges, potential vulnerabilities, and future directions for proactive schemes, ultimately highlighting their potential to foster the responsible and secure advancement of deep learning technologies., Comment: Submitted for review
Published: 2024

10. Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending

Author: Pan, Yongyang, Liu, Xiaohong, Luo, Siqi, Xin, Yi, Guo, Xiao, Liu, Xiaoming, Min, Xiongkuo, and Zhai, Guangtao
Subjects: Computer Science - Multimedia, Computer Science - Cryptography and Security, Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: Rapid advancements in multimodal large language models have enabled the creation of hyper-realistic images from textual descriptions. However, these advancements also raise significant concerns about unauthorized use, which hinders their broader distribution. Traditional watermarking methods often require complex integration or degrade image quality. To address these challenges, we introduce a novel framework Towards Effective user Attribution for latent diffusion models via Watermark-Informed Blending (TEAWIB). TEAWIB incorporates a unique ready-to-use configuration approach that allows seamless integration of user-specific watermarks into generative models. This approach ensures that each user can directly apply a pre-configured set of parameters to the model without altering the original model parameters or compromising image quality. Additionally, noise and augmentation operations are embedded at the pixel level to further secure and stabilize watermarked images. Extensive experiments validate the effectiveness of TEAWIB, showcasing the state-of-the-art performance in perceptual quality and attribution accuracy., Comment: 9 pages, 7 figures
Published: 2024

11. Reliable Deep Diffusion Tensor Estimation: Rethinking the Power of Data-Driven Optimization Routine

Author: Li, Jialong, Zhang, Zhicheng, Chen, Yunwei, Lu, Qiqi, Wu, Ye, Liu, Xiaoming, Feng, QianJin, Feng, Yanqiu, and Zhang, Xinyuan
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: Diffusion tensor imaging (DTI) holds significant importance in clinical diagnosis and neuroscience research. However, conventional model-based fitting methods often suffer from sensitivity to noise, leading to decreased accuracy in estimating DTI parameters. While traditional data-driven deep learning methods have shown potential in terms of accuracy and efficiency, their limited generalization to out-of-training-distribution data impedes their broader application due to the diverse scan protocols used across centers, scanners, and studies. This work aims to tackle these challenges and promote the use of DTI by introducing a data-driven optimization-based method termed DoDTI. DoDTI combines the weighted linear least squares fitting algorithm and regularization by denoising technique. The former fits DW images from diverse acquisition settings into diffusion tensor field, while the latter applies a deep learning-based denoiser to regularize the diffusion tensor field instead of the DW images, which is free from the limitation of fixed-channel assignment of the network. The optimization object is solved using the alternating direction method of multipliers and then unrolled to construct a deep neural network, leveraging a data-driven strategy to learn network parameters. Extensive validation experiments are conducted utilizing both internally simulated datasets and externally obtained in-vivo datasets. The results, encompassing both qualitative and quantitative analyses, showcase that the proposed method attains state-of-the-art performance in DTI parameter estimation. Notably, it demonstrates superior generalization, accuracy, and efficiency, rendering it highly reliable for widespread application in the field.
Published: 2024

12. COMPOSE: Comprehensive Portrait Shadow Editing

Author: Hou, Andrew, Shu, Zhixin, Zhang, Xuaner, Zhang, He, Hold-Geoffroy, Yannick, Yoon, Jae Shin, and Liu, Xiaoming
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Existing portrait relighting methods struggle with precise control over facial shadows, particularly when faced with challenges such as handling hard shadows from directional light sources or adjusting shadows while remaining in harmony with existing lighting conditions. In many situations, completely altering input lighting is undesirable for portrait retouching applications: one may want to preserve some authenticity in the captured environment. Existing shadow editing methods typically restrict their application to just the facial region and often offer limited lighting control options, such as shadow softening or rotation. In this paper, we introduce COMPOSE: a novel shadow editing pipeline for human portraits, offering precise control over shadow attributes such as shape, intensity, and position, all while preserving the original environmental illumination of the portrait. This level of disentanglement and controllability is obtained thanks to a novel decomposition of the environment map representation into ambient light and an editable gaussian dominant light source. COMPOSE is a four-stage pipeline that consists of light estimation and editing, light diffusion, shadow synthesis, and finally shadow editing. We define facial shadows as the result of a dominant light source, encoded using our novel gaussian environment map representation. Utilizing an OLAT dataset, we have trained models to: (1) predict this light source representation from images, and (2) generate realistic shadows using this representation. We also demonstrate comprehensive and intuitive shadow editing with our pipeline. Through extensive quantitative and qualitative evaluations, we have demonstrated the robust capability of our system in shadow editing., Comment: Accepted at ECCV 2024
Published: 2024

13. Revisit Self-supervised Depth Estimation with Local Structure-from-Motion

Author: Zhu, Shengjie and Liu, Xiaoming
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Both self-supervised depth estimation and Structure-from-Motion (SfM) recover scene depth from RGB videos. Despite sharing a similar objective, the two approaches are disconnected. Prior works of self-supervision backpropagate losses defined within immediate neighboring frames. Instead of learning-through-loss, this work proposes an alternative scheme by performing local SfM. First, with calibrated RGB or RGB-D images, we employ a depth and correspondence estimator to infer depthmaps and pair-wise correspondence maps. Then, a novel bundle-RANSAC-adjustment algorithm jointly optimizes camera poses and one depth adjustment for each depthmap. Finally, we fix camera poses and employ a NeRF, however, without a neural network, for dense triangulation and geometric verification. Poses, depth adjustments, and triangulated sparse depths are our outputs. For the first time, we show self-supervision within $5$ frames already benefits SoTA supervised depth and correspondence models. The project page is held in the link (https://shngjz.github.io/SSfM.github.io/).
Published: 2024

14. RePLAy: Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry

Author: Zhu, Shengjie, Ganesan, Girish Chandar, Kumar, Abhinav, and Liu, Xiaoming
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: 3D sensing is a fundamental task for Autonomous Vehicles. Its deployment often relies on aligned RGB cameras and LiDAR. Despite meticulous synchronization and calibration, systematic misalignment persists in LiDAR projected depthmap. This is due to the physical baseline distance between the two sensors. The artifact is often reflected as background LiDAR incorrectly projected onto the foreground, such as cars and pedestrians. The KITTI dataset uses stereo cameras as a heuristic solution to remove artifacts. However most AV datasets, including nuScenes, Waymo, and DDAD, lack stereo images, making the KITTI solution inapplicable. We propose RePLAy, a parameter-free analytical solution to remove the projective artifacts. We construct a binocular vision system between a hypothesized virtual LiDAR camera and the RGB camera. We then remove the projective artifacts by determining the epipolar occlusion with the proposed analytical solution. We show unanimous improvement in the State-of-The-Art (SoTA) monocular depth estimators and 3D object detectors with the artifacts-free depthmaps.
Published: 2024

15. Open-Set Biometrics: Beyond Good Closed-Set Models

Author: Su, Yiyang, Kim, Minchul, Liu, Feng, Jain, Anil, and Liu, Xiaoming
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Biometric recognition has primarily addressed closed-set identification, assuming all probe subjects are in the gallery. However, most practical applications involve open-set biometrics, where probe subjects may or may not be present in the gallery. This poses distinct challenges in effectively distinguishing individuals in the gallery while minimizing false detections. While it is commonly believed that powerful biometric models can excel in both closed- and open-set scenarios, existing loss functions are inconsistent with open-set evaluation. They treat genuine (mated) and imposter (non-mated) similarity scores symmetrically and neglect the relative magnitudes of imposter scores. To address these issues, we simulate open-set evaluation using minibatches during training and introduce novel loss functions: (1) the identification-detection loss optimized for open-set performance under selective thresholds and (2) relative threshold minimization to reduce the maximum negative score for each probe. Across diverse biometric tasks, including face recognition, gait recognition, and person re-identification, our experiments demonstrate the effectiveness of the proposed loss functions, significantly enhancing open-set performance while positively impacting closed-set performance. Our code and models are available at https://github.com/prevso1088/open-set-biometrics., Comment: Published at ECCV 2024
Published: 2024

16. Concentrate Attention: Towards Domain-Generalizable Prompt Optimization for Language Models

Author: Li, Chengzhengxu, Liu, Xiaoming, Zhang, Zhaohan, Wang, Yichen, Liu, Chen, Lan, Yu, and Shen, Chao
Subjects: Computer Science - Computation and Language
Abstract: Recent advances in prompt optimization have notably enhanced the performance of pre-trained language models (PLMs) on downstream tasks. However, the potential of optimized prompts on domain generalization has been under-explored. To explore the nature of prompt generalization on unknown domains, we conduct pilot experiments and find that (i) Prompts gaining more attention weight from PLMs' deep layers are more generalizable and (ii) Prompts with more stable attention distributions in PLMs' deep layers are more generalizable. Thus, we offer a fresh objective towards domain-generalizable prompts optimization named "Concentration", which represents the "lookback" attention from the current decoding token to the prompt tokens, to increase the attention strength on prompts and reduce the fluctuation of attention distribution. We adapt this new objective to popular soft prompt and hard prompt optimization methods, respectively. Extensive experiments demonstrate that our idea improves comparison prompt optimization methods by 1.42% for soft prompt generalization and 2.16% for hard prompt generalization in accuracy on the multi-source domain generalization setting, while maintaining satisfying in-domain performance. The promising results validate the effectiveness of our proposed prompt optimization objective and provide key insights into domain-generalizable prompts., Comment: NeurIPS 2024 Main Track
Published: 2024

17. Insight into the cadmium and zinc binding potential of humic acids derived from composts by EEM spectra combined with PARAFAC analysis

Author: Liu Minru, Tang Zhihua, Lin Zhenrong, Guo Huafang, Yu Zhen, Liu Xiaoming, and Fang Kejing
Subjects: heavy metal, humic-like substance, protein-like substance, binding parameter, eem-parafac analysis, Chemistry, QD1-999
Published: 2020
Full Text: View/download PDF

18. Underground wireless charging device deployment algorithm based on grid divisio

Author: GUO Yu, LIU Xiaoming, FENG Kai, DING Enjie, and ZHAO Duan
Subjects: underground wireless charging, underground wireless charging device deployment, wireless rechargeable sensor networks, wireless charging device, grid division, charging coverage, Mining engineering. Metallurgy, TN1-997
Abstract: Aiming at scenario of using wireless charging device to carry out energy transmission for underground wireless rechargeable sensor networks, deployment of charging device was transformed into charging coverage problem, and the optimal charging coverage model was established based on charging model. In order to obtain the approximate optimal solution of the optimal charging coverage model, a wireless charging device deployment algorithm based on grid division was proposed. The optimal position of charging device is determined through grid division and scanning, and the number of charging devices is minimized while charging coverage of all sensor nodes is satisfied. The simulation results show that when the number of sensor nodes is small, grid size has little influence on charging coverage, while charging radius has a great influence on the number of charging devices. When the number of sensor nodes is large, grid size has a certain influence on charging coverage, while charging radius has little influence on the number of charging devices. When charging radius exceeds a certain range, the number of charging devices required is almost constant.
Published: 2018
Full Text: View/download PDF

19. StablePT: Towards Stable Prompting for Few-shot Learning via Input Separation

Author: Liu, Xiaoming, Liu, Chen, Zhang, Zhaohan, Li, Chengzhengxu, Wang, Longtian, Lan, Yu, and Shen, Chao
Subjects: Computer Science - Computation and Language
Abstract: Large language models have shown their ability to become effective few-shot learners with prompting, revolutionizing the paradigm of learning with data scarcity. However, this approach largely depends on the quality of prompt initialization, and always exhibits large variability among different runs. Such property makes prompt tuning highly unreliable and vulnerable to poorly constructed prompts, which limits its extension to more real-world applications. To tackle this issue, we propose to treat the hard prompt and soft prompt as separate inputs to mitigate noise brought by the prompt initialization. Furthermore, we optimize soft prompts with contrastive learning for utilizing class-aware information in the training process to maintain model performance. Experimental results demonstrate that \sysname outperforms state-of-the-art methods by 6.97% in accuracy and reduces the standard deviation by 1.92 on average. Furthermore, extensive experiments underscore its robustness and stability across 8 datasets covering various tasks. Codes are available at https://github.com/lccc0528/Stable/tree/main., Comment: EMNLP 2024 Findings
Published: 2024

20. Second Edition FRCSyn Challenge at CVPR 2024: Face Recognition Challenge in the Era of Synthetic Data

Author: DeAndres-Tame, Ivan, Tolosana, Ruben, Melzi, Pietro, Vera-Rodriguez, Ruben, Kim, Minchul, Rathgeb, Christian, Liu, Xiaoming, Morales, Aythami, Fierrez, Julian, Ortega-Garcia, Javier, Zhong, Zhizhou, Huang, Yuge, Mi, Yuxi, Ding, Shouhong, Zhou, Shuigeng, He, Shuai, Fu, Lingzhi, Cong, Heng, Zhang, Rongyu, Xiao, Zhihong, Smirnov, Evgeny, Pimenov, Anton, Grigorev, Aleksei, Timoshenko, Denis, Asfaw, Kaleb Mesfin, Low, Cheng Yaw, Liu, Hao, Wang, Chuyi, Zuo, Qing, He, Zhixiang, Shahreza, Hatef Otroshi, George, Anjith, Unnervik, Alexander, Rahimi, Parsa, Marcel, Sébastien, Neto, Pedro C., Huber, Marco, Kolf, Jan Niklas, Damer, Naser, Boutros, Fadi, Cardoso, Jaime S., Sequeira, Ana F., Atzori, Andrea, Fenu, Gianni, Marras, Mirko, Štruc, Vitomir, Yu, Jiang, Li, Zhangjie, Li, Jichun, Zhao, Weisong, Lei, Zhen, Zhu, Xiangyu, Zhang, Xiao-Yu, Biesseck, Bernardo, Vidal, Pedro, Coelho, Luiz, Granada, Roger, and Menotti, David
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Computers and Society, Computer Science - Machine Learning
Abstract: Synthetic data is gaining increasing relevance for training machine learning models. This is mainly motivated due to several factors such as the lack of real data and intra-class variability, time and errors produced in manual labeling, and in some cases privacy concerns, among others. This paper presents an overview of the 2nd edition of the Face Recognition Challenge in the Era of Synthetic Data (FRCSyn) organized at CVPR 2024. FRCSyn aims to investigate the use of synthetic data in face recognition to address current technological limitations, including data privacy concerns, demographic biases, generalization to novel scenarios, and performance constraints in challenging situations such as aging, pose variations, and occlusions. Unlike the 1st edition, in which synthetic data from DCFace and GANDiffFace methods was only allowed to train face recognition systems, in this 2nd edition we propose new sub-tasks that allow participants to explore novel face generative methods. The outcomes of the 2nd FRCSyn Challenge, along with the proposed experimental protocol and benchmarking contribute significantly to the application of synthetic data to face recognition., Comment: arXiv admin note: text overlap with arXiv:2311.10476
Published: 2024

21. SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects

Author: Kumar, Abhinav, Guo, Yuliang, Huang, Xinyu, Ren, Liu, and Liu, Xiaoming
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Monocular 3D detectors achieve remarkable performance on cars and smaller objects. However, their performance drops on larger objects, leading to fatal accidents. Some attribute the failures to training data scarcity or their receptive field requirements of large objects. In this paper, we highlight this understudied problem of generalization to large objects. We find that modern frontal detectors struggle to generalize to large objects even on nearly balanced datasets. We argue that the cause of failure is the sensitivity of depth regression losses to noise of larger objects. To bridge this gap, we comprehensively investigate regression and dice losses, examining their robustness under varying error levels and object sizes. We mathematically prove that the dice loss leads to superior noise-robustness and model convergence for large objects compared to regression losses for a simplified case. Leveraging our theoretical insights, we propose SeaBird (Segmentation in Bird's View) as the first step towards generalizing to large objects. SeaBird effectively integrates BEV segmentation on foreground objects for 3D detection, with the segmentation head trained with the dice loss. SeaBird achieves SoTA results on the KITTI-360 leaderboard and improves existing detectors on the nuScenes leaderboard, particularly for large objects. Code and models at https://github.com/abhi1kumar/SeaBird, Comment: CVPR 2024
Published: 2024

22. KeyPoint Relative Position Encoding for Face Recognition

Author: Kim, Minchul, Su, Yiyang, Liu, Feng, Jain, Anil, and Liu, Xiaoming
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we address the challenge of making ViT models more robust to unseen affine transformations. Such robustness becomes useful in various recognition tasks such as face recognition when image alignment failures occur. We propose a novel method called KP-RPE, which leverages key points (e.g.~facial landmarks) to make ViT more resilient to scale, translation, and pose variations. We begin with the observation that Relative Position Encoding (RPE) is a good way to bring affine transform generalization to ViTs. RPE, however, can only inject the model with prior knowledge that nearby pixels are more important than far pixels. Keypoint RPE (KP-RPE) is an extension of this principle, where the significance of pixels is not solely dictated by their proximity but also by their relative positions to specific keypoints within the image. By anchoring the significance of pixels around keypoints, the model can more effectively retain spatial relationships, even when those relationships are disrupted by affine transformations. We show the merit of KP-RPE in face and gait recognition. The experimental results demonstrate the effectiveness in improving face recognition performance from low-quality images, particularly where alignment is prone to failure. Code and pre-trained models are available., Comment: To appear in CVPR2024
Published: 2024

23. ProMark: Proactive Diffusion Watermarking for Causal Attribution

Author: Asnani, Vishal, Collomosse, John, Bui, Tu, Liu, Xiaoming, and Agarwal, Shruti
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Generative AI (GenAI) is transforming creative workflows through the capability to synthesize and manipulate images via high-level prompts. Yet creatives are not well supported to receive recognition or reward for the use of their content in GenAI training. To this end, we propose ProMark, a causal attribution technique to attribute a synthetically generated image to its training data concepts like objects, motifs, templates, artists, or styles. The concept information is proactively embedded into the input training images using imperceptible watermarks, and the diffusion models (unconditional or conditional) are trained to retain the corresponding watermarks in generated images. We show that we can embed as many as $2^{16}$ unique watermarks into the training data, and each training image can contain more than one watermark. ProMark can maintain image quality whilst outperforming correlation-based attribution. Finally, several qualitative examples are presented, providing the confidence that the presence of the watermark conveys a causative relationship between training data and synthetic images., Comment: Accepted to CVPR 2024
Published: 2024

24. Comparison of Hot Corrosion Behavior of Ni36Fe34Al17Cr10Mo1Ti2 and Ni34Co25Fe12Al15Cr12W2 Alloys in NaCl–KCl–Na2SO4 Salt

Author: Liu, Xiaoming, Quan, Fengyang, Gao, Yuan, Zhang, Shaodong, Wang, Jianbin, Wang, Zhijun, Li, Junjie, He, Feng, and Wang, Jincheng
Published: 2024
Full Text: View/download PDF

25. Wavelet-guided network with fine-grained feature extraction for vessel segmentation

Author: Zhong, Yuanhong, Chen, Ting, Zhong, Daidi, and Liu, Xiaoming
Published: 2024
Full Text: View/download PDF

26. Enhancing the High Temperature Tensile Strength of Fe36Ni36Cr10Mo1Al17 Alloy by Substituting Al with Si

Author: Liu, Xiaoming, Wang, Jianbin, Jia, Yuhao, He, Xindang, Wang, Zhijun, He, Feng, Li, Junjie, and Wang, Jincheng
Published: 2024
Full Text: View/download PDF

27. Effect of slake durability on time dependent swelling behavior of red-bed siltstone in Sichuan Basin

Author: Jia, Qinji, Galindo Aires, Ruben Angel, Liu, Xiaoming, and Xu, Haifeng
Published: 2024
Full Text: View/download PDF

28. Enhanced electrochemical performance of the LiMn0.6Fe0.4PO4/C modified by secondary particle morphology control combined with primary particle size control

Author: Liu, Xiaoming, Wen, Lizhi, and Guan, Zhiwei
Published: 2024
Full Text: View/download PDF

29. Review on irradiation effects on quality of frozen meat food

Author: DONG Juancong, CHENG Jiao, DANG Xuhong, WANG Chao, and LIU Xiaoming
Subjects: ionizing radiation, frozen food, quality, dose, Nuclear engineering. Atomic power, TK9001-9401
Abstract: Irradiation technology has been widely used in the field of food processing. It is urgent to figure out whether the quality of frozen meat food would change after irradiation, when the SARS-CoV-2 was detected in the imported cold-chain meat. The effects of irradiation on the quality of frozen meat are summarized from the aspects of food sensory, protein decomposition, fat oxidation, vitamin content and so on, providing reference for the formulation of irradiation for the elimination of SARS-CoV-2 and other viruses on frozen food, as well as the study of irradiated frozen meat and the industrial development of irradiated frozen food.
Published: 2022
Full Text: View/download PDF

30. A two-stage inexact programming with value-at-risk for water resources management

Author: Kong Xiangming, Wang Donglin, Wang Chunxiao, Wang Yu, Yao Xiufeng, and Liu Xiaoming
Subjects: Environmental sciences, GE1-350
Abstract: In this study, the application of a two-stage inexact programming with value-at-risk (TIPV) model in water resources system planning has been developed. The TIPV method is intended to tackle the inexact parameters and the risks of economic loss. The application of case study shows that more alternatives under multiple levels of risks could be generated. The amount of water shortages and the width of system benefit intervals would decrease as the risk increases. TIPV could provide more effective information for stakeholders to recognize social policies with maximized system benefits under various risk levels.
Published: 2022
Full Text: View/download PDF

31. The Oligocene Reifnitz tonalite (Austria) and its host rocks: implications for Cretaceous and Oligocene–Neogene tectonics of the south-eastern Eastern Alps

Author: Neubauer Franz, Heberer Bianca, Dunkl István, Liu Xiaoming, Bernroider Manfred, and Dong Yunpeng
Subjects: Periadriatic magmatism, peripheral bulge, exhumation, cooling history, shortening, Geology, QE1-996.5
Abstract: In the south-eastern Eastern Alps, the Reifnitz tonalite intruded into the Austroalpine metamorphic basement of the Wörthersee half-window exposed north of the Sarmatian–Pliocene flexural Klagenfurt basin. The Reifnitz tonalite is dated for the first time, and yields a laser ICP-MS U–Pb zircon age of 30.72±0.30 Ma. The (U–Th–Sm)/He apatite age of the tonalite is 27.6 ± 1.8 Ma implying rapid Late Oligocene cooling of the tonalite to ca. 60 °C. The Reifnitz tonalite intruded into a retrogressed amphibolite-grade metamorphic basement with a metamorphic overprint of Cretaceous age (40Ar/39Ar white mica plateau age of 90.7 ± 1.6 Ma). This fact indicates that pervasive Alpine metamorphism of Cretaceous age extends southwards almost up to the Periadriatic fault. Based on the exhumation and erosion history of the Reifnitz tonalite and the hosting Wörthersee half window formed by the Wörthersee anticline, the age of gentle folding of Austroalpine units in the south-eastern part of the Eastern Alps is likely of Oligocene age. North of the Wörthersee antiform, Upper Cretaceous–Eocene, Oligocene and Miocene sedimentary rocks of the Krappfeld basin are preserved in a gentle synform, suggesting that the top of the Krappfeld basin has always been near the Earth’s surface since the Late Cretaceous. The new data imply, therefore, that the Reifnitz tonalite is part of a post-30 Ma antiform, which was likely exhumed, uplifted and eroded in two steps. In the first step, which is dated to ca. 31–27 Ma, rapid cooling to ca. 60 °C and exhumation occurred in an E–W trending antiform, which formed as a result of a regional N–S compression. In the second step of the Sarmatian–Pliocene age a final exhumation occurred in the peripheral bulge in response to the lithospheric flexure in front of the overriding North Karawanken thrust sheet. The Klagenfurt basin developed as a flexural basin at the northern front of the North Karawanken, which represent a transpressive thrust sheet of a positive flower structure related to the final activity along the Periadriatic fault. In the Eastern Alps, on a large scale, the distribution of Periadriatic plutons and volcanics seems to monitor a northward or eastward shift of magmatic activity, with the main phase of intrusions ca. 30 Ma at the fault itself.
Published: 2018
Full Text: View/download PDF

32. A Broadband Quasi-optical System for Measuring the Dielectric Properties in the Terahertz Band

Author: Liu Xiaoming, Yu Junsheng, Chen Xiaodong, Zhou Jun, Gan Lu, and Zhang Chijian
Subjects: Terahertz, Dielectric property, Broadband, Quasi-optical system, Electricity and magnetism, QC501-766
Abstract: To fulfill the requirements of the dielectric property measurement in the terahertz band, herein, a broadband quasi-optical system was designed and verified utilizing a planar scanning system. Additionally, the method of retrieving the dielectric parameters was discussed. Our experimental findings indicated that the measurement results were in good agreement with the theoretical results. Boron silicon, and deionized water were used for verifying the measurement, and the permittivity was obtained using a numerical method. We found that the dielectric properties were in good agreement with the typical values. This indicated that the proposed quasi-optical method effectively characterized the permittivity.
Published: 2018
Full Text: View/download PDF

33. Developing a Data-Driven Emerging Skill Network Analytics Framework for Automated Employment Advert Evaluation

Author: Liu, Xiaoming and Schwieger, Dana
Abstract: Rapid advancements and emergent technologies add an additional layer of complexity to preparing computer science and information technology higher education students for entering the post pandemic job market. Knowing and predicting employers' technical skill needs is essential for shaping curriculum development to address the emergent skill gap. Examining online advertisements to determine the skills sought by employers of new hires for these emerging areas and ensuring that program course content addresses these skills can be a daunting task. In this paper, the authors describe the development of a data-driven analytics framework that can be used for evaluating emerging skill clusters in online job adverts and the application of the framework to a mobile computing course at the authors' institution.
Published: 2023

34. BigGait: Learning Gait Representation You Want by Large Vision Models

Author: Ye, Dingqiang, Fan, Chao, Ma, Jingzhe, Liu, Xiaoming, and Yu, Shiqi
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Gait recognition stands as one of the most pivotal remote identification technologies and progressively expands across research and industry communities. However, existing gait recognition methods heavily rely on task-specific upstream driven by supervised learning to provide explicit gait representations like silhouette sequences, which inevitably introduce expensive annotation costs and potential error accumulation. Escaping from this trend, this work explores effective gait representations based on the all-purpose knowledge produced by task-agnostic Large Vision Models (LVMs) and proposes a simple yet efficient gait framework, termed BigGait. Specifically, the Gait Representation Extractor (GRE) within BigGait draws upon design principles from established gait representations, effectively transforming all-purpose knowledge into implicit gait representations without requiring third-party supervision signals. Experiments on CCPG, CAISA-B* and SUSTech1K indicate that BigGait significantly outperforms the previous methods in both within-domain and cross-domain tasks in most cases, and provides a more practical paradigm for learning the next-generation gait representation. Finally, we delve into prospective challenges and promising directions in LVMs-based gait recognition, aiming to inspire future work in this emerging topic. The source code is available at https://github.com/ShiqiYu/OpenGait.
Published: 2024

35. UnlearnCanvas: Stylized Image Dataset for Enhanced Machine Unlearning Evaluation in Diffusion Models

Author: Zhang, Yihua, Fan, Chongyu, Zhang, Yimeng, Yao, Yuguang, Jia, Jinghan, Liu, Jiancheng, Zhang, Gaoyuan, Liu, Gaowen, Kompella, Ramana Rao, Liu, Xiaoming, and Liu, Sijia
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The technological advancements in diffusion models (DMs) have demonstrated unprecedented capabilities in text-to-image generation and are widely used in diverse applications. However, they have also raised significant societal concerns, such as the generation of harmful content and copyright disputes. Machine unlearning (MU) has emerged as a promising solution, capable of removing undesired generative capabilities from DMs. However, existing MU evaluation systems present several key challenges that can result in incomplete and inaccurate assessments. To address these issues, we propose UnlearnCanvas, a comprehensive high-resolution stylized image dataset that facilitates the evaluation of the unlearning of artistic styles and associated objects. This dataset enables the establishment of a standardized, automated evaluation framework with 7 quantitative metrics assessing various aspects of the unlearning performance for DMs. Through extensive experiments, we benchmark 9 state-of-the-art MU methods for DMs, revealing novel insights into their strengths, weaknesses, and underlying mechanisms. Additionally, we explore challenging unlearning scenarios for DMs to evaluate worst-case performance against adversarial prompts, the unlearning of finer-scale concepts, and sequential unlearning. We hope that this study can pave the way for developing more effective, accurate, and robust DM unlearning methods, ensuring safer and more ethical applications of DMs in the future. The dataset, benchmark, and codes are publicly available at https://unlearn-canvas.netlify.app/., Comment: NeurIPS 2024 Dataset & Benchmark Track
Published: 2024

36. Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks

Author: Wang, Yichen, Feng, Shangbin, Hou, Abe Bohan, Pu, Xiao, Shen, Chao, Liu, Xiaoming, Tsvetkov, Yulia, and He, Tianxing
Subjects: Computer Science - Computation and Language
Abstract: The widespread use of large language models (LLMs) is increasing the demand for methods that detect machine-generated text to prevent misuse. The goal of our study is to stress test the detectors' robustness to malicious attacks under realistic scenarios. We comprehensively study the robustness of popular machine-generated text detectors under attacks from diverse categories: editing, paraphrasing, prompting, and co-generating. Our attacks assume limited access to the generator LLMs, and we compare the performance of detectors on different attacks under different budget levels. Our experiments reveal that almost none of the existing detectors remain robust under all the attacks, and all detectors exhibit different loopholes. Averaging all detectors, the performance drops by 35% across all attacks. Further, we investigate the reasons behind these defects and propose initial out-of-the-box patches to improve robustness.
Published: 2024

37. Does DetectGPT Fully Utilize Perturbation? Bridging Selective Perturbation to Fine-tuned Contrastive Learning Detector would be Better

Author: Liu, Shengchao, Liu, Xiaoming, Wang, Yichen, Cheng, Zehua, Li, Chengzhengxu, Zhang, Zhaohan, Lan, Yu, and Shen, Chao
Subjects: Computer Science - Computation and Language
Abstract: The burgeoning generative capabilities of large language models (LLMs) have raised growing concerns about abuse, demanding automatic machine-generated text detectors. DetectGPT, a zero-shot metric-based detector, first introduces perturbation and shows great performance improvement. However, in DetectGPT, the random perturbation strategy could introduce noise, and logit regression depends on the threshold, harming the generalizability and applicability of individual or small-batch inputs. Hence, we propose a novel fine-tuned detector, Pecola, bridging metric-based and fine-tuned methods by contrastive learning on selective perturbation. Selective strategy retains important tokens during perturbation and weights for multi-pair contrastive learning. The experiments show that Pecola outperforms the state-of-the-art (SOTA) by 1.20% in accuracy on average on four public datasets. And we further analyze the effectiveness, robustness, and generalization of the method.
Published: 2024

38. Unified Physical-Digital Face Attack Detection

Author: Fang, Hao, Liu, Ajian, Yuan, Haocheng, Zheng, Junze, Zeng, Dingheng, Liu, Yanhong, Deng, Jiankang, Escalera, Sergio, Liu, Xiaoming, Wan, Jun, and Lei, Zhen
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Face Recognition (FR) systems can suffer from physical (i.e., print photo) and digital (i.e., DeepFake) attacks. However, previous related work rarely considers both situations at the same time. This implies the deployment of multiple models and thus more computational burden. The main reasons for this lack of an integrated model are caused by two factors: (1) The lack of a dataset including both physical and digital attacks with ID consistency which means the same ID covers the real face and all attack types; (2) Given the large intra-class variance between these two attacks, it is difficult to learn a compact feature space to detect both attacks simultaneously. To address these issues, we collect a Unified physical-digital Attack dataset, called UniAttackData. The dataset consists of $1,800$ participations of 2 and 12 physical and digital attacks, respectively, resulting in a total of 29,706 videos. Then, we propose a Unified Attack Detection framework based on Vision-Language Models (VLMs), namely UniAttackDetection, which includes three main modules: the Teacher-Student Prompts (TSP) module, focused on acquiring unified and specific knowledge respectively; the Unified Knowledge Mining (UKM) module, designed to capture a comprehensive feature space; and the Sample-Level Prompt Interaction (SLPI) module, aimed at grasping sample-level semantics. These three modules seamlessly form a robust unified attack detection framework. Extensive experiments on UniAttackData and three other datasets demonstrate the superiority of our approach for unified face attack detection., Comment: 12 pages, 8 figures
Published: 2024

39. A Generalist FaceX via Learning Unified Facial Representation

Author: Han, Yue, Zhang, Jiangning, Zhu, Junwei, Li, Xiangtai, Ge, Yanhao, Li, Wei, Wang, Chengjie, Liu, Yong, Liu, Xiaoming, and Tai, Ying
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This work presents FaceX framework, a novel facial generalist model capable of handling diverse facial tasks simultaneously. To achieve this goal, we initially formulate a unified facial representation for a broad spectrum of facial editing tasks, which macroscopically decomposes a face into fundamental identity, intra-personal variation, and environmental factors. Based on this, we introduce Facial Omni-Representation Decomposing (FORD) for seamless manipulation of various facial components, microscopically decomposing the core aspects of most facial editing tasks. Furthermore, by leveraging the prior of a pretrained StableDiffusion (SD) to enhance generation quality and accelerate training, we design Facial Omni-Representation Steering (FORS) to first assemble unified facial representations and then effectively steer the SD-aware generation process by the efficient Facial Representation Controller (FRC). %Without any additional features, Our versatile FaceX achieves competitive performance compared to elaborate task-specific models on popular facial editing tasks. Full codes and models will be available at https://github.com/diffusion-facex/FaceX., Comment: Project page: https://diffusion-facex.github.io/
Published: 2023

40. INFAMOUS-NeRF: ImproviNg FAce MOdeling Using Semantically-Aligned Hypernetworks with Neural Radiance Fields

Author: Hou, Andrew, Liu, Feng, Ren, Zhiyuan, Sarkis, Michel, Bi, Ning, Tong, Yiying, and Liu, Xiaoming
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: We propose INFAMOUS-NeRF, an implicit morphable face model that introduces hypernetworks to NeRF to improve the representation power in the presence of many training subjects. At the same time, INFAMOUS-NeRF resolves the classic hypernetwork tradeoff of representation power and editability by learning semantically-aligned latent spaces despite the subject-specific models, all without requiring a large pretrained model. INFAMOUS-NeRF further introduces a novel constraint to improve NeRF rendering along the face boundary. Our constraint can leverage photometric surface rendering and multi-view supervision to guide surface color prediction and improve rendering near the surface. Finally, we introduce a novel, loss-guided adaptive sampling method for more effective NeRF training by reducing the sampling redundancy. We show quantitatively and qualitatively that our method achieves higher representation power than prior face modeling methods in both controlled and in-the-wild settings. Code and models will be released upon publication.
Published: 2023

41. Tracing Hyperparameter Dependencies for Model Parsing via Learnable Graph Pooling Network

Author: Guo, Xiao, Asnani, Vishal, Liu, Sijia, and Liu, Xiaoming
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Model Parsing defines the research task of predicting hyperparameters of the generative model (GM), given a generated image as input. Since a diverse set of hyperparameters is jointly employed by the generative model, and dependencies often exist among them, it is crucial to learn these hyperparameter dependencies for the improved model parsing performance. To explore such important dependencies, we propose a novel model parsing method called Learnable Graph Pooling Network (LGPN). Specifically, we transform model parsing into a graph node classification task, using graph nodes and edges to represent hyperparameters and their dependencies, respectively. Furthermore, LGPN incorporates a learnable pooling-unpooling mechanism tailored to model parsing, which adaptively learns hyperparameter dependencies of GMs used to generate the input image. We also extend our proposed method to CNN-generated image detection and coordinate attacks detection. Empirically, we achieve state-of-the-art results in model parsing and its extended applications, showing the effectiveness of our method. Our source code are available., Comment: 24 pages, 15 figures, 17 tables
Published: 2023

42. Geochemical and isotopic evidence for Carboniferous rifting: mafic dykes in the central Sanandaj-Sirjan zone (Dorud-Azna, West Iran)

Author: Shakerardakani Farzaneh, Neubauer Franz, Bernroider Manfred, Von Quadt Albrecht, Peytcheva Irena, Liu Xiaoming, Genser Johann, Monfaredi Behzad, and Masoudi Fariborz
Subjects: mafic dyke, Sanandaj-Sirjan Zone, 40Ar/39Ar dating, whole-rock Sr–Nd isotopes, Carboniferous rift, Palaeotethys, Geology, QE1-996.5
Abstract: In this paper, we present detailed field observations, chronological, geochemical and Sr–Nd isotopic data and discuss the petrogenetic aspects of two types of mafic dykes, of alkaline to subalkaline nature. The alkaline mafic dykes exhibit a cumulate to foliated texture and strike NW–SE, parallel to the main trend of the region. The 40Ar/39Ar amphibole age of 321.32 ± 0.55 Ma from an alkaline mafic dyke is interpreted as an indication of Carboniferous cooling through ca. 550 °C after intrusion of the dyke into the granitic Galeh-Doz orthogneiss and Amphibolite-Metagabbro units, the latter with Early Carboniferous amphibolite facies grade metamorphism and containing the Dare-Hedavand metagabbro with a similar Carboniferous age. The alkaline and subalkaline mafic dykes can be geochemically categorized into those with light REE-enriched patterns [(La/Yb)N = 8.32–9.28] and others with a rather flat REE pattern [(La/Yb)N = 1.16] and with a negative Nb anomaly. Together, the mafic dykes show oceanic island basalt to MORB geochemical signature, respectively. This is consistent, as well, with the (Tb/Yb)PM ratios. The alkaline mafic dykes were formed within an enriched mantle source at depths of ˃ 90 km, generating a suite of alkaline basalts. In comparison, the subalkaline mafic dykes were formed within more depleted mantle source at depths of ˂ 90 km. The subalkaline mafic dyke is characterized by 87Sr/86Sr ratio of 0.706 and positive ɛNd(t) value of + 0.77, whereas 87Sr/86Sr ratio of 0.708 and ɛNd(t) value of + 1.65 of the alkaline mafic dyke, consistent with the derivation from an enriched mantle source. There is no evidence that the mafic dykes were affected by significant crustal contamination during emplacement. Because of the similar age, the generation of magmas of alkaline mafic dykes and of the Dare-Hedavand metagabbro are assumed to reflect the same process of lithospheric or asthenospheric melting. Carboniferous back-arc rifting is the likely geodynamic setting of mafic dyke generation and emplacement. In contrast, the subalkaline mafic sill is likely related to the emplacement of the Jurassic Darijune gabbro.
Published: 2017
Full Text: View/download PDF

43. Revisit Self-supervised Depth Estimation with Local Structure-from-Motion

Author: Zhu, Shengjie, Liu, Xiaoming, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Leonardis, Aleš, editor, Ricci, Elisa, editor, Roth, Stefan, editor, Russakovsky, Olga, editor, Sattler, Torsten, editor, and Varol, Gül, editor
Published: 2025
Full Text: View/download PDF

44. Research on transient insulation numerical analysis method of circuit breaker in GIS under lightning impulse voltage

Author: Wu Qi, Liu Xiaoming, Yang Tian, and Li Longnv
Subjects: numerical analysis, electric fields, electric breakdown, circuit breakers, transient analysis, gas insulated switchgear, probability, transient insulation numerical analysis method, GIS, lightning impulse voltage, complex electromagnetic transient wave process phenomenon, insulation destruction, electromagnetic transient analysis method, transient insulation characteristics, circuit breaker insulation breakdown, circuit breaker insulation structure optimisation design, voltage input parameters, optimised charge simulation method, severe electric field distortion, electric field strength amplitude change rate, insulation breakdown probability reduction, maximum electric field strength, electrostatic field, voltage 550 kV, time 0 mus to 15 mus, Engineering (General). Civil engineering (General), TA1-2040
Abstract: In order to reveal the process of insulation destruction of circuit breaker in GIS under lightning impulse voltage, a new method of transient insulation numerical analysis is described in this paper. A combination of the electromagnetic transient analysis method and optimised charge simulation method (CSM) was put forward to calculate and analyse the transient insulation characteristics of 550 kV circuit breaker in GIS under lightning impulse voltage. The results show that, within 0–15 μ, though the voltage of circuit breaker did not reach its peak, there is severe electric field distortion caused by sharp change of voltage with a high probability of circuit breaker insulation breakdown. In addition, as for the circuit breaker insulation structure optimisation design, it is advisable to restrain the change rate of electric field strength amplitude so as to reduce the probability of insulation breakdown, rather than simply verify insulation structure by using maxi-mum electric field strength as the voltage input parameters of the electrostatic field under the power frequency in accordance with the traditional insulation analysis method, with the purpose of ensuring the circuit breaker can operate safely and reliably for a long time.
Published: 2019
Full Text: View/download PDF

45. Research on Athlete Posture Monitoring and Correction Technology Based on Wireless Sensing and Computer Vision Algorithms

Author: Guo, Haiying, Liu, Xiaoming, and Liu, Hui
Published: 2024
Full Text: View/download PDF

46. Heat transfer mechanism of asphalt pavement based on entransy dissipation analysis

Author: Zhao, Yu, Liu, Xiaoming, and Zhang, Xihe
Published: 2024
Full Text: View/download PDF

47. Formation of polycrystalline-Co particle-chains in Cu–Co alloy during liquid-phase sintering in a high magnetic field

Author: Zhang, Siyu, Liu, Tie, Miao, Ling, Wang, Kai, Liu, Xiaoming, and Wang, Qiang
Published: 2024
Full Text: View/download PDF

48. Enhancing oxidation resistance with Si in Fe36Ni36Al15Cr10Si2Mo1 multi-principal element alloy at 700 °C

Author: Liu, Xiaoming, Shi, Xinbo, Wang, Jianbin, Jia, Yuhao, Wang, Zhijun, He, Feng, Li, Junjie, and Wang, Jincheng
Published: 2024
Full Text: View/download PDF

49. Zero-shot learning via categorization-relevant disentanglement and discriminative samples synthesis

Author: Fang, Juan, Yang, Guan, Han, Ayou, Liu, Xiaoming, Chen, Bo, and Wang, Chen
Published: 2024
Full Text: View/download PDF

50. FRCSyn Challenge at WACV 2024:Face Recognition Challenge in the Era of Synthetic Data

Author: Melzi, Pietro, Tolosana, Ruben, Vera-Rodriguez, Ruben, Kim, Minchul, Rathgeb, Christian, Liu, Xiaoming, DeAndres-Tame, Ivan, Morales, Aythami, Fierrez, Julian, Ortega-Garcia, Javier, Zhao, Weisong, Zhu, Xiangyu, Yan, Zheyu, Zhang, Xiao-Yu, Wu, Jinlin, Lei, Zhen, Tripathi, Suvidha, Kothari, Mahak, Zama, Md Haider, Deb, Debayan, Biesseck, Bernardo, Vidal, Pedro, Granada, Roger, Fickel, Guilherme, Führ, Gustavo, Menotti, David, Unnervik, Alexander, George, Anjith, Ecabert, Christophe, Shahreza, Hatef Otroshi, Rahimi, Parsa, Marcel, Sébastien, Sarridis, Ioannis, Koutlis, Christos, Baltsou, Georgia, Papadopoulos, Symeon, Diou, Christos, Di Domenico, Nicolò, Borghi, Guido, Pellegrini, Lorenzo, Mas-Candela, Enrique, Sánchez-Pérez, Ángela, Atzori, Andrea, Boutros, Fadi, Damer, Naser, Fenu, Gianni, and Marras, Mirko
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Despite the widespread adoption of face recognition technology around the world, and its remarkable performance on current benchmarks, there are still several challenges that must be covered in more detail. This paper offers an overview of the Face Recognition Challenge in the Era of Synthetic Data (FRCSyn) organized at WACV 2024. This is the first international challenge aiming to explore the use of synthetic data in face recognition to address existing limitations in the technology. Specifically, the FRCSyn Challenge targets concerns related to data privacy issues, demographic biases, generalization to unseen scenarios, and performance limitations in challenging scenarios, including significant age disparities between enrollment and testing, pose variations, and occlusions. The results achieved in the FRCSyn Challenge, together with the proposed benchmark, contribute significantly to the application of synthetic data to improve face recognition technology., Comment: 10 pages, 1 figure, WACV 2024 Workshops
Published: 2023

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

8,052 results on '"Liu Xiaoming"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources