2,585 results on '"Xu Zhiqiang"'
Search Results
2. Elderly onset of MELAS carried an M.3243A >G mutation in a female with deafness and visual deficits: A case report
- Author
-
Lin Zijun, Yi Xu, Yang Yujia, and Xu Zhiqiang
- Subjects
case report ,epilepsy ,MELAS syndrome ,mitochondria ,stroke‐like episodes ,visual deficit ,Medicine ,Medicine (General) ,R5-920 - Abstract
Key Clinical Message MELAS is a disorder with clinical variability that also responsible for a significant portion of unexplained hereditary or childhood‐onset hearing loss. Although patients typically present in childhood, the first stroke‐like episode can occur later in life in some patients, potentially related to a lower heteroplasmy level. It is crucial to consider MELAS as a potential cause of stroke‐like events if age at presentation and symptoms are atypical, especially among middle‐aged patients without vascular risk factors. Abstract MELAS syndrome (mitochondrial encephalopathy with lactic acidosis and stroke‐like episodes) is a rare genetic condition that most patients develop stroke‐like episodes before the age of 40. We report a 52‐year‐old female with a documented 40‐year history of progressive sensorineural hearing loss, developed a visual field deficit and stroke‐like events in her middle age who finally diagnosed was MELAS. The patient was started on vitamin E, l‐carnitine, l‐arginine, and coenzyme Q10 that gradually improved before dismissal from the hospital. This case highlights the importance of considering MELAS as a potential cause of stroke‐like events if imaging findings are atypical for cerebral infarction, especially among middle‐aged patients without vascular risk factors and an unusual cause of progressive sensorineural hearing loss.
- Published
- 2024
- Full Text
- View/download PDF
3. Seismic behavior of reactive powder concrete (RPC) interior beam–to-column joints under reversed cyclic loading
- Author
-
Yu Jianbing, Xu Zhiqiang, Xia Yufeng, and Guo Zhengxing
- Subjects
Seismic response ,Nonlinear finite element analysis ,Reactive powder concrete ,Precast concrete structure ,Parametric analysis ,Materials of engineering and construction. Mechanics of materials ,TA401-492 - Abstract
Not all precast concrete structures show performance that can be obtained by experiment. Finite element analysis also plays a vital role in the study of structural performance. In this paper, a steel hysteretic is employed to reproduce the experimental response of a steel strand anchored precast concrete frame joints exposed to reversed cyclic load tests. The ultimate load and ultimate displacement obtained by finite element calculation are not more than 1% different from the experimental values. By employing the proposed model, the hysteretic curve, energy dissipation capacity, stiffness degradation, and displacement ductility of the reactive powder concrete (RPC) interior beam–column joints were analyzed. According to the analysis, in the case of the failure morphology of RPC beam-to-column joints with friction dampers, it is evident that the plastic hinge at the end of beam had shifted outward. In addition, both the energy dissipation capacity and the ultimate bearing capacity of the RPC interior beam–column joints have been improved, the ultimate load of PC2 is 28.8% higher than that of PC1. According to the aforementioned results, the parametric analysis of the RPC interior beam–column joint was performed. Subsequently, by varying the friction coefficient of the friction damper, the stirrup spacing in the panel zone and the different axial compression ratios, the influence of a number of different parameters on the mechanical properties of the joint was obtained.
- Published
- 2023
- Full Text
- View/download PDF
4. Capacity Allocation of Wind Farm Energy Storage System Considering Economic Function
- Author
-
CHEN Xiaoguang, YANG Xiuyuan, BU Siqi, and XU Zhiqiang
- Subjects
wind power ,energy storage system ,capacity configuration ,economic evaluation ,Applications of electric power ,TK4001-4102 ,Production of electric energy or power. Powerplants. Central stations ,TK1001-1841 ,Science - Abstract
The energy storage system with the ability of bidirectional absorption and release of power, realizes the temporal and spatial transfer of power and provides the key technical support for the safe and friendly integration of wind power into the grid. In practical engineering, the high cost restricts the large-scale application of energy storage. Therefore, when selecting the capacity configuration scheme, the introduction of economic evaluation under the premise of meeting the functional applicability can accelerate the industrialization of energy storage. Combined with the capacity configuration of economic and functional energy storage, this paper integrated three typical evaluation criteria for the economic evaluation of energy storage system, summarized three methods of capacity configuration, and analyzed the advantages and disadvantages of the three methods by comparison. Moreover, the basic process of capacity allocation of energy storage system was given. Finally, the existing problems and future research directions of energy storage configuration were pointed out.
- Published
- 2022
- Full Text
- View/download PDF
5. Exploring the Generalization Capabilities of AID-based Bi-level Optimization
- Author
-
Chen, Congliang, Shen, Li, Xu, Zhiqiang, Liu, Wei, Luo, Zhi-Quan, and Zhao, Peilin
- Subjects
Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
Bi-level optimization has achieved considerable success in contemporary machine learning applications, especially for given proper hyperparameters. However, due to the two-level optimization structure, commonly, researchers focus on two types of bi-level optimization methods: approximate implicit differentiation (AID)-based and iterative differentiation (ITD)-based approaches. ITD-based methods can be readily transformed into single-level optimization problems, facilitating the study of their generalization capabilities. In contrast, AID-based methods cannot be easily transformed similarly but must stay in the two-level structure, leaving their generalization properties enigmatic. In this paper, although the outer-level function is nonconvex, we ascertain the uniform stability of AID-based methods, which achieves similar results to a single-level nonconvex problem. We conduct a convergence analysis for a carefully chosen step size to maintain stability. Combining the convergence and stability results, we give the generalization ability of AID-based bi-level optimization methods. Furthermore, we carry out an ablation study of the parameters and assess the performance of these methods on real-world tasks. Our experimental results corroborate the theoretical findings, demonstrating the effectiveness and potential applications of these methods.
- Published
- 2024
6. Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
- Author
-
Shao, Shitong, Zhou, Zikai, Ye, Tian, Bai, Lichen, Xu, Zhiqiang, and Xie, Zeke
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
Text-to-image diffusion models (DMs) develop at an unprecedented pace, supported by thorough theoretical exploration and empirical analysis. Unfortunately, the discrepancy between DMs and autoregressive models (ARMs) complicates the path toward achieving the goal of unified vision and language generation. Recently, the masked generative Transformer (MGT) serves as a promising intermediary between DM and ARM by predicting randomly masked image tokens (i.e., masked image modeling), combining the efficiency of DM with the discrete token nature of ARM. However, we find that the comprehensive analyses regarding the inference for MGT are virtually non-existent, and thus we aim to present positive design choices to fill this gap. We modify and re-design a set of DM-based inference techniques for MGT and further elucidate their performance on MGT. We also discuss the approach to correcting token's distribution to enhance inference. Extensive experiments and empirical analyses lead to concrete and effective design choices, and these design choices can be merged to achieve further performance gains. For instance, in terms of enhanced inference, we achieve winning rates of approximately 70% compared to vanilla sampling on HPS v2 with the recent SOTA MGT Meissonic. Our contributions have the potential to further enhance the capabilities and future development of MGTs.
- Published
- 2024
7. Golden Noise for Diffusion Models: A Learning Framework
- Author
-
Zhou, Zikai, Shao, Shitong, Bai, Lichen, Xu, Zhiqiang, Han, Bo, and Xie, Zeke
- Subjects
Computer Science - Machine Learning ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Text-to-image diffusion model is a popular paradigm that synthesizes personalized images by providing a text prompt and a random Gaussian noise. While people observe that some noises are ``golden noises'' that can achieve better text-image alignment and higher human preference than others, we still lack a machine learning framework to obtain those golden noises. To learn golden noises for diffusion sampling, we mainly make three contributions in this paper. First, we identify a new concept termed the \textit{noise prompt}, which aims at turning a random Gaussian noise into a golden noise by adding a small desirable perturbation derived from the text prompt. Following the concept, we first formulate the \textit{noise prompt learning} framework that systematically learns ``prompted'' golden noise associated with a text prompt for diffusion models. Second, we design a noise prompt data collection pipeline and collect a large-scale \textit{noise prompt dataset}~(NPD) that contains 100k pairs of random noises and golden noises with the associated text prompts. With the prepared NPD as the training dataset, we trained a small \textit{noise prompt network}~(NPNet) that can directly learn to transform a random noise into a golden noise. The learned golden noise perturbation can be considered as a kind of prompt for noise, as it is rich in semantic information and tailored to the given text prompt. Third, our extensive experiments demonstrate the impressive effectiveness and generalization of NPNet on improving the quality of synthesized images across various diffusion models, including SDXL, DreamShaper-xl-v2-turbo, and Hunyuan-DiT. Moreover, NPNet is a small and efficient controller that acts as a plug-and-play module with very limited additional inference and computational costs, as it just provides a golden noise instead of a random noise without accessing the original pipeline.
- Published
- 2024
8. Online Parallel Multi-Task Relationship Learning via Alternating Direction Method of Multipliers
- Author
-
Li, Ruiyu, Zhao, Peilin, Li, Guangxia, Xu, Zhiqiang, and Li, Xuewei
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
Online multi-task learning (OMTL) enhances streaming data processing by leveraging the inherent relations among multiple tasks. It can be described as an optimization problem in which a single loss function is defined for multiple tasks. Existing gradient-descent-based methods for this problem might suffer from gradient vanishing and poor conditioning issues. Furthermore, the centralized setting hinders their application to online parallel optimization, which is vital to big data analytics. Therefore, this study proposes a novel OMTL framework based on the alternating direction multiplier method (ADMM), a recent breakthrough in optimization suitable for the distributed computing environment because of its decomposable and easy-to-implement nature. The relations among multiple tasks are modeled dynamically to fit the constant changes in an online scenario. In a classical distributed computing architecture with a central server, the proposed OMTL algorithm with the ADMM optimizer outperforms SGD-based approaches in terms of accuracy and efficiency. Because the central server might become a bottleneck when the data scale grows, we further tailor the algorithm to a decentralized setting, so that each node can work by only exchanging information with local neighbors. Experimental results on a synthetic and several real-world datasets demonstrate the efficiency of our methods., Comment: Accpeted by Neurocomputing
- Published
- 2024
9. On the Comparison between Multi-modal and Single-modal Contrastive Learning
- Author
-
Huang, Wei, Han, Andi, Chen, Yongqiang, Cao, Yuan, Xu, Zhiqiang, and Suzuki, Taiji
- Subjects
Computer Science - Machine Learning - Abstract
Multi-modal contrastive learning with language supervision has presented a paradigm shift in modern machine learning. By pre-training on a web-scale dataset, multi-modal contrastive learning can learn high-quality representations that exhibit impressive robustness and transferability. Despite its empirical success, the theoretical understanding is still in its infancy, especially regarding its comparison with single-modal contrastive learning. In this work, we introduce a feature learning theory framework that provides a theoretical foundation for understanding the differences between multi-modal and single-modal contrastive learning. Based on a data generation model consisting of signal and noise, our analysis is performed on a ReLU network trained with the InfoMax objective function. Through a trajectory-based optimization analysis and generalization characterization on downstream tasks, we identify the critical factor, which is the signal-to-noise ratio (SNR), that impacts the generalizability in downstream tasks of both multi-modal and single-modal contrastive learning. Through the cooperation between the two modalities, multi-modal learning can achieve better feature learning, leading to improvements in performance in downstream tasks compared to single-modal learning. Our analysis provides a unified framework that can characterize the optimization and generalization of both single-modal and multi-modal contrastive learning. Empirical experiments on both synthetic and real-world datasets further consolidate our theoretical findings., Comment: 51pages, 1 figure, 1 table
- Published
- 2024
10. Corrected Soft Actor Critic for Continuous Control
- Author
-
Chen, Yanjun, Zhang, Xinming, Wang, Xianghui, Xu, Zhiqiang, Shen, Xiaoyu, and Zhang, Wei
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
The Soft Actor-Critic (SAC) algorithm is known for its stability and high sample efficiency in deep reinforcement learning. However, the tanh transformation applied to sampled actions in SAC distorts the action distribution, hindering the selection of the most probable actions. This paper presents a novel action sampling method that directly identifies and selects the most probable actions within the transformed distribution, thereby addressing this issue. Extensive experiments on standard continuous control benchmarks demonstrate that the proposed method significantly enhances SAC's performance, resulting in faster convergence and higher cumulative rewards compared to the original algorithm.
- Published
- 2024
11. TextLap: Customizing Language Models for Text-to-Layout Planning
- Author
-
Chen, Jian, Zhang, Ruiyi, Zhou, Yufan, Healey, Jennifer, Gu, Jiuxiang, Xu, Zhiqiang, and Chen, Changyou
- Subjects
Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Automatic generation of graphical layouts is crucial for many real-world applications, including designing posters, flyers, advertisements, and graphical user interfaces. Given the incredible ability of Large language models (LLMs) in both natural language understanding and generation, we believe that we could customize an LLM to help people create compelling graphical layouts starting with only text instructions from the user. We call our method TextLap (text-based layout planning). It uses a curated instruction-based layout planning dataset (InsLap) to customize LLMs as a graphic designer. We demonstrate the effectiveness of TextLap and show that it outperforms strong baselines, including GPT-4 based methods, for image generation and graphical design benchmarks., Comment: Accepted to the EMNLP Findings
- Published
- 2024
12. Experimental study on shear performance of basalt fiber concrete beams without web reinforcement
- Author
-
Yu Jianbing, Xia Yufeng, Liu Saijie, and Xu Zhiqiang
- Subjects
Basalt fiber concrete beams ,Failure morphology ,Bearing capacity ,Finite element analysis ,Materials of engineering and construction. Mechanics of materials ,TA401-492 - Abstract
In this study, we report the results of an experimental investigation conducted under a four-point bending loading in order to evaluate the shear performance of concrete beams reinforced with basalt fiber without web reinforcement. According to the experimental results, it is evident that the final failure morphology of concrete-basalt fiber concrete composite beams and conventional concrete beams was shear failure, whereas the failure morphology of concrete beams reinforced with full basalt fiber was bending failure. Moreover, the results also indicate that the cracking load and ultimate bearing capacity of concrete composite beams reinforced with concrete-basalt fiber are lower than those of conventional concrete beam. This is mainly due to the interface between ordinary post-cast concrete and concrete reinforced with hardened basalt fiber. Due to the superior tensile properties of fiber, composite beams and all basalt fiber concrete beams exhibit a higher degree of ductility than conventional concrete beams. Utilizing the existing calculation theory for concrete beams reinforced with fiber, it was calculated that the cracking load and ultimate load of basalt fiber beams were calculated, and the calculation results were consistent with the experimental results. Based on the results of the evaluation and the theoretical analysis, this study proposes a finite element modeling method for basalt fiber composite beams. The results of the analysis were consistent with the experimental results of all investigated beams. The established model was employed in order to investigate the influence of shear span ratio on the shear performance of basalt fiber beams.
- Published
- 2022
- Full Text
- View/download PDF
13. Exogenetic over-expression of pBDNF affects learning and memory in APPswePS1dE9 transgenic mice
- Author
-
XU Manyu, ZAHNG Yuan, YI Xu, and XU Zhiqiang
- Subjects
brain-derived neurotrophic factor propeptide ,alzheimer's disease ,learning and memory ,postsynaptic density protein 95 ,Medicine (General) ,R5-920 - Abstract
Objective To explore the effect and possible machanism of recombinant adeno-associated virus (rAAV) vector containing brain-derived neurotrophic factor propeptide (AAV-pBDNF) on learning and memory in APPswePS1dE9 transgenic mice (Alzheimer's disease, AD). Methods A total of 18 APPswePS1dE9 transgenic mice were randomly and equally divided into control group, AAV-pBDNF group and AAV-pBDNF+p75NTR antibody group. Recombinant AAV-pBDNF vector (2 μL, 1×1012 vg/mL) were stereotaxically injected into the right side of the hippocampus in AAV-pBDNF group, AAV-GFP (2 μL, 1×1012 vg/mL) into that of the control group, and same amount of AAV-pBDNF and p75NTR antibody (2 μL, 10 μg/mL) into that of the AAV-pBDNF+p75NTR antibody group. There were 6 mice in each group. Morris water maze test was performed in 4 weeks after hippocampal injection to detect the changes of learning and memory abilities in AD rats. Immunofluorescence assay was used to observe the transfection ofAAV-pBDNF in the brain. Western blotting was employed to measure the expression of postsynaptic density protein 95 (PSD-95) in the hippocampus. Results The results of Morris water maze showed that no significant difference was seen in the movement speed of each group (P > 0.05); the escape incubation period on day 5 in the control group and AAV-pBDNF+p75NTR antibody group was significantly shorter than that of day 1 (P < 0.05); similar result was seen in the pBDNF group though no statistical difference (P > 0.05); and less platforms were crossed in the AD mice of the pBDNF group than those in the other 2 groups on day 6 in space exploration experiment (P < 0.05). Immunofluorescence assay showed that after transfection, pBDNF was expressed in the neurons, but not glial cells in the hippocampus. Western blotting results indicated that the expression of PSD-95 was obviously lower in the AAV-pBDNF group than the other 2 groups (P < 0.05). Conclusion pBDNF inhibits the learning and memory abilities of AD mice by down-regulating hippocampal expression of PSD-95.
- Published
- 2021
- Full Text
- View/download PDF
14. Visual Question Decomposition on Multimodal Large Language Models
- Author
-
Zhang, Haowei, Liu, Jianzhe, Han, Zhen, Chen, Shuo, He, Bailan, Tresp, Volker, Xu, Zhiqiang, and Gu, Jindong
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
Question decomposition has emerged as an effective strategy for prompting Large Language Models (LLMs) to answer complex questions. However, while existing methods primarily focus on unimodal language models, the question decomposition capability of Multimodal Large Language Models (MLLMs) has yet to be explored. To this end, this paper explores visual question decomposition on MLLMs. Specifically, we introduce a systematic evaluation framework including a dataset and several evaluation criteria to assess the quality of the decomposed sub-questions, revealing that existing MLLMs struggle to produce high-quality sub-questions. To address this limitation, we propose a specific finetuning dataset, DecoVQA+, for enhancing the model's question decomposition capability. Aiming at enabling models to perform appropriate selective decomposition, we propose an efficient finetuning pipeline. The finetuning pipeline consists of our proposed dataset and a training objective for selective decomposition. Finetuned MLLMs demonstrate significant improvements in the quality of sub-questions and the policy of selective question decomposition. Additionally, the models also achieve higher accuracy with selective decomposition on VQA benchmark datasets., Comment: Accepted to EMNLP2024 Findings
- Published
- 2024
15. Intelligent Fish Detection System with Similarity-Aware Transformer
- Author
-
Li, Shengchen, Zuo, Haobo, Fu, Changhong, Wang, Zhiyong, and Xu, Zhiqiang
- Subjects
Computer Science - Robotics - Abstract
Fish detection in water-land transfer has significantly contributed to the fishery. However, manual fish detection in crowd-collaboration performs inefficiently and expensively, involving insufficient accuracy. To further enhance the water-land transfer efficiency, improve detection accuracy, and reduce labor costs, this work designs a new type of lightweight and plug-and-play edge intelligent vision system to automatically conduct fast fish detection with high-speed camera. Moreover, a novel similarity-aware vision Transformer for fast fish detection (FishViT) is proposed to onboard identify every single fish in a dense and similar group. Specifically, a novel similarity-aware multi-level encoder is developed to enhance multi-scale features in parallel, thereby yielding discriminative representations for varying-size fish. Additionally, a new soft-threshold attention mechanism is introduced, which not only effectively eliminates background noise from images but also accurately recognizes both the edge details and overall features of different similar fish. 85 challenging video sequences with high framerate and high-resolution are collected to establish a benchmark from real fish water-land transfer scenarios. Exhaustive evaluation conducted with this challenging benchmark has proved the robustness and effectiveness of FishViT with over 80 FPS. Real work scenario tests validate the practicality of the proposed method. The code and demo video are available at https://github.com/vision4robotics/FishViT.
- Published
- 2024
16. Alignment of Diffusion Models: Fundamentals, Challenges, and Future
- Author
-
Liu, Buhua, Shao, Shitong, Li, Bao, Bai, Lichen, Xu, Zhiqiang, Xiong, Haoyi, Kwok, James, Helal, Sumi, and Xie, Zeke
- Subjects
Computer Science - Machine Learning ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Diffusion models have emerged as the leading paradigm in generative modeling, excelling in various applications. Despite their success, these models often misalign with human intentions, generating outputs that may not match text prompts or possess desired properties. Inspired by the success of alignment in tuning large language models, recent studies have investigated aligning diffusion models with human expectations and preferences. This work mainly reviews alignment of diffusion models, covering advancements in fundamentals of alignment, alignment techniques of diffusion models, preference benchmarks, and evaluation for diffusion models. Moreover, we discuss key perspectives on current challenges and promising future directions on solving the remaining challenges in alignment of diffusion models. To the best of our knowledge, our work is the first comprehensive review paper for researchers and engineers to comprehend, practice, and research alignment of diffusion models., Comment: 35 pages, 5 figures, 3 tables
- Published
- 2024
17. LAMP: Learnable Meta-Path Guided Adversarial Contrastive Learning for Heterogeneous Graphs
- Author
-
Li, Siqing, Park, Jin-Duk, Huang, Wei, Cao, Xin, Shin, Won-Yong, and Xu, Zhiqiang
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Social and Information Networks - Abstract
Heterogeneous graph neural networks (HGNNs) have significantly propelled the information retrieval (IR) field. Still, the effectiveness of HGNNs heavily relies on high-quality labels, which are often expensive to acquire. This challenge has shifted attention towards Heterogeneous Graph Contrastive Learning (HGCL), which usually requires pre-defined meta-paths. However, our findings reveal that meta-path combinations significantly affect performance in unsupervised settings, an aspect often overlooked in current literature. Existing HGCL methods have considerable variability in outcomes across different meta-path combinations, thereby challenging the optimization process to achieve consistent and high performance. In response, we introduce \textsf{LAMP} (\underline{\textbf{L}}earn\underline{\textbf{A}}ble \underline{\textbf{M}}eta-\underline{\textbf{P}}ath), a novel adversarial contrastive learning approach that integrates various meta-path sub-graphs into a unified and stable structure, leveraging the overlap among these sub-graphs. To address the denseness of this integrated sub-graph, we propose an adversarial training strategy for edge pruning, maintaining sparsity to enhance model performance and robustness. \textsf{LAMP} aims to maximize the difference between meta-path and network schema views for guiding contrastive learning to capture the most meaningful information. Our extensive experimental study conducted on four diverse datasets from the Heterogeneous Graph Benchmark (HGB) demonstrates that \textsf{LAMP} significantly outperforms existing state-of-the-art unsupervised models in terms of accuracy and robustness., Comment: 19 pages, 7 figures
- Published
- 2024
18. Stability of Least Square Approximation under Random Sampling
- Author
-
Xu, Zhiqiang and Zhang, Xinyue
- Subjects
Mathematics - Numerical Analysis - Abstract
This paper investigates the stability of the least squares approximation $P_m^n$ within the univariate polynomial space of degree $m$, denoted by ${\mathbb P}_m$. The approximation $P_m^n$ entails identifying a polynomial in ${\mathbb P}_m$ that approximates a function $f$ over a domain $X$ based on samples of $f$ taken at $n$ randomly selected points, according to a specified measure $\rho_X$. The primary goal is to determine the sampling rate necessary to ensure the stability of $P_m^n$. Assuming the sampling points are i.i.d. with respect to a Jacobi weight function, we present the sampling rates that guarantee the stability of $P_m^n$. Specifically, for uniform random sampling, we demonstrate that a sampling rate of $n \asymp m^2$ is required to maintain stability. By integrating these findings with those of Cohen-Davenport-Leviatan, we conclude that, for uniform random sampling, the optimal sampling rate for guaranteeing the stability of $P_m^n$ is $n \asymp m^2$, up to a $\log n$ factor. Motivated by this result, we extend the impossibility theorem, previously applicable to equally spaced samples, to the case of random samples, illustrating the balance between accuracy and stability in recovering analytic functions., Comment: 26 pages
- Published
- 2024
19. Research on the potential of low temperature waste-heat heating resources based on multi-factor constraints
- Author
-
Feng Chao, Xu Zhiqiang, Jiang Ximei, and Wang Jianfu
- Subjects
low-temperature waste heat ,waste-heat heating ,resource potential ,heating area ,scenario analysis ,Engineering geology. Rock mechanics. Soil mechanics. Underground construction ,TA703-712 ,Mining engineering. Metallurgy ,TN1-997 - Abstract
Low temperature waste-heat heating is an important technology for clean heating.But the resource potential of low temperature waste-heat used for heating will be affected by time, space, temperature, industrial development and other factors.Therefore, the resource potential of low temperature waste-heat for long-term, effective & stable heating should be studied.In this study, the resource potential of low temperature waste-heat for heating was studied by the scenario analysis method and the results show that, the low temperature waste-heat resources available for heating in the northern region are about 62.262 million tce, and the heating area is about 8.12 billion square meters in 2017.Among them, the low temperature waste-heat resources above 80℃ account for 25.2%.In the future, the low temperature waste-heat resources available for heating will be 68.98 million tce, and the heating area is about 12.14 billion square meters in 2020.The low temperature waste-heat resources available for heating will be 66.487 million tce, and the heating area is about 11.7 billion square meters in 2030.
- Published
- 2020
- Full Text
- View/download PDF
20. Effects of AAV-proBDNF on hippocampal DCX positive cells and PSD-95 expression in Alzheimer's disease mice
- Author
-
XU Manyu, CHEN Jia, and XU Zhiqiang
- Subjects
brain-derived neurotrophic factor precursor ,alzheimer's disease ,postsynaptic density-95 ,Medicine (General) ,R5-920 - Abstract
Objective To explore the effects of lateral ventricle injection of recombinant adeno-associated virus-brain-derived neurotrophic factor precursor (AAV-proBDNF) on the hippocampal neurogenesis in APPswePS1dE9 transgenic mice (Alzheimer's disease, AD). Methods Fifteen APP/PS1 dE9 transgenic mice (5 months old) were randomly divided into normal control, empty vector control and AAV-proBDNF groups (n=5 in each group). Recombinant vectors of AAV-proBDNF (8×1010 v.g.) were stereotaxically delivered into the right lateral ventricle in the AAV-proBDNF group, and empty AAV vector and normal saline (4 μL) were given respectively to the mice from the empty vector control and normal control groups. Enzyme-linked immunosorbent assay was conducted to test the proBDNF level in the brain in 4 weeks after injection. The count of doublecortin (DCX) positive cells and the expression of postsynaptic density-95 (PSD-95) in the hippocampus were detected by immunohistochemical assay and Western blotting respectively. Results In 4 weeks after injection, the proBDNF level was significantly higher in the AAV-proBDNF group than the normal control group and the empty vector control group (P < 0.05). The number of DCX-positive cells in the hippocampus of AAV-proBDNF group (202.46±24.27) was significantly lower than that in the normal control group (382.26±37.58, P < 0.05) and empty vector control group (365.48±32.68, P < 0.05). In AAV-proBDNF group, the relative content of PSD-95 (0.62±0.05) was obviously lower than the normal control group (0.82±0.02, P < 0.05) and vector control group (0.86±0.07, P < 0.05). Conclusion Lateral ventricle injection of AAV-proBDNF can induce the expression of proBDNF in the brain of AD mice, and reduce the number of new neurons and synaptic plasticity in the hippocampus.
- Published
- 2019
- Full Text
- View/download PDF
21. NTIRE 2024 Challenge on Night Photography Rendering
- Author
-
Ershov, Egor, Panshin, Artyom, Karasev, Oleg, Korchagin, Sergey, Lev, Shepelev, Startsev, Alexandr, Vladimirov, Daniil, Zaychenkova, Ekaterina, Banić, Nikola, Iarchuk, Dmitrii, Efimova, Maria, Timofte, Radu, Terekhin, Arseniy, Yue, Shuwei, Liu, Yuyang, Wei, Minchen, Xu, Lu, Zhang, Chao, Wang, Yasi, Kınlı, Furkan, Yılmaz, Doğa, Özcan, Barış, Kıraç, Furkan, Liu, Shuai, Xiao, Jingyuan, Feng, Chaoyu, Wang, Hao, Shao, Guangqi, Zhang, Yuqian, Huang, Yibin, Luo, Wei, Wang, Liming, Wang, Xiaotao, Lei, Lei, Zini, Simone, Rota, Claudio, Buzzelli, Marco, Bianco, Simone, Schettini, Raimondo, Guo, Jin, Liu, Tianli, Wu, Mohao, Shao, Ben, Yang, Qirui, Li, Xianghui, Cheng, Qihua, Zhang, Fangpu, Xu, Zhiqiang, Yang, Jingyu, and Yue, Huanjing
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
This paper presents a review of the NTIRE 2024 challenge on night photography rendering. The goal of the challenge was to find solutions that process raw camera images taken in nighttime conditions, and thereby produce a photo-quality output images in the standard RGB (sRGB) space. Unlike the previous year's competition, the challenge images were collected with a mobile phone and the speed of algorithms was also measured alongside the quality of their output. To evaluate the results, a sufficient number of viewers were asked to assess the visual quality of the proposed solutions, considering the subjective nature of the task. There were 2 nominations: quality and efficiency. Top 5 solutions in terms of output quality were sorted by evaluation time (see Fig. 1). The top ranking participants' solutions effectively represent the state-of-the-art in nighttime photography rendering. More results can be found at https://nightimaging.org., Comment: 10 pages, 10 figures
- Published
- 2024
22. DALD: Improving Logits-based Detector without Logits from Black-box LLMs
- Author
-
Zeng, Cong, Tang, Shengkun, Yang, Xianjun, Chen, Yuanzhou, Sun, Yiyou, xu, zhiqiang, Li, Yao, Chen, Haifeng, Cheng, Wei, and Xu, Dongkuan
- Subjects
Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
The advent of Large Language Models (LLMs) has revolutionized text generation, producing outputs that closely mimic human writing. This blurring of lines between machine- and human-written text presents new challenges in distinguishing one from the other a task further complicated by the frequent updates and closed nature of leading proprietary LLMs. Traditional logits-based detection methods leverage surrogate models for identifying LLM-generated content when the exact logits are unavailable from black-box LLMs. However, these methods grapple with the misalignment between the distributions of the surrogate and the often undisclosed target models, leading to performance degradation, particularly with the introduction of new, closed-source models. Furthermore, while current methodologies are generally effective when the source model is identified, they falter in scenarios where the model version remains unknown, or the test set comprises outputs from various source models. To address these limitations, we present Distribution-Aligned LLMs Detection (DALD), an innovative framework that redefines the state-of-the-art performance in black-box text detection even without logits from source LLMs. DALD is designed to align the surrogate model's distribution with that of unknown target LLMs, ensuring enhanced detection capability and resilience against rapid model iterations with minimal training investment. By leveraging corpus samples from publicly accessible outputs of advanced models such as ChatGPT, GPT-4 and Claude-3, DALD fine-tunes surrogate models to synchronize with unknown source model distributions effectively.
- Published
- 2024
23. Provably Neural Active Learning Succeeds via Prioritizing Perplexing Samples
- Author
-
Bu, Dake, Huang, Wei, Suzuki, Taiji, Cheng, Ji, Zhang, Qingfu, Xu, Zhiqiang, and Wong, Hau-San
- Subjects
Computer Science - Machine Learning - Abstract
Neural Network-based active learning (NAL) is a cost-effective data selection technique that utilizes neural networks to select and train on a small subset of samples. While existing work successfully develops various effective or theory-justified NAL algorithms, the understanding of the two commonly used query criteria of NAL: uncertainty-based and diversity-based, remains in its infancy. In this work, we try to move one step forward by offering a unified explanation for the success of both query criteria-based NAL from a feature learning view. Specifically, we consider a feature-noise data model comprising easy-to-learn or hard-to-learn features disrupted by noise, and conduct analysis over 2-layer NN-based NALs in the pool-based scenario. We provably show that both uncertainty-based and diversity-based NAL are inherently amenable to one and the same principle, i.e., striving to prioritize samples that contain yet-to-be-learned features. We further prove that this shared principle is the key to their success-achieve small test error within a small labeled set. Contrastingly, the strategy-free passive learning exhibits a large test error due to the inadequate learning of yet-to-be-learned features, necessitating resort to a significantly larger label complexity for a sufficient test error reduction. Experimental results validate our findings., Comment: Accepted by the 41th Intemational Conference on Machine Learning (lCML 2024)
- Published
- 2024
24. MGCP: A Multi-Grained Correlation based Prediction Network for Multivariate Time Series
- Author
-
Chen, Zhicheng, Xiao, Xi, Xu, Ke, Zhang, Zhong, Rong, Yu, Li, Qing, Gan, Guojun, Xu, Zhiqiang, and Zhao, Peilin
- Subjects
Computer Science - Machine Learning - Abstract
Multivariate time series prediction is widely used in daily life, which poses significant challenges due to the complex correlations that exist at multi-grained levels. Unfortunately, the majority of current time series prediction models fail to simultaneously learn the correlations of multivariate time series at multi-grained levels, resulting in suboptimal performance. To address this, we propose a Multi-Grained Correlations-based Prediction (MGCP) Network, which simultaneously considers the correlations at three granularity levels to enhance prediction performance. Specifically, MGCP utilizes Adaptive Fourier Neural Operators and Graph Convolutional Networks to learn the global spatiotemporal correlations and inter-series correlations, enabling the extraction of potential features from multivariate time series at fine-grained and medium-grained levels. Additionally, MGCP employs adversarial training with an attention mechanism-based predictor and conditional discriminator to optimize prediction results at coarse-grained level, ensuring high fidelity between the generated forecast results and the actual data distribution. Finally, we compare MGCP with several state-of-the-art time series prediction algorithms on real-world benchmark datasets, and our results demonstrate the generality and effectiveness of the proposed model.
- Published
- 2024
25. Variational Bayes for Federated Continual Learning
- Author
-
Yao, Dezhong, Li, Sanmu, Dai, Yutong, Xu, Zhiqiang, Hu, Shengshan, Zhao, Peilin, and Sun, Lichao
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
Federated continual learning (FCL) has received increasing attention due to its potential in handling real-world streaming data, characterized by evolving data distributions and varying client classes over time. The constraints of storage limitations and privacy concerns confine local models to exclusively access the present data within each learning cycle. Consequently, this restriction induces performance degradation in model training on previous data, termed "catastrophic forgetting". However, existing FCL approaches need to identify or know changes in data distribution, which is difficult in the real world. To release these limitations, this paper directs attention to a broader continuous framework. Within this framework, we introduce Federated Bayesian Neural Network (FedBNN), a versatile and efficacious framework employing a variational Bayesian neural network across all clients. Our method continually integrates knowledge from local and historical data distributions into a single model, adeptly learning from new data distributions while retaining performance on historical distributions. We rigorously evaluate FedBNN's performance against prevalent methods in federated learning and continual learning using various metrics. Experimental analyses across diverse datasets demonstrate that FedBNN achieves state-of-the-art results in mitigating forgetting.
- Published
- 2024
26. Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning
- Author
-
Gao, Chengqian, de Vazelhes, William, Zhang, Hualin, Gu, Bin, and Xu, Zhiqiang
- Subjects
Computer Science - Neural and Evolutionary Computing ,Computer Science - Machine Learning - Abstract
Evolution Strategies (ES) have emerged as a competitive alternative for model-free reinforcement learning, showcasing exemplary performance in tasks like Mujoco and Atari. Notably, they shine in scenarios with imperfect reward functions, making them invaluable for real-world applications where dense reward signals may be elusive. Yet, an inherent assumption in ES, that all input features are task-relevant, poses challenges, especially when confronted with irrelevant features common in real-world problems. This work scrutinizes this limitation, particularly focusing on the Natural Evolution Strategies (NES) variant. We propose NESHT, a novel approach that integrates Hard-Thresholding (HT) with NES to champion sparsity, ensuring only pertinent features are employed. Backed by rigorous analysis and empirical tests, NESHT demonstrates its promise in mitigating the pitfalls of irrelevant features and shines in complex decision-making problems like noisy Mujoco and Atari tasks., Comment: 16 pages, including proofs in the appendix
- Published
- 2024
27. MIPI 2024 Challenge on Nighttime Flare Removal: Methods and Results
- Author
-
Dai, Yuekun, Zhang, Dafeng, Li, Xiaoming, Yue, Zongsheng, Li, Chongyi, Zhou, Shangchen, Feng, Ruicheng, Yang, Peiqing, Jin, Zhezhu, Liu, Guanqun, Loy, Chen Change, Zhang, Lize, Liu, Shuai, Feng, Chaoyu, Wang, Luyang, Chen, Shuan, Shao, Guangqi, Wang, Xiaotao, Lei, Lei, Yang, Qirui, Cheng, Qihua, Xu, Zhiqiang, Liu, Yihao, Yue, Huanjing, Yang, Jingyu, Vasluianu, Florin-Alexandru, Wu, Zongwei, Ciubotariu, George, Timofte, Radu, Zhang, Zhao, Zhao, Suiyi, Wang, Bo, Zuo, Zhichao, Wei, Yanyan, Teja, Kuppa Sai Sri, A, Jayakar Reddy, Rongali, Girish, Mitra, Kaushik, Ma, Zhihao, Liu, Yongxu, Zhang, Wanying, Shang, Wei, He, Yuhong, Peng, Long, Yu, Zhongxin, Luo, Shaofei, Wang, Jian, Miao, Yuqi, Li, Baiang, Wei, Gang, Verma, Rakshank, Maheshwari, Ritik, Tekchandani, Rahul, Hambarde, Praful, Tazi, Satya Narayan, Vipparthi, Santosh Kumar, Murala, Subrahmanyam, Zhang, Haopeng, Hou, Yingli, Yao, Mingde, S, Levin M, Sundararajan, Aniruth, and A, Hari Kumar
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging (MIPI). Building on the achievements of the previous MIPI Workshops held at ECCV 2022 and CVPR 2023, we introduce our third MIPI challenge including three tracks focusing on novel image sensors and imaging algorithms. In this paper, we summarize and review the Nighttime Flare Removal track on MIPI 2024. In total, 170 participants were successfully registered, and 14 teams submitted results in the final testing phase. The developed solutions in this challenge achieved state-of-the-art performance on Nighttime Flare Removal. More details of this challenge and the link to the dataset can be found at https://mipi-challenge.org/MIPI2024/., Comment: CVPR 2024 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Nighttime Flare Removal Challenge Report. Website: https://mipi-challenge.org/MIPI2024/
- Published
- 2024
28. Stability in Phase Retrieval: Characterizing Condition Numbers and the Optimal Vector Set
- Author
-
Xia, Yu, Xu, Zhiqiang, and Xu, Zili
- Subjects
Computer Science - Information Theory ,Mathematics - Functional Analysis ,Mathematics - Numerical Analysis - Abstract
In this paper, we primarily focus on analyzing the stability property of phase retrieval by examining the bi-Lipschitz property of the map $\Phi_{\boldsymbol{A}}(\boldsymbol{x})=|\boldsymbol{A}\boldsymbol{x}|\in \mathbb{R}_+^m$, where $\boldsymbol{x}\in \mathbb{H}^d$ and $\boldsymbol{A}\in \mathbb{H}^{m\times d}$ is the measurement matrix for $\mathbb{H}\in\{\mathbb{R},\mathbb{C}\}$. We define the condition number $\beta_{\boldsymbol{A}}=\frac{U_{\boldsymbol{A}}}{L_{\boldsymbol{A}}}$, where $L_{\boldsymbol{A}}$ and $U_{\boldsymbol{A}}$ represent the optimal lower and upper Lipschitz constants, respectively. We establish the first universal lower bound on $\beta_{\boldsymbol{A}}$ by demonstrating that for any ${\boldsymbol{A}}\in\mathbb{H}^{m\times d}$, \begin{equation*} \beta_{\boldsymbol{A}}\geq \beta_0^{\mathbb{H}}=\begin{cases} \sqrt{\frac{\pi}{\pi-2}}\,\,\approx\,\, 1.659 & \text{if $\mathbb{H}=\mathbb{R}$,}\\ \sqrt{\frac{4}{4-\pi}}\,\,\approx\,\, 2.159 & \text{if $\mathbb{H}=\mathbb{C}$.} \end{cases} \end{equation*} We prove that the condition number of a standard Gaussian matrix in $\mathbb{H}^{m\times d}$ asymptotically matches the lower bound $\beta_0^{\mathbb{H}}$ for both real and complex cases. This result indicates that the constant lower bound $\beta_0^{\mathbb{H}}$ is asymptotically tight, holding true for both the real and complex scenarios. As an application of this result, we utilize it to investigate the performance of quadratic models for phase retrieval. Lastly, we establish that for any odd integer $m\geq 3$, the harmonic frame $\boldsymbol{A}\in \mathbb{R}^{m\times 2}$ possesses the minimum condition number among all $\boldsymbol{A}\in \mathbb{R}^{m\times 2}$. We are confident that these findings carry substantial implications for enhancing our understanding of phase retrieval.
- Published
- 2024
29. Bone morphogenetic protein-2 and pulsed electrical stimulation synergistically promoted osteogenic differentiation on MC-3T3-E1 cells
- Author
-
Xie, Shaodong, Zeng, Deming, Luo, Hanwen, Zhong, Ping, Wang, Yu, Xu, Zhiqiang, and Zhang, Peibiao
- Published
- 2024
- Full Text
- View/download PDF
30. Research of Comprehensive Parameter Measure Method of RV Reducer
- Author
-
Zhang Xiyang, Chen Zhixin, and Xu Zhiqiang
- Subjects
RV reducer ,Comprehensive parameter ,Measuring method ,Mechanical engineering and machinery ,TJ1-1570 - Abstract
RV reducer is a key part of industry robot,its comprehensive performance is closely related to the positioning accuracy of robot. By analyzing the principle of RV reducer measurement,the measurement method of comprehensive parameters of RV reducer is studied,and the measurement and data processing method of each parameters are designed,which can accomplish automated measurement of comprehensive parameters in different RV reducer model. The mechanical structure and hardware system of the measuring instrument are cooperated reliably,and the complicated manual measurement is avoided. Through the error measurement of five items( Include transmission error,rotation error,torsional stiffness,static friction torque and dynamic friction torque),proved the practicability and validity of the measurement method,that provide an effective means for the RV reducer production and processing technology.
- Published
- 2018
- Full Text
- View/download PDF
31. Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints
- Author
-
Chen, Jian, Zhang, Ruiyi, Zhou, Yufan, Jain, Rajiv, Xu, Zhiqiang, Rossi, Ryan, and Chen, Changyou
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
Controllable layout generation refers to the process of creating a plausible visual arrangement of elements within a graphic design (e.g., document and web designs) with constraints representing design intentions. Although recent diffusion-based models have achieved state-of-the-art FID scores, they tend to exhibit more pronounced misalignment compared to earlier transformer-based models. In this work, we propose the $\textbf{LA}$yout $\textbf{C}$onstraint diffusion mod$\textbf{E}$l (LACE), a unified model to handle a broad range of layout generation tasks, such as arranging elements with specified attributes and refining or completing a coarse layout design. The model is based on continuous diffusion models. Compared with existing methods that use discrete diffusion models, continuous state-space design can enable the incorporation of differentiable aesthetic constraint functions in training. For conditional generation, we introduce conditions via masked input. Extensive experiment results show that LACE produces high-quality layouts and outperforms existing state-of-the-art baselines., Comment: Accepted by ICLR 2024
- Published
- 2024
32. Learning Time-aware Graph Structures for Spatially Correlated Time Series Forecasting
- Author
-
Ma, Minbo, Hu, Jilin, Jensen, Christian S., Teng, Fei, Han, Peng, Xu, Zhiqiang, and Li, Tianrui
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Spatio-temporal forecasting of future values of spatially correlated time series is important across many cyber-physical systems (CPS). Recent studies offer evidence that the use of graph neural networks to capture latent correlations between time series holds a potential for enhanced forecasting. However, most existing methods rely on pre-defined or self-learning graphs, which are either static or unintentionally dynamic, and thus cannot model the time-varying correlations that exhibit trends and periodicities caused by the regularity of the underlying processes in CPS. To tackle such limitation, we propose Time-aware Graph Structure Learning (TagSL), which extracts time-aware correlations among time series by measuring the interaction of node and time representations in high-dimensional spaces. Notably, we introduce time discrepancy learning that utilizes contrastive learning with distance-based regularization terms to constrain learned spatial correlations to a trend sequence. Additionally, we propose a periodic discriminant function to enable the capture of periodic changes from the state of nodes. Next, we present a Graph Convolution-based Gated Recurrent Unit (GCGRU) that jointly captures spatial and temporal dependencies while learning time-aware and node-specific patterns. Finally, we introduce a unified framework named Time-aware Graph Convolutional Recurrent Network (TGCRN), combining TagSL, and GCGRU in an encoder-decoder architecture for multi-step spatio-temporal forecasting. We report on experiments with TGCRN and popular existing approaches on five real-world datasets, thus providing evidence that TGCRN is capable of advancing the state-of-the-art. We also cover a detailed ablation study and visualization analysis, offering detailed insight into the effectiveness of time-aware structure learning., Comment: published in ICDE 2024
- Published
- 2023
33. Interlacing Polynomial Method for Matrix Approximation via Generalized Column and Row Selection
- Author
-
Cai, Jian-Feng, Xu, Zhiqiang, and Xu, Zili
- Subjects
Mathematics - Functional Analysis ,Mathematics - Combinatorics ,Mathematics - Operator Algebras - Abstract
This paper delves into the spectral norm aspect of the Generalized Column and Row Subset Selection (GCRSS) problem. Given a target matrix $\mathbf{A}$, the objective of GCRSS is to select a column submatrix $\mathbf{B}_{:,S}$ from the source matrix $\mathbf{B}$ and a row submatrix $\mathbf{C}_{R,:}$ from the source maitrx $\mathbf{C}$, with the aim of minimizing the spectral norm of the residual matrix $(\mathbf{I}_n-\mathbf{B}_{:,S}\mathbf{B}_{:,S}^{\dagger})\mathbf{A}(\mathbf{I}_d-\mathbf{C}_{R,:}^{\dagger} \mathbf{C}_{R,:})$. By employing the interlacing polynomials method, we show that the largest root of the expected characteristic polynomial of the residual matrix serves as an upper bound on the smallest spectral norm of the residual matrix. We estimate this root for two specific GCRSS scenarios, one where $r=0$, simplifying the problem to the Generalized Column Subset Selection (GCSS) problem, and the other where $\mathbf{B}=\mathbf{C}=\mathbf{I}_d$, reducing the problem to the submatrix selection problem. In the GCSS scenario, we connect the expected characteristic polynomials to the convolution of multi-affine polynomials, leading to the derivation of the first provable reconstruction bound on the spectral norm of the residual matrix for the GCSS problem. In the submatrix selection scenario, we show that for any sufficiently small $\varepsilon>0$ and any square matrix $\mathbf{A}\in\mathbb{R}^{d\times d}$, there exist two subsets $S\subset [d]$ and $R\subset [d]$ of sizes $O(d\cdot \varepsilon^2)$ such that $\Vert\mathbf{A}_{S,R}\Vert_2\leq \varepsilon\cdot \Vert\mathbf{A}\Vert_2$. Unlike previous studies that have produced comparable results for very special cases where the matrix is either a zero-diagonal or a positive semidefinite matrix, our results apply universally to any matrix $\mathbf{A}$.
- Published
- 2023
34. Behavior Optimized Image Generation
- Author
-
Khurana, Varun, Singla, Yaman K, Subramanian, Jayakumar, Shah, Rajiv Ratn, Chen, Changyou, Xu, Zhiqiang, and Krishnamurthy, Balaji
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Computation and Language - Abstract
The last few years have witnessed great success on image generation, which has crossed the acceptance thresholds of aesthetics, making it directly applicable to personal and commercial applications. However, images, especially in marketing and advertising applications, are often created as a means to an end as opposed to just aesthetic concerns. The goal can be increasing sales, getting more clicks, likes, or image sales (in the case of stock businesses). Therefore, the generated images need to perform well on these key performance indicators (KPIs), in addition to being aesthetically good. In this paper, we make the first endeavor to answer the question of "How can one infuse the knowledge of the end-goal within the image generation process itself to create not just better-looking images but also "better-performing'' images?''. We propose BoigLLM, an LLM that understands both image content and user behavior. BoigLLM knows how an image should look to get a certain required KPI. We show that BoigLLM outperforms 13x larger models such as GPT-3.5 and GPT-4 in this task, demonstrating that while these state-of-the-art models can understand images, they lack information on how these images perform in the real world. To generate actual pixels of behavior-conditioned images, we train a diffusion-based model (BoigSD) to align with a proposed BoigLLM-defined reward. We show the performance of the overall pipeline on two datasets covering two different behaviors: a stock dataset with the number of forward actions as the KPI and a dataset containing tweets with the total likes as the KPI, denoted as BoigBench. To advance research in the direction of utility-driven image generation and understanding, we release BoigBench, a benchmark dataset containing 168 million enterprise tweets with their media, brand account names, time of post, and total likes.
- Published
- 2023
35. Transparent ultrasonic transducers based on relaxor ferroelectric crystals for advanced photoacoustic imaging
- Author
-
Qiu, Chaorui, Zhang, Zhiqiang, Xu, Zhiqiang, Qiao, Liao, Ning, Li, Zhang, Shujun, Su, Min, Wu, Weichang, Song, Kexin, Xu, Zhuo, Chen, Long-Qing, Zheng, Hairong, Liu, Chengbo, Qiu, Weibao, and Li, Fei
- Published
- 2024
- Full Text
- View/download PDF
36. Procyanidin C1 ameliorates acidic pH stress induced nucleus pulposus degeneration through SIRT3/FOXO3-mediated mitochondrial dynamics
- Author
-
Hua, Wenbin, Xie, Lin, Dong, Chenpeng, Yang, Guoyu, Chi, Shouyuan, Xu, Zhiqiang, Yang, Cao, Wang, Huiwen, and Wu, Xinghuo
- Published
- 2024
- Full Text
- View/download PDF
37. Clinical, pathological and genetic characteristics of 17 unrelated children with Alagille Syndrome
- Author
-
Yan, Jianguo, Huang, Yuanzhi, Cao, Lili, Dong, Yi, Xu, Zhiqiang, Wang, Fuchuan, Gao, Yinjie, Feng, Danni, and Zhang, Min
- Published
- 2024
- Full Text
- View/download PDF
38. Comparison of the repeatability and reproducibility of corneal thickness mapping using optical coherence tomography according to tear film break-up time
- Author
-
Lin, Kan, Xu, Zhiqiang, Wang, Hui, Wang, Yuzhou, Wei, Linzhi, Ma, Hongqing, Zhao, Jian, Lu, Fan, and Hu, Liang
- Published
- 2024
- Full Text
- View/download PDF
39. Annotations of four high-quality indigenous chicken genomes identify more than one thousand missing genes in subtelomeric regions and micro-chromosomes with high G/C contents
- Author
-
Wu, Siwen, Dou, Tengfei, Yuan, Sisi, Yan, Shixiong, Xu, Zhiqiang, Liu, Yong, Jian, Zonghui, Zhao, Jingying, Zhao, Rouhan, Zi, Xiannian, Gu, Dahai, Liu, Lixian, Li, Qihua, Wu, Dong-Dong, Jia, Junjing, Ge, Changrong, Su, Zhengchang, and Wang, Kun
- Published
- 2024
- Full Text
- View/download PDF
40. Artificial selection footprints in indigenous and commercial chicken genomes
- Author
-
Wu, Siwen, Dou, Tengfei, Wang, Kun, Yuan, Sisi, Yan, Shixiong, Xu, Zhiqiang, Liu, Yong, Jian, Zonghui, Zhao, Jingying, Zhao, Rouhan, Wu, Hao, Gu, Dahai, Liu, Lixian, Li, Qihua, Wu, Dong-Dong, Ge, Changrong, Su, Zhengchang, and Jia, Junjing
- Published
- 2024
- Full Text
- View/download PDF
41. Clinical and genetic study of ABCB4 gene-related cholestatic liver disease in China: children and adults
- Author
-
Cao, Lili, Ling, Xiuxin, Yan, Jianguo, Feng, Danni, Dong, Yi, Xu, Zhiqiang, Wang, Fuchuan, Zhu, Shishu, Gao, Yinjie, Cao, Zhenhua, and Zhang, Min
- Published
- 2024
- Full Text
- View/download PDF
42. Author Correction: High quality assemblies of four indigenous chicken genomes and related functional data resources
- Author
-
Wu, Siwen, Wang, Kun, Dou, Tengfei, Yuan, Sisi, Yan, Shixiong, Xu, Zhiqiang, Liu, Yong, Jian, Zonghui, Zhao, Jingying, Zhao, Rouhan, Zi, Xiannian, Gu, Dahai, Liu, Lixian, Li, Qihua, Wu, Dong-Dong, Jia, Junjing, Su, Zhengchang, and Ge, Changrong
- Published
- 2024
- Full Text
- View/download PDF
43. High quality assemblies of four indigenous chicken genomes and related functional data resources
- Author
-
Wu, Siwen, Wang, Kun, Dou, Tengfei, Yuan, Sisi, Yan, Shixiong, Xu, Zhiqiang, Liu, Yong, Jian, Zonghui, Zhao, Jingying, Zhao, Rouhan, Zi, Xiannian, Gu, Dahai, Liu, Lixian, Li, Qihua, Wu, Dong-Dong, Jia, Junjing, Su, Zhengchang, and Ge, Changrong
- Published
- 2024
- Full Text
- View/download PDF
44. Study on the high precision frequency measurement algorithm in the mine intelligent power supply system
- Author
-
Yuan Shiliang, Xu Zhiqiang, Dong Jie, and Zhu Qichen
- Subjects
power supply system ,frequency ,coal mine ,intelligent mine ,zero crossing ,rate of frequency variation ,Engineering geology. Rock mechanics. Soil mechanics. Underground construction ,TA703-712 ,Mining engineering. Metallurgy ,TN1-997 - Abstract
To solve the problem of the error of frequency measurement algorithm with time-varying frequency in mine power supply system, an improved software zero-crossing algorithm is proposed.The relationship among the phase angle, the time and the frequency of the voltage of the system is established.The relation formula between the real-time frequency and the period of zero-crossing point is derived when rate of frequency variation is not zero according to the relationship between two adjacent with the direction of the zero-crossing phase difference of 2π.The exact calculation formula of the period is obtained by using the Newton three interpolation polynomial, the Newton iteration method and the Qin Jiushao algorithm.Hence, the frequency and rate of frequency variation can be obtained.A variety of situations , including signals without harmonics, signals with harmonics and random noise, as well as the frequency rapid decline process in system failure, are simulated using MATLAB, The simulation results show that the algorithm of this paper can accurately measure the real-time frequency of the system and rate of frequency variation in above situations, and this method is superior to conventional zero-crossing algorithm in measurement accuracy, instantaneity and computation.This algorithm is suitable for application of protection, control and monitoring devices and automation equipment in mine intelligent power supply system.
- Published
- 2016
45. SR-R$^2$KAC: Improving Single Image Defocus Deblurring
- Author
-
Tang, Peng, Xu, Zhiqiang, Wei, Pengfei, Hu, Xiaobin, Zhao, Peilin, Cao, Xin, Zhou, Chunlai, and Lasser, Tobias
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
We propose an efficient deep learning method for single image defocus deblurring (SIDD) by further exploring inverse kernel properties. Although the current inverse kernel method, i.e., kernel-sharing parallel atrous convolution (KPAC), can address spatially varying defocus blurs, it has difficulty in handling large blurs of this kind. To tackle this issue, we propose a Residual and Recursive Kernel-sharing Atrous Convolution (R$^2$KAC). R$^2$KAC builds on a significant observation of inverse kernels, that is, successive use of inverse-kernel-based deconvolutions with fixed size helps remove unexpected large blurs but produces ringing artifacts. Specifically, on top of kernel-sharing atrous convolutions used to simulate multi-scale inverse kernels, R$^2$KAC applies atrous convolutions recursively to simulate a large inverse kernel. Specifically, on top of kernel-sharing atrous convolutions, R$^2$KAC stacks atrous convolutions recursively to simulate a large inverse kernel. To further alleviate the contingent effect of recursive stacking, i.e., ringing artifacts, we add identity shortcuts between atrous convolutions to simulate residual deconvolutions. Lastly, a scale recurrent module is embedded in the R$^2$KAC network, leading to SR-R$^2$KAC, so that multi-scale information from coarse to fine is exploited to progressively remove the spatially varying defocus blurs. Extensive experimental results show that our method achieves the state-of-the-art performance., Comment: Submitted to IEEE Transactions on Cybernetics on 2023-July-24
- Published
- 2023
46. Functional cure is associated with younger age in children undergoing antiviral treatment for active chronic hepatitis B
- Author
-
Zhang, Min, Li, Jing, Xu, Zhiqiang, Fan, Peiyao, Dong, Yi, Wang, Fuchuan, Gao, Yinjie, Yan, Jianguo, Cao, Lili, Ji, Dong, Feng, Danni, Zhong, Yanwei, Zhang, Yang, Hong, Weiguo, Zhang, Chao, and Wang, Fu-Sheng
- Published
- 2024
- Full Text
- View/download PDF
47. A Cover Time Study of a non-Markovian Algorithm
- Author
-
Fang, Guanhua, Samorodnitsky, Gennady, and Xu, Zhiqiang
- Subjects
Computer Science - Data Structures and Algorithms ,Computer Science - Machine Learning ,Mathematics - Statistics Theory - Abstract
Given a traversal algorithm, cover time is the expected number of steps needed to visit all nodes in a given graph. A smaller cover time means a higher exploration efficiency of traversal algorithm. Although random walk algorithms have been studied extensively in the existing literature, there has been no cover time result for any non-Markovian method. In this work, we stand on a theoretical perspective and show that the negative feedback strategy (a count-based exploration method) is better than the naive random walk search. In particular, the former strategy can locally improve the search efficiency for an arbitrary graph. It also achieves smaller cover times for special but important graphs, including clique graphs, tree graphs, etc. Moreover, we make connections between our results and reinforcement learning literature to give new insights on why classical UCB and MCTS algorithms are so useful. Various numerical results corroborate our theoretical findings., Comment: 25 pages
- Published
- 2023
48. Label-Retrieval-Augmented Diffusion Models for Learning from Noisy Labels
- Author
-
Chen, Jian, Zhang, Ruiyi, Yu, Tong, Sharma, Rohan, Xu, Zhiqiang, Sun, Tong, and Chen, Changyou
- Subjects
Computer Science - Machine Learning ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Learning from noisy labels is an important and long-standing problem in machine learning for real applications. One of the main research lines focuses on learning a label corrector to purify potential noisy labels. However, these methods typically rely on strict assumptions and are limited to certain types of label noise. In this paper, we reformulate the label-noise problem from a generative-model perspective, $\textit{i.e.}$, labels are generated by gradually refining an initial random guess. This new perspective immediately enables existing powerful diffusion models to seamlessly learn the stochastic generative process. Once the generative uncertainty is modeled, we can perform classification inference using maximum likelihood estimation of labels. To mitigate the impact of noisy labels, we propose the $\textbf{L}$abel-$\textbf{R}$etrieval-$\textbf{A}$ugmented (LRA) diffusion model, which leverages neighbor consistency to effectively construct pseudo-clean labels for diffusion training. Our model is flexible and general, allowing easy incorporation of different types of conditional information, $\textit{e.g.}$, use of pre-trained models, to further boost model performance. Extensive experiments are conducted for evaluation. Our model achieves new state-of-the-art (SOTA) results on all the standard real-world benchmark datasets. Remarkably, by incorporating conditional information from the powerful CLIP model, our method can boost the current SOTA accuracy by 10-20 absolute points in many cases., Comment: Accepted by NeurIPS 2023
- Published
- 2023
49. Cell-Free Supernatant of Bacillus subtilis G2B9-Q Improves Intestinal Health and Modulates Immune Response to Promote Mouse Recovery in Clostridium perfringens Infection
- Author
-
Xu, Zhiqiang, Feng, Xin, Song, Zhanyun, Li, Xiang, Li, Ke, Li, Mengjiao, Wang, Xianghui, Liu, Bo, and Sun, Changjiang
- Published
- 2024
- Full Text
- View/download PDF
50. Odor Diffusion Control by Nanoparticles Filled Long-Chain Branched Polylactic Acid Film: Mechanism and Application Potential Research
- Author
-
Feng, Meiyun, Xu, Zhiqiang, Lin, Kuangfei, and Zhang, Meng
- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.