16,412 results on '"Wu, Qi"'
Search Results
202. Algorithm for Human Abnormal Behavior Recognition Based on Improved Spatial Temporal Graph Convolutional Networks
- Author
-
Wu, Qi, Zhao, Xiaoyan, Zhang, Zhaohui, Zhang, Tianyao, Peng, Zexuan, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Xin, Bin, editor, Kubota, Naoyuki, editor, Chen, Kewei, editor, and Dong, Fangyan, editor
- Published
- 2024
- Full Text
- View/download PDF
203. Structural Health Monitoring of Similar Gantry Crane Based on Federated Learning Algorithm
- Author
-
Peng, Zexuan, Zhang, Zhaohui, Zhao, Xiaoyan, Zhang, Tianyao, Wu, Qi, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Xin, Bin, editor, Kubota, Naoyuki, editor, Chen, Kewei, editor, and Dong, Fangyan, editor
- Published
- 2024
- Full Text
- View/download PDF
204. BHSD: A 3D Multi-class Brain Hemorrhage Segmentation Dataset
- Author
-
Wu, Biao, Xie, Yutong, Zhang, Zeyu, Ge, Jinchao, Yaxley, Kaspar, Bahadir, Suzan, Wu, Qi, Liu, Yifan, To, Minh-Son, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Cao, Xiaohuan, editor, Xu, Xuanang, editor, Rekik, Islem, editor, Cui, Zhiming, editor, and Ouyang, Xi, editor
- Published
- 2024
- Full Text
- View/download PDF
205. Multi-modal Adapter for Medical Vision-and-Language Learning
- Author
-
Yu, Zheng, Qiao, Yanyuan, Xie, Yutong, Wu, Qi, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Cao, Xiaohuan, editor, Xu, Xuanang, editor, Rekik, Islem, editor, Cui, Zhiming, editor, and Ouyang, Xi, editor
- Published
- 2024
- Full Text
- View/download PDF
206. Self-prompting Large Vision Models for Few-Shot Medical Image Segmentation
- Author
-
Wu, Qi, Zhang, Yuyao, Elbatel, Marawan, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Koch, Lisa, editor, Cardoso, M. Jorge, editor, Ferrante, Enzo, editor, Kamnitsas, Konstantinos, editor, Islam, Mobarakol, editor, Jiang, Meirui, editor, Rieke, Nicola, editor, Tsaftaris, Sotirios A., editor, and Yang, Dong, editor
- Published
- 2024
- Full Text
- View/download PDF
207. New Trends and Technologies in Facilities Management in China: The Dual Impact of Environmental Protection and Digitalisation
- Author
-
Wu, Qi, Appolloni, Andrea, Series Editor, Caracciolo, Francesco, Series Editor, Ding, Zhuoqi, Series Editor, Gogas, Periklis, Series Editor, Huang, Gordon, Series Editor, Nartea, Gilbert, Series Editor, Ngo, Thanh, Series Editor, Striełkowski, Wadim, Series Editor, Zailani, Suhaiza Hanim Binti Dato Mohamad, editor, Yagapparaj, Kosga, editor, and Zakuan, Norhayati, editor
- Published
- 2024
- Full Text
- View/download PDF
208. Decorr: Environment Partitioning for Invariant Learning and OOD Generalization
- Author
-
Liao, Yufan, Wu, Qi, Wu, Zhaodi, and Yan, Xing
- Subjects
Computer Science - Machine Learning - Abstract
Invariant learning methods, aimed at identifying a consistent predictor across multiple environments, are gaining prominence in out-of-distribution (OOD) generalization. Yet, when environments aren't inherent in the data, practitioners must define them manually. This environment partitioning--algorithmically segmenting the training dataset into environments--crucially affects invariant learning's efficacy but remains underdiscussed. Proper environment partitioning could broaden the applicability of invariant learning and enhance its performance. In this paper, we suggest partitioning the dataset into several environments by isolating low-correlation data subsets. Through experiments with synthetic and real data, our Decorr method demonstrates superior performance in combination with invariant learning. Decorr mitigates the issue of spurious correlations, aids in identifying stable predictors, and broadens the applicability of invariant learning methods.
- Published
- 2022
209. Beyond ExaBricks: GPU Volume Path Tracing of AMR Data
- Author
-
Zellmann, Stefan, Wu, Qi, Sahistan, Alper, Ma, Kwan-Liu, and Wald, Ingo
- Subjects
Computer Science - Graphics - Abstract
Adaptive Mesh Refinement (AMR) is becoming a prevalent data representation for scientific visualization. Resulting from large fluid mechanics simulations, the data is usually cell centric, imposing a number of challenges for high quality reconstruction at sample positions. While recent work has concentrated on real-time volume and isosurface rendering on GPUs, the rendering methods used still focus on simple lighting models without scattering events and global illumination. As in other areas of rendering, key to real-time performance are acceleration data structures; in this work we analyze the major bottlenecks of data structures that were originally optimized for camera/primary ray traversal when used with the incoherent ray tracing workload of a volumetric path tracer, and propose strategies to overcome the challenges coming with this.
- Published
- 2022
210. Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report
- Author
-
Ignatov, Andrey, Timofte, Radu, Denna, Maurizio, Younes, Abdel, Gankhuyag, Ganzorig, Huh, Jingang, Kim, Myeong Kyun, Yoon, Kihwan, Moon, Hyeon-Cheol, Lee, Seungho, Choe, Yoonsik, Jeong, Jinwoo, Kim, Sungjei, Smyl, Maciej, Latkowski, Tomasz, Kubik, Pawel, Sokolski, Michal, Ma, Yujie, Chao, Jiahao, Zhou, Zhou, Gao, Hongfan, Yang, Zhengfeng, Zeng, Zhenbing, Zhuge, Zhengyang, Li, Chenghua, Zhu, Dan, Sun, Mengdi, Duan, Ran, Gao, Yan, Kong, Lingshun, Sun, Long, Li, Xiang, Zhang, Xingdong, Zhang, Jiawei, Wu, Yaqi, Pan, Jinshan, Yu, Gaocheng, Zhang, Jin, Zhang, Feng, Ma, Zhe, Wang, Hongbin, Cho, Hojin, Kim, Steve, Li, Huaen, Ma, Yanbo, Luo, Ziwei, Li, Youwei, Yu, Lei, Wen, Zhihong, Wu, Qi, Fan, Haoqiang, Liu, Shuaicheng, Zhang, Lize, Zong, Zhikai, Kwon, Jeremy, Zhang, Junxi, Li, Mengyuan, Fu, Nianxiang, Ding, Guanchen, Zhu, Han, Chen, Zhenzhong, Li, Gen, Zhang, Yuanfan, Sun, Lei, Zhang, Dafeng, Yang, Neo, Liu, Fitz, Zhao, Jerry, Ayazoglu, Mustafa, Bilecen, Bahri Batuhan, Hirose, Shota, Arunruangsirilert, Kasidis, Ao, Luo, Leung, Ho Chun, Wei, Andrew, Liu, Jie, Liu, Qiang, Yu, Dahai, Li, Ao, Luo, Lei, Zhu, Ce, Hong, Seongmin, Park, Dongwon, Lee, Joonhee, Lee, Byeong Hyun, Lee, Seunggyu, Chun, Se Young, He, Ruiyuan, Jiang, Xuhao, Ruan, Haihang, Zhang, Xinjian, Liu, Jing, Gendy, Garas, Sabor, Nabil, Hou, Jingchao, and He, Guanghui
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Image super-resolution is a common task on mobile and IoT devices, where one often needs to upscale and enhance low-resolution images and video frames. While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints. In this Mobile AI challenge, we address this problem and propose the participants to design an efficient quantized image super-resolution solution that can demonstrate a real-time performance on mobile NPUs. The participants were provided with the DIV2K dataset and trained INT8 models to do a high-quality 3X image upscaling. The runtime of all models was evaluated on the Synaptics VS680 Smart Home board with a dedicated edge NPU capable of accelerating quantized neural networks. All proposed solutions are fully compatible with the above NPU, demonstrating an up to 60 FPS rate when reconstructing Full HD resolution images. A detailed description of all models developed in the challenge is provided in this paper., Comment: arXiv admin note: text overlap with arXiv:2105.07825, arXiv:2105.08826, arXiv:2211.04470, arXiv:2211.03885, arXiv:2211.05256
- Published
- 2022
211. Investigations on the light hadron decays of $Z_b(10610)$ and $Z_b(10650)$
- Author
-
Wu, Qi, Zheng, Yuanxin, Liu, Shidong, and Li, Gang
- Subjects
High Energy Physics - Phenomenology - Abstract
The light hadron decay processes of $Z_b(10610)/Z_b(10650)$ provide us a way to study their nature and decay mechanism. In this work, we evaluate the branching ratios of $Z_b(10610)/Z_b(10650) \to VP$ ($V$ and $P$ stand for light vector and pseudoscalar mesons, respectively) using an effective Lagrangian approach, in which the contributions of intermediate bottomed meson triangle loops are considered. In our calculations, the $Z_b(10610)$ and $Z_b(10650)$ are regarded as $B\bar{B}^*+c.c.$ and $B^*\bar{B}^*$ molecular states, respectively. The predicted branching ratios of $Z_b(10610)\rightarrow VP$ are about in the order of $10^{-2}$, while the branching ratios of $Z_b(10650)\rightarrow VP$ are in the order of $10^{-3}$. Furthermore, the dependence of these ratios between different decay modes of $Z_b(10610)/Z_b(10650)$ on the mixing $\eta-\eta^\prime$ angle $\theta_P$ is investigated, which may be a good quantity for the experiments. It is hoped that the calculations here could be tested by future experiments., Comment: 7 pages, 8 figures and 1 table
- Published
- 2022
- Full Text
- View/download PDF
212. Investigations on the charmless decays of $X(3872)$ in intermediate meson loops model
- Author
-
Wang, Yan, Wu, Qi, Li, Gang, Qin, Wen-Hua, Liu, Xiao-Hai, An, Chun-Sheng, and Xie, Ju-Jun
- Subjects
High Energy Physics - Phenomenology ,High Energy Physics - Experiment - Abstract
The charmless decay processes of $X(3872)$ provide us a good platform to study the nature and the decay mechanism of $X(3872)$. Based on a molecular nature of $X(3872)$ as a $\bar{D}D^*$ bound state, we have investigated the charmless decays $X(3872) \to VV$ and $VP$ via intermediate $D^*{\bar D} +c.c.$ meson loops, where $V$ and $P$ stand for light vector and pseudoscalar mesons, respectively. We discuss three cases, i.e., pure neutral components ($\theta=0$), isospin singlet ($\theta=\pi/4$) and neutral components dominant ($\theta = \pi/6$), where $\theta$ is a phase angle describing the proportion of neutral and charged constituents. The proportion of neutral and charged constituent have an influence on the decay widths of $X(3872) \to VV$ and $VP$. With the coupling constant of $X(3872)$ to the $\bar{D}D^*$ channel obtained under the molecule ansatz of $X(3872)$ resonance, the predicted decay widths of $X(3872)\rightarrow VV$ are about tens of keVs, while the decay width can reach a few hundreds of keVs for $X(3872)\to VP$. The dependence of these ratios between different decay modes of $X(3872)\to VV$ and $X(3872)\to VP$ to the mixing angle $\theta$ is also investigated. It is expected that the theoretical calculations here can be tested by future experiments., Comment: 9 pages, 8 figures, submitted to Phys. Rev. D
- Published
- 2022
213. Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering
- Author
-
Li, Hao, Huang, Jinfa, Jin, Peng, Song, Guoli, Wu, Qi, and Chen, Jie
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Multimedia - Abstract
Text-based Visual Question Answering~(TextVQA) aims to produce correct answers for given questions about the images with multiple scene texts. In most cases, the texts naturally attach to the surface of the objects. Therefore, spatial reasoning between texts and objects is crucial in TextVQA. However, existing approaches are constrained within 2D spatial information learned from the input images and rely on transformer-based architectures to reason implicitly during the fusion process. Under this setting, these 2D spatial reasoning approaches cannot distinguish the fine-grain spatial relations between visual objects and scene texts on the same image plane, thereby impairing the interpretability and performance of TextVQA models. In this paper, we introduce 3D geometric information into a human-like spatial reasoning process to capture the contextual knowledge of key objects step-by-step. %we formulate a human-like spatial reasoning process by introducing 3D geometric information for capturing key objects' contextual knowledge. To enhance the model's understanding of 3D spatial relationships, Specifically, (i)~we propose a relation prediction module for accurately locating the region of interest of critical objects; (ii)~we design a depth-aware attention calibration module for calibrating the OCR tokens' attention according to critical objects. Extensive experiments show that our method achieves state-of-the-art performance on TextVQA and ST-VQA datasets. More encouragingly, our model surpasses others by clear margins of 5.7\% and 12.1\% on questions that involve spatial reasoning in TextVQA and ST-VQA valid split. Besides, we also verify the generalizability of our model on the text-based image captioning task., Comment: Accepted by TIP2023, The Arxiv version of "Weakly-Supervised 3D Spatial Reasoning for Text-based Visual Question Answering"
- Published
- 2022
- Full Text
- View/download PDF
214. Coupling of optical phonons with Kondo effect and magnetic orders in antiferromagnetic Kondo lattice CeAuSb$_2$
- Author
-
Zhao, Yin-Zou, Wu, Qi-Yi, Zhang, Chen, Chen, Bo, Xia, Wei, Song, Jiao-Jiao, Yuan, Ya-Hua, Liu, Hao, Wu, Fan-Ying, Ye, Xue-Qing, Zhang, Hong-Yi, Huang, Han, Liu, Hai-Yun, Duan, Yu-Xia, Guo, Yan-Feng, He, Jun, and Meng, Jian-Qiao
- Subjects
Condensed Matter - Strongly Correlated Electrons - Abstract
Ultrafast optical spectroscopy was used to investigate the ultrafast quasiparticle dynamics of antiferromagnetic Kondo lattice CeAuSb$_2$ as a function of temperature and fluence. Our results reveal (i) the opening of a narrow hybridization gap ($\Delta$ $\sim$ 4.5 meV) near the Fermi level below the coherence temperature $T^*$ $\approx$ 100 K, (ii) the coupling of coherent phonon modes with Kondo effect and magnetic orders, leading to the frequencies anomaly at the characteristic temperatures, and (iii) a possible photoinduced nonthermal phase transition. Our observations thus shed light on the hybridization dynamics and magnetic orders in heavy fermion systems., Comment: 6 pages with 4 figures
- Published
- 2022
- Full Text
- View/download PDF
215. FoVolNet: Fast Volume Rendering using Foveated Deep Neural Networks
- Author
-
Bauer, David, Wu, Qi, and Ma, Kwan-Liu
- Subjects
Computer Science - Graphics ,Computer Science - Machine Learning - Abstract
Volume data is found in many important scientific and engineering applications. Rendering this data for visualization at high quality and interactive rates for demanding applications such as virtual reality is still not easily achievable even using professional-grade hardware. We introduce FoVolNet -- a method to significantly increase the performance of volume data visualization. We develop a cost-effective foveated rendering pipeline that sparsely samples a volume around a focal point and reconstructs the full-frame using a deep neural network. Foveated rendering is a technique that prioritizes rendering computations around the user's focal point. This approach leverages properties of the human visual system, thereby saving computational resources when rendering data in the periphery of the user's field of vision. Our reconstruction network combines direct and kernel prediction methods to produce fast, stable, and perceptually convincing output. With a slim design and the use of quantization, our method outperforms state-of-the-art neural reconstruction techniques in both end-to-end frame times and visual quality. We conduct extensive evaluations of the system's rendering performance, inference speed, and perceptual properties, and we provide comparisons to competing neural image reconstruction techniques. Our test results show that FoVolNet consistently achieves significant time saving over conventional rendering while preserving perceptual quality., Comment: To appear at IEEE VIS 2022 and later TVCG
- Published
- 2022
216. Learning Distinct and Representative Styles for Image Captioning
- Author
-
Chen, Qi, Deng, Chaorui, and Wu, Qi
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Over the years, state-of-the-art (SoTA) image captioning methods have achieved promising results on some evaluation metrics (e.g., CIDEr). However, recent findings show that the captions generated by these methods tend to be biased toward the "average" caption that only captures the most general mode (a.k.a, language pattern) in the training corpus, i.e., the so-called mode collapse problem. Affected by it, the generated captions are limited in diversity and usually less informative than natural image descriptions made by humans. In this paper, we seek to avoid this problem by proposing a Discrete Mode Learning (DML) paradigm for image captioning. Our innovative idea is to explore the rich modes in the training caption corpus to learn a set of "mode embeddings", and further use them to control the mode of the generated captions for existing image captioning models. Specifically, the proposed DML optimizes a dual architecture that consists of an image-conditioned discrete variational autoencoder (CdVAE) branch and a mode-conditioned image captioning (MIC) branch. The CdVAE branch maps each image caption to one of the mode embeddings stored in a learned codebook, and is trained with a pure non-autoregressive generation objective to make the modes distinct and representative. The MIC branch can be simply modified from an existing image captioning model, where the mode embedding is added to the original word embeddings as the control signal. In the experiments, we apply the proposed DML to two widely used image captioning models, Transformer and AoANet. The results show that the learned mode embedding successfully facilitates these models to generate high-quality image captions with different modes, further leading to better performance for both diversity and quality on the MSCOCO dataset., Comment: NeurIPS 2022
- Published
- 2022
217. A PeVatron Candidate: Modelling the Boomerang Nebula in X-ray Band
- Author
-
Liang, Xuan-Han, Li, Chao-Ming, Wu, Qi-Zuo, Pan, Jia-Shu, and Liu, Ruo-Yu
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
Pulsar wind nebula (PWN) Boomerang and the associated supernova remnant (SNR) G106.3+2.7 are among candidates for the ultra-high-energy (UHE) gamma-ray counterparts published by LHAASO. Although the centroid of the extended source, LHAASO J2226+6057, deviates from the pulsar's position by about $0.3^\circ$, the source partially covers the PWN. Therefore, we cannot totally exclude the possibility that a part of the UHE emission comes from the PWN. Previous studies mainly focus on whether the SNR is a PeVatron, while neglecting the energetic PWN. Here, we explore the possibility of the Boomerang Nebula being a PeVatron candidate by studying its X-ray radiation. By modelling the diffusion of relativistic electrons injected in the PWN, we fit the radial profiles of the X-ray surface brightness and the photon index. The solution with a magnetic field $B=140\mu$G can well reproduce the observed profiles and implies a severe suppression of IC scattering of electrons. Therefore, a proton component need be introduced to account for the UHE emission, in light of recent LHAASO's measurement on Crab Nebula, if future observations reveal part of the UHE emission originating from the PWN. In this sense, Boomerang Nebula would be a hadronic PeVatron., Comment: 15 pages, 5 figures, 5 tables; Invited contribution to Special Issue of Universe "Advances in Astrophysics and Cosmology in China - in Memory of Prof. Tan Lu"
- Published
- 2022
218. Quantum state discrimination in a PT-symmetric system
- Author
-
Chen, Dong-Xu, Zhang, Yu, Zhao, Jun-Long, Wu, Qi-Cheng, Fang, Yu-Liang, Yang, Chui-Ping, and Nori, Franco
- Subjects
Quantum Physics - Abstract
Nonorthogonal quantum state discrimination (QSD) plays an important role in quantum information and quantum communication. In addition, compared to Hermitian quantum systems, parity-time-($\mathcal{PT}$-)symmetric non-Hermitian quantum systems exhibit novel phenomena and have attracted considerable attention. Here, we experimentally demonstrate QSD in a $\mathcal{PT}$-symmetric system (i.e., $\mathcal{PT}$-symmetric QSD), by having quantum states evolve under a $\mathcal{PT}$-symmetric Hamiltonian in a lossy linear optical setup. We observe that two initially nonorthogonal states can rapidly evolve into orthogonal states, and the required evolution time can even be vanishing provided the matrix elements of the Hamiltonian become sufficiently large. We also observe that the cost of such a discrimination is a dissipation of quantum states into the environment. Furthermore, by comparing $\mathcal{PT}$-symmetric QSD with optimal strategies in Hermitian systems, we find that at the critical value, $\mathcal{PT}$-symmetric QSD is equivalent to the optimal unambiguous state discrimination in Hermitian systems. We also extend the $\mathcal{PT}$-symmetric QSD to the case of discriminating three nonorthogonal states. The QSD in a $\mathcal{PT}$-symmetric system opens a new door for quantum state discrimination, which has important applications in quantum computing, quantum cryptography, and quantum communication., Comment: 12 pages, 9 figures, comments are welcome
- Published
- 2022
- Full Text
- View/download PDF
219. Experimental investigation of wave-particle duality relations in asymmetric beam interference
- Author
-
Chen, Dong-Xu, Zhang, Yu, Zhao, Jun-Long, Wu, Qi-Cheng, Fang, Yu-Liang, Yang, Chui-Ping, and Nori, Franco
- Subjects
Quantum Physics - Abstract
Wave-particle duality relations are fundamental for quantum physics. Previous experimental studies of duality relations mainly focus on the quadratic relation $D^2+V^2\leq1$, based on symmetric beam interference, while a linear form of the duality relation, predicated earlier theoretically, has never been experimentally tested. In addition, the difference between the quadratic form and the linear form has not been explored yet. In this work, with a designed asymmetric beam interference and by utilizing the polarization degree of freedom of the photon as a which-way detector, we experimentally confirm both forms of the duality relations. The results show that more path information is obtained in the quadratic case. Our findings reveal the difference between the two duality relations and have fundamental implications in better understanding these important duality relations., Comment: 9 pages, 7 figures, comments are welcome
- Published
- 2022
- Full Text
- View/download PDF
220. Moderately-Balanced Representation Learning for Treatment Effects with Orthogonality Information
- Author
-
Huang, Yiyan, Leung, Cheuk Hang, Ma, Shumin, Wu, Qi, Wang, Dongdong, and Huang, Zhixiang
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Statistics - Methodology - Abstract
Estimating the average treatment effect (ATE) from observational data is challenging due to selection bias. Existing works mainly tackle this challenge in two ways. Some researchers propose constructing a score function that satisfies the orthogonal condition, which guarantees that the established ATE estimator is "orthogonal" to be more robust. The others explore representation learning models to achieve a balanced representation between the treated and the controlled groups. However, existing studies fail to 1) discriminate treated units from controlled ones in the representation space to avoid the over-balanced issue; 2) fully utilize the "orthogonality information". In this paper, we propose a moderately-balanced representation learning (MBRL) framework based on recent covariates balanced representation learning methods and orthogonal machine learning theory. This framework protects the representation from being over-balanced via multi-task learning. Simultaneously, MBRL incorporates the noise orthogonality information in the training and validation stages to achieve a better ATE estimation. The comprehensive experiments on benchmark and simulated datasets show the superiority and robustness of our method on treatment effect estimations compared with existing state-of-the-art methods., Comment: This paper was accepted and will be published at the 19th Pacific Rim International Conference on Artificial Intelligence (PRICAI2022)
- Published
- 2022
221. Robust Causal Learning for the Estimation of Average Treatment Effects
- Author
-
Huang, Yiyan, Leung, Cheuk Hang, Yan, Xing, Wu, Qi, Ma, Shumin, Yuan, Zhiri, Wang, Dongdong, and Huang, Zhixiang
- Subjects
Economics - Econometrics ,Quantitative Finance - Risk Management ,Statistics - Methodology ,Statistics - Machine Learning - Abstract
Many practical decision-making problems in economics and healthcare seek to estimate the average treatment effect (ATE) from observational data. The Double/Debiased Machine Learning (DML) is one of the prevalent methods to estimate ATE in the observational study. However, the DML estimators can suffer an error-compounding issue and even give an extreme estimate when the propensity scores are misspecified or very close to 0 or 1. Previous studies have overcome this issue through some empirical tricks such as propensity score trimming, yet none of the existing literature solves this problem from a theoretical standpoint. In this paper, we propose a Robust Causal Learning (RCL) method to offset the deficiencies of the DML estimators. Theoretically, the RCL estimators i) are as consistent and doubly robust as the DML estimators, and ii) can get rid of the error-compounding issue. Empirically, the comprehensive experiments show that i) the RCL estimators give more stable estimations of the causal parameters than the DML estimators, and ii) the RCL estimators outperform the traditional estimators and their variants when applying different machine learning models on both simulation and benchmark datasets., Comment: This paper was accepted and will be published at The 2022 International Joint Conference on Neural Networks (IJCNN2022). arXiv admin note: substantial text overlap with arXiv:2103.11869
- Published
- 2022
222. 7-Dehydrocholesterol dictates ferroptosis sensitivity
- Author
-
Li, Yaxu, Ran, Qiao, Duan, Qiuhui, Jin, Jiali, Wang, Yanjin, Yu, Lei, Wang, Chaojie, Zhu, Zhenyun, Chen, Xin, Weng, Linjun, Li, Zan, Wang, Jia, Wu, Qi, Wang, Hui, Tian, Hongling, Song, Sihui, Shan, Zezhi, Zhai, Qiwei, Qin, Huanlong, Chen, Shili, Fang, Lan, Yin, Huiyong, Zhou, Hu, Jiang, Xuejun, and Wang, Ping
- Published
- 2024
- Full Text
- View/download PDF
223. Effect of SiC concentration on the mechanical properties and fracture modes of SiC/Al composites via cluster analysis of acoustic emission signals
- Author
-
Wu, Qi and Pei, Ning
- Published
- 2024
- Full Text
- View/download PDF
224. Ultrasound evaluation of gastric emptying of high-energy semifluid solid beverage in parturients during labor at term: a randomized controlled trial
- Author
-
Ni, Xiu, Li, Jiang, Wu, Qi-Wei, Zhou, Shuang-qiong, Xu, Zhen-Dong, and Liu, Zhi-Qiang
- Published
- 2024
- Full Text
- View/download PDF
225. Liquefaction susceptibility and deformation characteristics of saturated coral sandy soils subjected to cyclic loadings – a critical review
- Author
-
Chen, Guoxing, Qin, You, Ma, Weijia, Liang, Ke, Wu, Qi, and Juang, C. Hsein
- Published
- 2024
- Full Text
- View/download PDF
226. Soil pH differently affects N2O emissions from soils amended with chemical fertilizer and manure by modifying nitrification and denitrification in wheat-maize rotation system
- Author
-
Wu, Gong, Liang, Fei, Wu, Qi, Feng, Xiao-Gang, Shang, Wen-ding, Li, Hua-wei, Li, Xiao-xiao, Che, Zhao, Dong, Zhao-rong, and Song, He
- Published
- 2024
- Full Text
- View/download PDF
227. You see what you eat: effects of spicy food on emotion perception
- Author
-
Chen, Dongfang, Zhang, Siwei, Wu, Qi, and Ren, Menghao
- Published
- 2024
- Full Text
- View/download PDF
228. ClusTR: Exploring Efficient Self-attention via Clustering for Vision Transformers
- Author
-
Xie, Yutong, Zhang, Jianpeng, Xia, Yong, Hengel, Anton van den, and Wu, Qi
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Although Transformers have successfully transitioned from their language modelling origins to image-based applications, their quadratic computational complexity remains a challenge, particularly for dense prediction. In this paper we propose a content-based sparse attention method, as an alternative to dense self-attention, aiming to reduce the computation complexity while retaining the ability to model long-range dependencies. Specifically, we cluster and then aggregate key and value tokens, as a content-based method of reducing the total token count. The resulting clustered-token sequence retains the semantic diversity of the original signal, but can be processed at a lower computational cost. Besides, we further extend the clustering-guided attention from single-scale to multi-scale, which is conducive to dense prediction tasks. We label the proposed Transformer architecture ClusTR, and demonstrate that it achieves state-of-the-art performance on various vision tasks but at lower computational cost and with fewer parameters. For instance, our ClusTR small model with 22.7M parameters achieves 83.2\% Top-1 accuracy on ImageNet. Source code and ImageNet models will be made publicly available., Comment: 14 pages
- Published
- 2022
229. Fast Nearest Convolution for Real-Time Efficient Image Super-Resolution
- Author
-
Luo, Ziwei, Li, Youwei, Yu, Lei, Wu, Qi, Wen, Zhihong, Fan, Haoqiang, and Liu, Shuaicheng
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Deep learning-based single image super-resolution (SISR) approaches have drawn much attention and achieved remarkable success on modern advanced GPUs. However, most state-of-the-art methods require a huge number of parameters, memories, and computational resources, which usually show inferior inference times when applying them to current mobile device CPUs/NPUs. In this paper, we propose a simple plain convolution network with a fast nearest convolution module (NCNet), which is NPU-friendly and can perform a reliable super-resolution in real-time. The proposed nearest convolution has the same performance as the nearest upsampling but is much faster and more suitable for Android NNAPI. Our model can be easily deployed on mobile devices with 8-bit quantization and is fully compatible with all major mobile AI accelerators. Moreover, we conduct comprehensive experiments on different tensor operations on a mobile device to illustrate the efficiency of our network architecture. Our NCNet is trained and validated on the DIV2K 3x dataset, and the comparison with other efficient SR methods demonstrated that the NCNet can achieve high fidelity SR results while using fewer inference times. Our codes and pretrained models are publicly available at \url{https://github.com/Algolzw/NCNet}., Comment: AIM & Mobile AI 2022
- Published
- 2022
230. Flipping of antiferromagnetic to superconducting states in pressurized quasi-one-dimensional manganese-based compounds
- Author
-
Long, Sijin, Chen, Long, Wang, Yuxin, Zhou, Ying, Cai, Shu, Guo, Jing, Zhou, Yazhou, Yang, Ke, Jiang, Sheng, Wu, Qi, Wang, Gang, Hu, Jiangping, and Sun, Liling
- Subjects
Condensed Matter - Superconductivity - Abstract
One of the universal features of unconventional superconductors is that the superconducting (SC) state is developed in the proximity of an antiferromagnetic (AFM) state. Understanding the interplay between these two states is one of the key issues to uncover the underlying physics of unconventional SC mechanism. Here, we report a pressure-induced flipping of the AFM state to SC state in the quasi-one-dimensional AMn6Bi5 (A = K, Rb, and Cs) compounds. We find that at a critical pressure the AFM state suddenly disappears at a finite temperature and a SC state simultaneously emerges at a lower temperature without detectable structural changes. Intriguingly, all members of the family present the AFM-SC transition at almost the same critical pressures (Pc), though their ambient-pressure unit-cell volumes vary substantially. Our theoretical calculations indicate that the increasing weight of dxz orbital electrons near Fermi energy under the pressure may be the origin of the flipping. These results reveal a diversity of competing nature between the AFM and SC states among the 3d-transition-metal compounds., Comment: 18 pages and 4 figures
- Published
- 2022
- Full Text
- View/download PDF
231. The breakdown of both strange metal and superconducting states at a pressure-induced quantum critical point in iron-pnictide superconductors
- Author
-
Cai, Shu, Zhao, Jinyu, Ni, Ni, Guo, Jing, Yang, Run, Wang, Pengyu, Han, Jinyu, Long, Sijin, Zhou, Yazhou, Wu, Qi, Qiu, Xianggang, Xiang, Tao, Cava, Robert J, and Sun, Liling
- Subjects
Condensed Matter - Superconductivity - Abstract
The strange metal (SM) state, characterized by a linear-in-temperature resistivity, is often seen in the normal state of high temperature superconductors. It is believed that the SM state is one of the keys to understand the underlying mechanism of high-Tc superconductivity. Here we report the first observation of the concurrent breakdown of the SM normal state and superconductivity at a pressure-induced quantum critical point in an iron-pnictide superconductor, Ca10(Pt4As8)((Fe0.97Pt0.03)2As2)5. We find that, upon suppressing the superconducting state by applying pressure, the power exponent changes from 1 to 2, and the corresponding coefficient A, the slope of the temperature-linear resistivity per FeAs layer, gradually diminishes. At a critical pressure (12.5 GPa), A and Tc go to zero concurrently,where a quantum phase transition (QPT) from a superconducting state with a SM normal state to a non-superconducting Fermi liquid state takes place. Scaling analysis on the results obtained from the pressurized 1048 superconductor reveals that A and Tc have a positive relation, which exhibits a similarity with that is seen in other chemically-doped unconventional superconductors, regardless of the type of the tuning method (doping or pressurizing), the crystal structure, the bulk or film superconductors and the nature of dopant. These results suggest that there is a simple but powerful organizational principle of connecting the SM normal state with the high-Tc superconductivity., Comment: 16 pages, 4 figures
- Published
- 2022
- Full Text
- View/download PDF
232. Interactive Volume Visualization via Multi-Resolution Hash Encoding based Neural Representation
- Author
-
Wu, Qi, Bauer, David, Doyle, Michael J., and Ma, Kwan-Liu
- Subjects
Computer Science - Graphics ,Computer Science - Machine Learning - Abstract
Neural networks have shown great potential in compressing volume data for visualization. However, due to the high cost of training and inference, such volumetric neural representations have thus far only been applied to offline data processing and non-interactive rendering. In this paper, we demonstrate that by simultaneously leveraging modern GPU tensor cores, a native CUDA neural network framework, and a well-designed rendering algorithm with macro-cell acceleration, we can interactively ray trace volumetric neural representations (10-60fps). Our neural representations are also high-fidelity (PSNR > 30dB) and compact (10-1000x smaller). Additionally, we show that it is possible to fit the entire training step inside a rendering loop and skip the pre-training process completely. To support extreme-scale volume data, we also develop an efficient out-of-core training strategy, which allows our volumetric neural representation training to potentially scale up to terascale using only an NVIDIA RTX 3090 workstation., Comment: There is a supplementary video for this manuscript, which can be accessed via this link: https://drive.google.com/file/d/17wSgIm_VsoeGhfyZwMpOnCYy2Mj3ydGv/view?usp=sharing
- Published
- 2022
233. Entanglement Dynamics in Anti-$\mathcal{PT}$-Symmetric Systems
- Author
-
Fang, Yu-Liang, Zhao, Jun-Long, Chen, Dong-Xu, Zhou, Yan-Hui, Zhang, Yu, Wu, Qi-Cheng, Yang, Chui-Ping, and Nori, Franco
- Subjects
Quantum Physics - Abstract
In the past years, many efforts have been made to study various noteworthy phenomena in both parity-time ($\mathcal{PT}$) and anti-parity-time ($\mathcal{APT}$) symmetric systems. However, entanglement dynamics in $\mathcal{APT}$-symmetric systems has not previously been investigated in both theory and experiments. Here, we investigate the entanglement evolution of two qubits in an $\mathcal{APT}$-symmetric system. In the $\mathcal{APT}$-symmetric unbroken regime, our theoretical simulations demonstrate the periodic oscillations of entanglement when each qubit evolves identically, while the nonperiodic oscillations of entanglement when each qubit evolves differently. In particular, when each qubit evolves near the exceptional point in the $\mathcal{APT}$-symmetric unbroken regime, there exist entanglement sudden vanishing and revival. Moreover, our simulations demonstrate rapid decay and delayed death of entanglement provided one qubit evolves in the $\mathcal{APT}$-symmetric broken regime. In this work, we also perform an experiment with a linear optical setup. The experimental results agree well with our theoretical simulation results. Our findings reveal novel phenomena of entanglement evolution in the $\mathcal{APT}$-symmetric system and opens a new direction for future studies on the dynamics of quantum entanglement in multiqubit $\mathcal{APT}$-symmetric systems or other non-Hermitian quantum systems., Comment: 13 pages, 8 figures
- Published
- 2022
- Full Text
- View/download PDF
234. Optical Field Recovery in Jones Space
- Author
-
Wu, Qi, Zhu, Yixiao, Jiang, Hexun, Zhuge, Qunbi, and Hu, Weisheng
- Subjects
Electrical Engineering and Systems Science - Signal Processing ,Physics - Optics - Abstract
Optical full-field recovery makes it possible to compensate for fiber impairments such as chromatic dispersion and polarization mode dispersion (PMD) in the digital signal processing. For cost-sensitive short-reach optical networks, some advanced single-polarization (SP) optical field recovery schemes are recently proposed to avoid chromatic dispersion-induced power fading effect, and improve the spectral efficiency for larger potential capacity. Polarization division multiplexing (PDM) can further double both the spectral efficiency and the system capacity of these SP carrier-assisted direct detection (DD) schemes. However, the so-called polarization fading phenomenon induced by random polarization rotation is a fundamental obstacle which prevents SP carrier-assisted DD systems from polarization diversity. In this paper, we propose a receiver of Jones-space field recovery (JSFR) to realize polarization diversity with SP carrier-assisted DD schemes in Jones space. Different receiver structures and simplified recovery procedures for JSFR are explored theoretically. The proposed JSFR pushes the SP DD schemes towards PDM without extra optical signal-to-noise ratio (OSNR) penalty. In addition, the JSFR shows good tolerance to PMD since the optical field recovery is conducted before polarization recovery. In the concept-of-proof experiment, we demonstrate 448-Gb/s reception over 80-km single-mode fiber using the proposed JSFR based on 22 couplers. Furthermore, we qualitatively compare the optical field recovery in Jones space and Stokes space from the perspective of the modulation dimension. Qualitatively, we compare the optical field recovery in the Jones space and Stokes space from the perspective of the modulation dimension., Comment: 8 pages and 9 figures
- Published
- 2022
- Full Text
- View/download PDF
235. FoVolNet: Fast Volume Rendering using Foveated Deep Neural Networks
- Author
-
Bauer, David, Wu, Qi, and Ma, Kwan-Liu
- Subjects
Information and Computing Sciences ,Graphics ,Augmented Reality and Games ,Bioengineering ,Networking and Information Technology R&D (NITRD) ,Eye Disease and Disorders of Vision ,Volume data ,volume visualization ,deep learning ,foveated rendering ,neural reconstruction ,Artificial Intelligence and Image Processing ,Computation Theory and Mathematics ,Software Engineering ,Information and computing sciences - Abstract
Volume data is found in many important scientific and engineering applications. Rendering this data for visualization at high quality and interactive rates for demanding applications such as virtual reality is still not easily achievable even using professional-grade hardware. We introduce FoVolNet-a method to significantly increase the performance of volume data visualization. We develop a cost-effective foveated rendering pipeline that sparsely samples a volume around a focal point and reconstructs the full-frame using a deep neural network. Foveated rendering is a technique that prioritizes rendering computations around the user's focal point. This approach leverages properties of the human visual system, thereby saving computational resources when rendering data in the periphery of the user's field of vision. Our reconstruction network combines direct and kernel prediction methods to produce fast, stable, and perceptually convincing output. With a slim design and the use of quantization, our method outperforms state-of-the-art neural reconstruction techniques in both end-to-end frame times and visual quality. We conduct extensive evaluations of the system's rendering performance, inference speed, and perceptual properties, and we provide comparisons to competing neural image reconstruction techniques. Our test results show that FoVolNet consistently achieves significant time saving over conventional rendering while preserving perceptual quality.
- Published
- 2023
236. Phenotypic and Immunological Characterization of Patients with Activated PI3Kδ Syndrome 1 Presenting with Autoimmunity
- Author
-
Li, Qifan, Wang, Wenjie, Wu, Qi, Zhou, Qinhua, Ying, Wenjing, Hui, Xiaoying, Sun, Bijun, Hou, Jia, Qian, Feng, Wang, Xiaochuan, and Sun, Jinqiao
- Published
- 2024
- Full Text
- View/download PDF
237. Experimental observation of the significant difference between surface and bulk Kondo processes in Kondo lattice YbCu2Si2
- Author
-
Zhao, Yin-Zou, Song, Jiao-Jiao, Wu, Qi-Yi, Liu, Hao, Zhang, Chen, Chen, Bo, Zhang, Hong-Yi, Chen, Zhen-Hua, Huang, Yao-Bo, Ye, Xue-Qing, Yuan, Ya-Hua, Duan, Yu-Xia, He, Jun, and Meng, Jian-Qiao
- Published
- 2024
- Full Text
- View/download PDF
238. Investigation of ultrafast laser surface patterning and nanocrystalline formation on CrTiN alloy films for enhanced cell proliferation
- Author
-
Wu, Qi-Xuan, Chang, Tien-Li, Chen, Zhao-Chi, Hsiao, Wen-Tse, and Huang, Song-Pu
- Published
- 2024
- Full Text
- View/download PDF
239. Exacerbation of symptoms, nocturnal acid reflux, and impaired autonomic function are associated with sleep disturbance in gastroesophageal reflux disease patients
- Author
-
Yizhou Huang, Jie Liu, Linsheng Xu, Wu Qi, Jie Dai, Bo Wang, Jiashuang Tian, Xin Fu, and Yue Yu
- Subjects
gastroesophageal reflux disease ,sleep disturbance ,autonomic dysfunction ,anxiety ,depression ,Medicine (General) ,R5-920 - Abstract
Background and aimGastroesophageal reflux disease (GERD) patients often report sleep disturbance (SD); however, the relationship between GERD and SD is unknown. This study investigated whether SD affects symptoms, acid reflux, and autonomic function in GERD patients.MethodsA total of 257 subjects (126 patients with SD and 99 patients without SD) participated in this survey from January 2020 to August 2022. Participants were required to complete questionnaires including the GERD impact scale (GIS), Hamilton Anxiety Scale (HAMA), and Hamilton Depression Scale (HAMD). Esophageal mucosal injury, acid exposure, peristaltic function, and autonomic function were assessed by upper endoscopy, high-resolution esophageal manometry (HRAM), 24-h multichannel intraluminal impedance with pH (24 h-MII-pH), and electrocardiography (ECG).ResultsGastroesophageal reflux disease patients with SD experienced a higher frequency of prolonged reflux (p
- Published
- 2024
- Full Text
- View/download PDF
240. Attract me to Buy: Advertisement Copywriting Generation with Multimodal Multi-structured Information
- Author
-
Zhang, Zhipeng, Hou, Xinglin, Niu, Kai, Huang, Zhongzhen, Ge, Tiezheng, Jiang, Yuning, Wu, Qi, and Wang, Peng
- Subjects
Computer Science - Computation and Language ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Multimedia - Abstract
Recently, online shopping has gradually become a common way of shopping for people all over the world. Wonderful merchandise advertisements often attract more people to buy. These advertisements properly integrate multimodal multi-structured information of commodities, such as visual spatial information and fine-grained structure information. However, traditional multimodal text generation focuses on the conventional description of what existed and happened, which does not match the requirement of advertisement copywriting in the real world. Because advertisement copywriting has a vivid language style and higher requirements of faithfulness. Unfortunately, there is a lack of reusable evaluation frameworks and a scarcity of datasets. Therefore, we present a dataset, E-MMAD (e-commercial multimodal multi-structured advertisement copywriting), which requires, and supports much more detailed information in text generation. Noticeably, it is one of the largest video captioning datasets in this field. Accordingly, we propose a baseline method and faithfulness evaluation metric on the strength of structured information reasoning to solve the demand in reality on this dataset. It surpasses the previous methods by a large margin on all metrics. The dataset and method are coming soon on \url{https://e-mmad.github.io/e-mmad.net/index.html}.
- Published
- 2022
241. BSRT: Improving Burst Super-Resolution with Swin Transformer and Flow-Guided Deformable Alignment
- Author
-
Luo, Ziwei, Li, Youwei, Cheng, Shen, Yu, Lei, Wu, Qi, Wen, Zhihong, Fan, Haoqiang, Sun, Jian, and Liu, Shuaicheng
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
This work addresses the Burst Super-Resolution (BurstSR) task using a new architecture, which requires restoring a high-quality image from a sequence of noisy, misaligned, and low-resolution RAW bursts. To overcome the challenges in BurstSR, we propose a Burst Super-Resolution Transformer (BSRT), which can significantly improve the capability of extracting inter-frame information and reconstruction. To achieve this goal, we propose a Pyramid Flow-Guided Deformable Convolution Network (Pyramid FG-DCN) and incorporate Swin Transformer Blocks and Groups as our main backbone. More specifically, we combine optical flows and deformable convolutions, hence our BSRT can handle misalignment and aggregate the potential texture information in multi-frames more efficiently. In addition, our Transformer-based structure can capture long-range dependency to further improve the performance. The evaluation on both synthetic and real-world tracks demonstrates that our approach achieves a new state-of-the-art in BurstSR task. Further, our BSRT wins the championship in the NTIRE2022 Burst Super-Resolution Challenge., Comment: CVPRW, Winner method in NTIRE 2022 Burst Super-Resolution Challenge Real-World Track
- Published
- 2022
242. Custom Sine Waves Are Enough for Imitation Learning of Bipedal Gaits with Different Styles
- Author
-
Wu, Qi, Zhang, Chong, and Liu, Yanchen
- Subjects
Computer Science - Robotics - Abstract
Not until recently, robust bipedal locomotion has been achieved through reinforcement learning. However, existing implementations rely heavily on insights and efforts from human experts, which is costly for the iterative design of robot systems. Also, styles of the learned motion are strictly limited to that of the reference. In this paper, we propose a new way to learn bipedal locomotion from a simple sine wave as the reference for foot heights. With the naive human insight that the two feet should be lifted up alternatively and periodically, we experimentally demonstrate on the Cassie robot that, a simple reward function is able to make the robot learn to walk end-to-end and efficiently without any explicit knowledge of the model. With custom sine waves, the learned gait pattern can also have customized styles. Codes are released at github.com/WooQi57/sin-cassie-rl., Comment: \c{opyright} 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
- Published
- 2022
243. Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions
- Author
-
Gu, Jing, Stefani, Eliana, Wu, Qi, Thomason, Jesse, and Wang, Xin Eric
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
A long-term goal of AI research is to build intelligent agents that can communicate with humans in natural language, perceive the environment, and perform real-world tasks. Vision-and-Language Navigation (VLN) is a fundamental and interdisciplinary research topic towards this goal, and receives increasing attention from natural language processing, computer vision, robotics, and machine learning communities. In this paper, we review contemporary studies in the emerging field of VLN, covering tasks, evaluation metrics, methods, etc. Through structured analysis of current progress and challenges, we highlight the limitations of current VLN and opportunities for future work. This paper serves as a thorough reference for the VLN research community., Comment: 19 pages. Accepted to ACL 2022
- Published
- 2022
- Full Text
- View/download PDF
244. HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation
- Author
-
Qiao, Yanyuan, Qi, Yuankai, Hong, Yicong, Yu, Zheng, Wang, Peng, and Wu, Qi
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Computation and Language - Abstract
Pre-training has been adopted in a few of recent works for Vision-and-Language Navigation (VLN). However, previous pre-training methods for VLN either lack the ability to predict future actions or ignore the trajectory contexts, which are essential for a greedy navigation process. In this work, to promote the learning of spatio-temporal visual-textual correspondence as well as the agent's capability of decision making, we propose a novel history-and-order aware pre-training paradigm (HOP) with VLN-specific objectives that exploit the past observations and support future action prediction. Specifically, in addition to the commonly used Masked Language Modeling (MLM) and Trajectory-Instruction Matching (TIM), we design two proxy tasks to model temporal order information: Trajectory Order Modeling (TOM) and Group Order Modeling (GOM). Moreover, our navigation action prediction is also enhanced by introducing the task of Action Prediction with History (APH), which takes into account the history visual perceptions. Extensive experimental results on four downstream VLN tasks (R2R, REVERIE, NDH, RxR) demonstrate the effectiveness of our proposed method compared against several state-of-the-art agents.
- Published
- 2022
245. Angle-resolved photoemission spectroscopy study of the charge density wave order in layered semiconductor EuTe4
- Author
-
Zhang, Chen, Wu, Qi-Yi, Yuan, Ya-Hua, Xia, Wei, Liu, Hao, Liu, Zi-Teng, Zhang, Hong-Yi, Song, Jiao-Jiao, Zhao, Yin-Zou, Wu, Fan-Ying, Liu, Shu-Yu, Chen, Bo, Ye, Xue-Qing, Cui, Sheng-Tao, Sun, Zhe, Tang, Xiao-Fang, He, Jun, Liu, Hai-Yun, Duan, Yu-Xia, Guo, Yan-Feng, and Meng, Jian-Qiao
- Subjects
Condensed Matter - Materials Science ,Condensed Matter - Strongly Correlated Electrons - Abstract
Layered tellurides have been extensively studied as a platform for investigating the Fermi surface (FS) nesting-driven charge density wave (CDW) states. EuTe4, one of quasi-two-dimensional (quasi-2D) binary rare-earth tetratellurides CDW compounds, with unconventional hysteretic transition, is currently receiving much attention. Here, the CDW modulation vector, momentum and temperature dependence of CDW gaps in EuTe4 are investigated using angle-resolved photoemission spectroscopy. Our results reveal that (i) a FS nesting vector q ~ 0.67 b* drives the formation of CDW state, (ii) a large anisotropic CDW gap is fully open in the whole FS, and maintains a considerable size even at 300 K, leading to appearance of semiconductor properties, (iii) an abnormal non-monotonic increase of CDW gap in magnitude as a function of temperature, (iv) an extra, larger gap opens at a higher binding energy due to the interaction between the different orbits of the main bands., Comment: 5 pages, 4 figures
- Published
- 2022
- Full Text
- View/download PDF
246. MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering
- Author
-
Ding, Yang, Yu, Jing, Liu, Bang, Hu, Yue, Cui, Mingxin, and Wu, Qi
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Multimedia - Abstract
Knowledge-based visual question answering requires the ability of associating external knowledge for open-ended cross-modal scene understanding. One limitation of existing solutions is that they capture relevant knowledge from text-only knowledge bases, which merely contain facts expressed by first-order predicates or language descriptions while lacking complex but indispensable multimodal knowledge for visual understanding. How to construct vision-relevant and explainable multimodal knowledge for the VQA scenario has been less studied. In this paper, we propose MuKEA to represent multimodal knowledge by an explicit triplet to correlate visual objects and fact answers with implicit relations. To bridge the heterogeneous gap, we propose three objective losses to learn the triplet representations from complementary views: embedding structure, topological relation and semantic space. By adopting a pre-training and fine-tuning learning strategy, both basic and domain-specific multimodal knowledge are progressively accumulated for answer prediction. We outperform the state-of-the-art by 3.35% and 6.08% respectively on two challenging knowledge-required datasets: OK-VQA and KRVQA. Experimental results prove the complementary benefits of the multimodal knowledge with existing knowledge bases and the advantages of our end-to-end framework over the existing pipeline methods. The code is available at https://github.com/AndersonStra/MuKEA., Comment: Accepted by CVPR2022
- Published
- 2022
247. Entanglement-interference complementarity and experimental demonstration in a superconducting circuit
- Author
-
Huang, Xin-Jie, Han, Pei-Rong, Ning, Wen, Yang, Shou-Bang, Zhu, Xin, Lü, Jia-Hao, Zheng, Ri-Hua, Li, Hekang, Yang, Zhen-Biao, Xu, Kai, Yang, Chui-Ping, Wu, Qi-Cheng, Zheng, Dongning, Fan, Heng, and Zheng, Shi-Biao
- Subjects
Quantum Physics - Abstract
Quantum entanglement between an interfering particle and a detector for acquiring the which-path information plays a central role for enforcing Bohr's complementarity principle. However, the quantitative relation between this entanglement and the fringe visibility remains untouched upon for an initial mixed state. Here we find an equality for quantifying this relation. Our equality characterizes how well the interference pattern can be preserved when an interfering particle, initially carrying a definite amount of coherence, is entangled, to a certain degree, with a which-path detector. This equality provides a connection between entanglement and interference in the unified framework of coherence, revealing the quantitative entanglement-interference complementarity. We experimentally demonstrate this relation with a superconducting circuit, where a resonator serves as a which-path detector for an interfering qubit. The measured fringe visibility of the qubit's Ramsey signal and the qubit-resonator entanglement exhibit a complementary relation, in well agreement with the theoretical prediction., Comment: 19 pages, 9 figures, 1 table
- Published
- 2022
- Full Text
- View/download PDF
248. SQUAB I: The first release of Strange QUasar candidates with ABnormal astrometric characteristics from Gaia EDR3 and SDSS
- Author
-
Wu, Qi-Qi, Liao, Shi-Long, Ji, Xiang, Qi, Zhao-Xiang, Lin, Zhen-Ya Zheng Ru-Qiu, Zhang, Ying-Kang, and An, Tao
- Subjects
Astrophysics - Astrophysics of Galaxies ,Astrophysics - Cosmology and Nongalactic Astrophysics - Abstract
Given their extremely large distances and small apparent sizes, quasars are generally considered as objects with near-zero parallax and proper motion. However, some special quasars may have abnormal astrometric characteristics, such as quasar pairs, lensed quasars, AGNs with bright parsec-scale optical jets, which are scientifically interesting objects, such as binary black holes. These quasars may come with astrometric jitter detectable with Gaia data, or significant changes in the position at different wavelengths. In this work, we aim to find these quasar candidates from Gaia EDR3 astrometric data combining with Sloan Digital Sky Survey (SDSS) spectroscopic data to provide a candidate catalog to the science community. We propose a series of criteria for selecting abnormal quasars based on Gaia astrometric data. We obtain two catalogs containing 155 sources and 44 sources, respectively. They are potential candidates of quasar pairs., Comment: 17 pages, 13 figures Accepted on 28 January 2022 Front. Astron. Space Sci
- Published
- 2022
- Full Text
- View/download PDF
249. Charge Density Wave order and electron-phonon coupling in ternary superconductor Bi$_2$Rh$_3$Se$_2$
- Author
-
Liu, Zi-Teng, Zhang, Chen, Wu, Qi-Yi, Liu, Hao, Chen, Bo, Yin, Zhi-Bo, Cui, Sheng-Tao, Sun, Zhe, Zhu, Shuang-Xing, Song, Jiao-Jiao, Zhao, Yin-Zou, Zhang, Hong-Yi, Ye, Xue-Qing, Fan-YingWu, Liu, Shu-Yu, Tang, Xiao-Fang, Yuan, Ya-Hua, Wang, Yun-Peng, He, Jun, Liu, Hai-Yun, Duan, Yu-Xia, and Meng, Jian-Qiao
- Subjects
Condensed Matter - Superconductivity ,Condensed Matter - Strongly Correlated Electrons - Abstract
The newly discovered ternary chalcogenide superconductor Bi$_2$Rh$_3$Se$_2$ has attracted growing attention, which provides an opportunity to explore the interplay between charge density wave (CDW) order and superconductivity. However, whether the phase transition at 240 K can be attributed to CDW formation remains controversial. To help resolve the debate, we study the electronic structure study of Bi2Rh3Se2 by angle-resolved photoemission spectroscopy experiments, with emphasis on the nature of its high-temperature phase transition at 240 K. Our measurements demonstrate that the phase transition at 240 K is a second-order CDW phase transition. Our results reveal (i) a 2 x 2 CDW order in Bi$_2$Rh$_3$Se$_2$, accompanied by the reconstruction of electronic structure, such as band folding, band splitting, and opening of CDW gaps at and away from Fermi level; (ii) the existence of electron-phonon coupling, which is manifested as an obvious kink and peak-dip-hump structure in dispersion; and (iii) the appearance of a flat band. Our observations thus enable us to shed light on the nature of the CDW order and its interplay with superconductivity in Bi$_2$Rh$_3$Se$_2$., Comment: 5 pages, 5 figures
- Published
- 2022
- Full Text
- View/download PDF
250. Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
- Author
-
Hong, Yicong, Wang, Zun, Wu, Qi, and Gould, Stephen
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Computation and Language ,Computer Science - Robotics - Abstract
Most existing works in vision-and-language navigation (VLN) focus on either discrete or continuous environments, training agents that cannot generalize across the two. The fundamental difference between the two setups is that discrete navigation assumes prior knowledge of the connectivity graph of the environment, so that the agent can effectively transfer the problem of navigation with low-level controls to jumping from node to node with high-level actions by grounding to an image of a navigable direction. To bridge the discrete-to-continuous gap, we propose a predictor to generate a set of candidate waypoints during navigation, so that agents designed with high-level actions can be transferred to and trained in continuous environments. We refine the connectivity graph of Matterport3D to fit the continuous Habitat-Matterport3D, and train the waypoints predictor with the refined graphs to produce accessible waypoints at each time step. Moreover, we demonstrate that the predicted waypoints can be augmented during training to diversify the views and paths, and therefore enhance agent's generalization ability. Through extensive experiments we show that agents navigating in continuous environments with predicted waypoints perform significantly better than agents using low-level actions, which reduces the absolute discrete-to-continuous gap by 11.76% Success Weighted by Path Length (SPL) for the Cross-Modal Matching Agent and 18.24% SPL for the Recurrent VLN-BERT. Our agents, trained with a simple imitation learning objective, outperform previous methods by a large margin, achieving new state-of-the-art results on the testing environments of the R2R-CE and the RxR-CE datasets.
- Published
- 2022
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.