831,240 results on '"Ning, An"'
Search Results
2. A Confucian Construction of Joseph Conrad’s Sincerity
- Author
-
Ning, An
- Published
- 2022
- Full Text
- View/download PDF
3. A neutrino flare associated with X-ray emission from TDE ATLAS17jrp
- Author
-
Li, Rong-Lan, Yuan, Chengchao, He, Hao-Ning, Wang, Yun, Zhu, Ben-Yang, Liang, Yun-Feng, Jiang, Ning, and Wei, Da-Ming
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
Tidal disruption events (TDEs), where stars are captured or tidally disrupted by supermassive black holes, are potential sources of high-energy neutrinos. We report the discovery of a potential neutrino flare that is spatially and temporally associated with X-ray emission from TDE ATLAS17jrp. The best-fit spectrum of the neutrino flare follows a power-law with an index of $\rm{\gamma=2.7\pm0.4}$ and a flux normalization of $\rm{\Phi_0 =1.7^{+6.3}_{-1.5}\times 10^{-18}\;GeV^{-1} cm^{-2} s^{-1}}$ at 100 TeV based on an analysis of 10-year track data from IceCube, and the flare duration is 61 days. We calculate that the probability of this association occurring by chance is $0.17\%$. Therefore, ATLAS17jrp is the second TDE (not including candidates) associated with high-energy neutrinos, following TDE AT2019dsg associated with an IceCube neutrino alert. This association can be attributed to the interaction of X-ray photons produced by the hot corona with high-energy particles accelerated by disk winds or outflows, resulting in the production of neutrinos., Comment: 8 pages, 2 figures, submitted to ApJL
- Published
- 2024
4. Integrating Window-Based Correlated Decoding with Constant-Time Logical Gates for Large-Scale Quantum Computation
- Author
-
Zhang, Jiaxuan, Chen, Zhao-Yun, Li, Jia-Ning, Wei, Tian-Hao, Liu, Huan-Yu, Zhuang, Xi-Ning, Li, Qing-Song, Wu, Yu-Chun, and Guo, Guo-Ping
- Subjects
Quantum Physics - Abstract
Large-scale quantum computation requires to be performed in the fault-tolerant manner. One crucial issue of fault-tolerant quantum computing (FTQC) is reducing the overhead of implementing logical gates. Recently proposed correlated decoding and ``algorithmic fault tolerance" achieve fast logical gates that enables universal quantum computation. However, for circuits involving mid-circuit measurements and feedback, this approach is incompatible with window-based decoding, which is a natural requirement for handling large-scale circuits. In this letter, we propose an alternative architecture that employs delayed fixup circuits, integrating window-based correlated decoding with fast transversal gates. This design significantly reduce both the frequency and duration of correlated decoding, while maintaining support for constant-time logical gates and universality across a broad class of quantum codes. More importantly, by spatial parallelism of windows, this architecture well adapts to time-optimal FTQC, making it particularly useful for large-scale computation. Using Shor's algorithm as an example, we explore the application of our architecture and reveals the promising potential of using fast transversal gates to perform large-scale quantum computing tasks with acceptable overhead on physical systems like ion traps., Comment: 11 pages, 9 figures
- Published
- 2024
5. ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
- Author
-
Chen, Ziru, Chen, Shijie, Ning, Yuting, Zhang, Qianheng, Wang, Boshi, Yu, Botao, Li, Yifei, Liao, Zeyi, Wei, Chen, Lu, Zitong, Dey, Vishal, Xue, Mingyi, Baker, Frazier N., Burns, Benjamin, Adu-Ampratwum, Daniel, Huang, Xuhui, Ning, Xia, Gao, Song, Su, Yu, and Sun, Huan
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
The advancements of language language models (LLMs) have piqued growing interest in developing LLM-based language agents to automate scientific discovery end-to-end, which has sparked both excitement and skepticism about their true capabilities. In this work, we call for rigorous assessment of agents on individual tasks in a scientific workflow before making bold claims on end-to-end automation. To ensure the scientific authenticity and real-world relevance of our benchmark, we extract 102 tasks from 44 peer-reviewed publications in four disciplines and engage nine subject matter experts to validate them. We unify the target output for every task to a self-contained Python program file and employ an array of evaluation metrics to examine the generated programs, execution results, and costs. Each task goes through multiple rounds of manual validation by annotators and subject matter experts to ensure its annotation quality and scientific plausibility. We also propose two effective strategies to mitigate data contamination concerns. Using our benchmark, we evaluate five open-weight and proprietary LLMs, each with three frameworks: direct prompting, OpenHands CodeAct, and self-debug. Given three attempts for each task, the best-performing agent can only solve 32.4% of the tasks independently and 34.3% with expert-provided knowledge. In addition, we evaluate OpenAI o1 with direct prompting and self-debug, which demonstrates the effectiveness of increasing inference-time compute. Still, our results underscore the limitations of current language agents in generating code for data-driven discovery, let alone end-to-end automation for scientific research., Comment: 57 pages
- Published
- 2024
6. Searching for MeV-scale Axion-like Particles and Dark Photons with PandaX-4T
- Author
-
PandaX Collaboration, Li, Tao, Bo, Zihao, Chen, Wei, Chen, Xun, Chen, Yunhua, Cheng, Zhaokan, Cui, Xiangyi, Fan, Yingjie, Fang, Deqing, Gao, Zhixing, Geng, Lisheng, Giboni, Karl, Guo, Xunan, Guo, Xuyuan, Guo, Zichao, Han, Chencheng, He, Ke HanChangda, He, Jinrong, Huang, Di, Huang, Houqi, Huang, Junting, Hou, Ruquan, Hou, Yu, Ji, Xiangdong, Ji, Xiangpan, Ju, Yonglin, Li, Chenxiang, Li, Jiafu, Li, Mingchuan, Li, Shuaijie, Li, Zhiyuan, Lin, Qing, Liu, Jianglai, Lu, Congcong, Lu, Xiaoying, Luo, Lingyin, Luo, Yunyang, Ma, Wenbo, Ma, Yugang, Mao, Yajun, Meng, Yue, Ning, Xuyang, Pang, Binyu, Qi, Ningchun, Qian, Zhicheng, Ren, Xiangxiang, Shan, Dong, Shang, Xiaofeng, Shao, Xiyuan, Shen, Guofang, Shen, Manbin, Sun, Wenliang, Tao, Yi, Wang, Anqing, Wang, Guanbo, Wang, Hao, Wang, Jiamin, Wang, Lei, Wang, Meng, Wang, Qiuhong, Wang, Shaobo, Wang, Siguang, Wang, Wei, Wang, Xiuli, Wang, Xu, Wang, Zhou, Wei, Yuehuan, Wu, Weihao, Wu, Yuan, Xiao, Mengjiao, Xiao, Xiang, Xiong, Kaizhi, Xu, Yifan, Yao, Shunyu, Yan, Binbin, Yan, Xiyu, Yang, Yong, Ye, Peihua, Yu, Chunxu, Yuan, Ying, Yuan, Zhe, Yun, Youhui, Zeng, Xinning, Zhang, Minzhen, Zhang, Peng, Zhang, Shibo, Zhang, Shu, Zhang, Tao, Zhang, Wei, Zhang, Yang, Zhang, Yingxin, Zhang, Yuanyuan, Zhao, Li, Zhou, Jifang, Zhou, Jiaxu, Zhou, Jiayi, Zhou, Ning, Zhou, Xiaopeng, Zhou, Yubo, and Zhou, Zhizhen
- Subjects
High Energy Physics - Experiment - Abstract
Axion-like particles (ALPs) and dark photons (DPs) are viable dark matter particle candidates. We have searched for possible ALP/DP signals in the PandaX-4T liquid xenon detector using 94.8 days of data. A binned likelihood fit is constructed to search for possible mono-energetic peaks induced by the absorption processes between ALPs/DPs and atomic electrons of xenon. A detailed temporal model of decays associated with xenon isotopes is introduced to constrain the number of background events. No signal excess over background expectations is observed, and we have established the most stringent exclusion limits for most ALP/DP masses ranging from 150 keV/$c^2$ to 1 MeV/$c^2$.
- Published
- 2024
7. Conrad and the Chinese Reader: Confucius and “Amy Foster”
- Author
-
Ning, An
- Published
- 2021
- Full Text
- View/download PDF
8. Reinforced Disentanglers on Random Unitary Circuits
- Author
-
Bao, Ning, Furuya, Keiichiro, and Suer, Gun
- Subjects
Quantum Physics ,Condensed Matter - Disordered Systems and Neural Networks ,Condensed Matter - Statistical Mechanics ,Computer Science - Machine Learning - Abstract
We search for efficient disentanglers on random Clifford circuits of two-qubit gates arranged in a brick-wall pattern, using the proximal policy optimization (PPO) algorithm \cite{schulman2017proximalpolicyoptimizationalgorithms}. Disentanglers are defined as a set of projective measurements inserted between consecutive entangling layers. An efficient disentangler is a set of projective measurements that minimize the averaged von Neumann entropy of the final state with the least number of total projections possible. The problem is naturally amenable to reinforcement learning techniques by taking the binary matrix representing the projective measurements along the circuit as our state, and actions as bit flipping operations on this binary matrix that add or delete measurements at specified locations. We give rewards to our agent dependent on the averaged von Neumann entropy of the final state and the configuration of measurements, such that the agent learns the optimal policy that will take him from the initial state of no measurements to the optimal measurement state that minimizes the entanglement entropy. Our results indicate that the number of measurements required to disentangle a random quantum circuit is drastically less than the numerical results of measurement-induced phase transition papers. Additionally, the reinforcement learning procedure enables us to characterize the pattern of optimal disentanglers, which is not possible in the works of measurement-induced phase transitions., Comment: 9 pages, 7 figures, 1 table. Submitted to QIP 2025
- Published
- 2024
9. NEP-MB-pol: A unified machine-learned framework for fast and accurate prediction of water's thermodynamic and transport properties
- Author
-
Xu, Ke, Liang, Ting, Xu, Nan, Ying, Penghua, Chen, Shunda, Wei, Ning, Xu, Jianbin, and Fan, Zheyong
- Subjects
Physics - Chemical Physics ,Condensed Matter - Materials Science ,Condensed Matter - Soft Condensed Matter ,Condensed Matter - Statistical Mechanics - Abstract
Water's unique hydrogen-bonding network and anomalous properties present significant challenges for accurately modeling its structural, thermodynamic, and transport behavior across varied conditions. Although machine-learned potentials have advanced the prediction of individual properties, a unified computational framework capable of simultaneously capturing water's complex and subtle properties with high accuracy has remained elusive. Here, we address this challenge by introducing NEP-MB-pol, a highly accurate and efficient neuroevolution potential trained on extensive MB-pol reference data with coupled-cluster-level accuracy, combined with path-integral molecular dynamics and quantum-correction techniques to incorporate nuclear quantum effects. This NEP-MB-pol framework reproduces experimentally measured structural, thermodynamic, and transport properties of water across a broad temperature range, achieving simultaneous, fast, and accurate prediction of self-diffusion coefficient, viscosity, and thermal conductivity. Our approach provides a unified and robust tool for exploring thermodynamic and transport properties of water under diverse conditions, with significant potential for broader applications across research fields., Comment: 12 pages, 4 figures in the main text; 8 figures in the SI
- Published
- 2024
10. All-optical magnetic imaging with spin defects in van der Waals materials at Angstrom-scale
- Author
-
Wang, Ning, Cai, Jianming, and Lei, Chao
- Subjects
Quantum Physics - Abstract
Magnetic imaging with ultra-high spatial resolution is crucial to exploring the magnetic textures of emerging quantum materials. We propose a novel magnetic imaging protocol that achieves Angstrom-scale resolution by combining spin defects in van der Waals materials and terahertz scattering scanning near-field optical microscopy (THz s-SNOM). Spin defects in the atomic monolayer enable the probe-to-sample distance diving into the Angstrom range where the exchange interactions between the probe and sample spins become predominant. This exchange interaction leads to energy splitting of the probe spin in the order of millielectronvolts, corresponding to THz frequencies. With THz optics and the spin-dependent fluorescence of the probe spin, the interaction energy can be resolved entirely through optical methods. Our proposed all-optical magnetic imaging protocol holds significant promise for investigating magnetic textures in condensed matter physics due to its excellent compatibility and high spatial resolution.
- Published
- 2024
11. DarkSHINE Baseline Design Report: Physics Prospects and Detector Technologies
- Author
-
Chen, Jing, Chen, Ji-Yuan, Chen, Jun-Feng, Chen, Xiang, Fu, Chang-Bo, Guo, Jun, Guo, Yi-Han, Khaw, Kim Siang, Li, Jia-Lin, Li, Liang, Li, Shu, Lin, Yu-ming, Liu, Dan-Ning, Liu, Kang, Liu, Kun, Liu, Qi-Bin, Liu, Zhi, Lu, Ze-Jia, Lv, Meng, Song, Si-Yuan, Sun, Tong, Tang, Jian-Nan, Wan, Wei-Shi, Wang, Dong, Wang, Xiao-Long, Wang, Yu-Feng, Wang, Zhen, Wang, Zi-Rui, Wu, Wei-Hao, Yang, Hai-Jun, Yang, Lin, Yang, Yong, Yu, Dian, Yuan, Rui, Zhang, Jun-Hua, Zhang, Yu-Lei, Zhang, Yun-Long, Zhao, Zhi-Yu, Zhou, Bai-Hong, Zhu, Chun-Xiang, Zhu, Xu-Liang, and Zhu, Yi-Fan
- Subjects
Physics - Instrumentation and Detectors ,High Energy Physics - Experiment - Abstract
DarkSHINE is a newly proposed fixed-target experiment initiative to search for the invisible decay of Dark Photon via missing energy/momentum signatures, based on the high repetition rate electron beam to be deployed/delivered by the Shanghai High repetition rate XFEL and Extreme light facility (SHINE). This report elaborates the baseline design of DarkSHINE experiment by introducing the physics goals, experimental setups, details of each sub-detector system technical designs, signal and backgground modelings, expected search sensitivities and future prospects, which mark an important step towards the further prototyping and technical demonstrations.
- Published
- 2024
12. Cross-Modal Consistency in Multimodal Large Language Models
- Author
-
Zhang, Xiang, Li, Senyu, Shi, Ning, Hauer, Bradley, Wu, Zijun, Kondrak, Grzegorz, Abdul-Mageed, Muhammad, and Lakshmanan, Laks V. S.
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Recent developments in multimodal methodologies have marked the beginning of an exciting era for models adept at processing diverse data types, encompassing text, audio, and visual content. Models like GPT-4V, which merge computer vision with advanced language processing, exhibit extraordinary proficiency in handling intricate tasks that require a simultaneous understanding of both textual and visual information. Prior research efforts have meticulously evaluated the efficacy of these Vision Large Language Models (VLLMs) in various domains, including object detection, image captioning, and other related fields. However, existing analyses have often suffered from limitations, primarily centering on the isolated evaluation of each modality's performance while neglecting to explore their intricate cross-modal interactions. Specifically, the question of whether these models achieve the same level of accuracy when confronted with identical task instances across different modalities remains unanswered. In this study, we take the initiative to delve into the interaction and comparison among these modalities of interest by introducing a novel concept termed cross-modal consistency. Furthermore, we propose a quantitative evaluation framework founded on this concept. Our experimental findings, drawn from a curated collection of parallel vision-language datasets developed by us, unveil a pronounced inconsistency between the vision and language modalities within GPT-4V, despite its portrayal as a unified multimodal model. Our research yields insights into the appropriate utilization of such models and hints at potential avenues for enhancing their design.
- Published
- 2024
13. Stability of the catenoid for the hyperbolic vanishing mean curvature equation in 4 spatial dimensions
- Author
-
Tang, Ning
- Subjects
Mathematics - Analysis of PDEs ,Mathematics - Differential Geometry - Abstract
We establish the asymptotic stability of the catenoid, as a nonflat stationary solution to the hyperbolic vanishing mean curvature (HVMC) equation in Minkowski space $\mathbb{R}^{1 + (n + 1)}$ for $n = 4$. Our main result is under a ``codimension-$1$'' assumption on initial perturbation, modulo suitable translation and boost (i.e. modulation), without any symmetry assumptions. In comparison to the $n \geq 5$ case addressed by L\"{u}hrmann-Oh-Shahshahani arxiv:2212.05620, proving catenoid stability in $4$ dimensions shares additional difficulties with its $3$ dimensional analog, namely the slower spatial decay of the catenoid and slower temporal decay of waves. To overcome these difficulties in the $n = 3$ case, the strong Huygens principle, as well as a miraculous cancellation in the source term, plays an important role in arxiv:2409.05968 to obtain strong late time tails. In $n = 4$ dimensions, without these special structural advantages, our novelty is to introduce an appropriate commutator vector field to derive a new hierarchy of estimates with higher $r^p$-weights so that an improved pointwise decay can be established. We expect this to be applicable for proving improved late time tails of other quasilinear wave equations in even dimensions or wave equations with inverse square potential., Comment: 105 pages, 1 figure
- Published
- 2024
14. Multimodal Instruction Tuning with Hybrid State Space Models
- Author
-
Zhou, Jianing, Li, Han, Zhang, Shuai, Xie, Ning, Wang, Ruijie, Nie, Xiaohan, Liu, Sheng, and Wang, Lingyun
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Handling lengthy context is crucial for enhancing the recognition and understanding capabilities of multimodal large language models (MLLMs) in applications such as processing high-resolution images or high frame rate videos. The rise in image resolution and frame rate substantially increases computational demands due to the increased number of input tokens. This challenge is further exacerbated by the quadratic complexity with respect to sequence length of the self-attention mechanism. Most prior works either pre-train models with long contexts, overlooking the efficiency problem, or attempt to reduce the context length via downsampling (e.g., identify the key image patches or frames) to decrease the context length, which may result in information loss. To circumvent this issue while keeping the remarkable effectiveness of MLLMs, we propose a novel approach using a hybrid transformer-MAMBA model to efficiently handle long contexts in multimodal applications. Our multimodal model can effectively process long context input exceeding 100k tokens, outperforming existing models across various benchmarks. Remarkably, our model enhances inference efficiency for high-resolution images and high-frame-rate videos by about 4 times compared to current models, with efficiency gains increasing as image resolution or video frames rise. Furthermore, our model is the first to be trained on low-resolution images or low-frame-rate videos while being capable of inference on high-resolution images and high-frame-rate videos, offering flexibility for inference in diverse scenarios.
- Published
- 2024
15. Study of the light scalar $a_{0}(980)$ through the decay $D^{0} \to a_{0}(980)^-e^{+} \nu_{e}$ with $a_{0}(980)^- \to \eta \pi^-$
- Author
-
BESIII Collaboration, Ablikim, M., Achasov, M. N., Adlarson, P., Afedulidis, O., Ai, X. C., Aliberti, R., Amoroso, A., An, Q., Bai, Y., Bakina, O., Balossino, I., Ban, Y., Bao, H. -R., Batozskaya, V., Begzsuren, K., Berger, N., Berlowski, M., Bertani, M., Bettoni, D., Bianchi, F., Bianco, E., Bortone, A., Boyko, I., Briere, R. A., Brueggemann, A., Cai, H., Cai, X., Calcaterra, A., Cao, G. F., Cao, N., Cetin, S. A., Chai, X. Y., Chang, J. F., Che, G. R., Che, Y. Z., Chelkov, G., Chen, C., Chen, C. H., Chen, Chao, Chen, G., Chen, H. S., Chen, H. Y., Chen, M. L., Chen, S. J., Chen, S. L., Chen, S. M., Chen, T., Chen, X. R., Chen, X. T., Chen, Y. B., Chen, Y. Q., Chen, Z. J., Chen, Z. Y., Choi, S. K., Cibinetto, G., Cossio, F., Cui, J. J., Dai, H. L., Dai, J. P., Dbeyssi, A., de Boer, R. E., Dedovich, D., Deng, C. Q., Deng, Z. Y., Denig, A., Denysenko, I., Destefanis, M., De Mori, F., Ding, B., Ding, X. X., Ding, Y., Dong, J., Dong, L. Y., Dong, M. Y., Dong, X., Du, M. C., Du, S. X., Duan, Y. Y., Duan, Z. H., Egorov, P., Fan, Y. H., Fang, J., Fang, S. S., Fang, W. X., Fang, Y., Fang, Y. Q., Farinelli, R., Fava, L., Feldbauer, F., Felici, G., Feng, C. Q., Feng, J. H., Feng, Y. T., Fritsch, M., Fu, C. D., Fu, J. L., Fu, Y. W., Gao, H., Gao, X. B., Gao, Y. N., Gao, Yang, Garbolino, S., Garzia, I., Ge, L., Ge, P. T., Ge, Z. W., Geng, C., Gersabeck, E. M., Gilman, A., Goetzen, K., Gong, L., Gong, W. X., Gradl, W., Gramigna, S., Greco, M., Gu, M. H., Gu, Y. T., Guan, C. Y., Guo, A. Q., Guo, L. B., Guo, M. J., Guo, R. P., Guo, Y. P., Guskov, A., Gutierrez, J., Han, K. L., Han, T. T., Hanisch, F., Hao, X. Q., Harris, F. A., He, K. K., He, K. L., Heinsius, F. H., Heinz, C. H., Heng, Y. K., Herold, C., Holtmann, T., Hong, P. C., Hou, G. Y., Hou, X. T., Hou, Y. R., Hou, Z. L., Hu, B. Y., Hu, H. M., Hu, J. F., Hu, Q. P., Hu, S. L., Hu, T., Hu, Y., Huang, G. S., Huang, K. X., Huang, L. Q., Huang, X. T., Huang, Y. P., Huang, Y. S., Hussain, T., Hölzken, F., Hüsken, N., der Wiesche, N. in, Jackson, J., Janchiv, S., Jeong, J. H., Ji, Q., Ji, Q. P., Ji, W., Ji, X. B., Ji, X. L., Ji, Y. Y., Jia, X. Q., Jia, Z. K., Jiang, D., Jiang, H. B., Jiang, P. C., Jiang, S. S., Jiang, T. J., Jiang, X. S., Jiang, Y., Jiao, J. B., Jiao, J. K., Jiao, Z., Jin, S., Jin, Y., Jing, M. Q., Jing, X. M., Johansson, T., Kabana, S., Kalantar-Nayestanaki, N., Kang, X. L., Kang, X. S., Kavatsyuk, M., Ke, B. C., Khachatryan, V., Khoukaz, A., Kiuchi, R., Kolcu, O. B., Kopf, B., Kuessner, M., Kui, X., Kumar, N., Kupsc, A., Kühn, W., Lavezzi, L., Lei, T. T., Lei, Z. H., Lellmann, M., Lenz, T., Li, C., Li, C. H., Li, Cheng, Li, D. M., Li, F., Li, G., Li, H. B., Li, H. J., Li, H. N., Li, Hui, Li, J. R., Li, J. S., Li, K., Li, K. L., Li, L. J., Li, L. K., Li, Lei, Li, M. H., Li, P. R., Li, Q. M., Li, Q. X., Li, R., Li, S. X., Li, T., Li, W. D., Li, W. G., Li, X., Li, X. H., Li, X. L., Li, X. Y., Li, X. Z., Li, Y. G., Li, Z. J., Li, Z. Y., Liang, C., Liang, H., Liang, Y. F., Liang, Y. T., Liao, G. R., Liao, Y. P., Libby, J., Limphirat, A., Lin, C. C., Lin, C. X., Lin, D. X., Lin, T., Liu, B. J., Liu, B. X., Liu, C., Liu, C. X., Liu, F., Liu, F. H., Liu, Feng, Liu, G. M., Liu, H., Liu, H. B., Liu, H. H., Liu, H. M., Liu, Huihui, Liu, J. B., Liu, J. Y., Liu, K., Liu, K. Y., Liu, Ke, Liu, L., Liu, L. C., Liu, Lu, Liu, M. H., Liu, P. L., Liu, Q., Liu, S. B., Liu, T., Liu, W. K., Liu, W. M., Liu, X., Liu, Y., Liu, Y. B., Liu, Z. A., Liu, Z. D., Liu, Z. Q., Lou, X. C., Lu, F. X., Lu, H. J., Lu, J. G., Lu, X. L., Lu, Y., Lu, Y. P., Lu, Z. H., Luo, C. L., Luo, J. R., Luo, M. X., Luo, T., Luo, X. L., Lyu, X. R., Lyu, Y. F., Ma, F. C., Ma, H., Ma, H. L., Ma, J. L., Ma, L. L., Ma, L. R., Ma, M. M., Ma, Q. M., Ma, R. Q., Ma, T., Ma, X. T., Ma, X. Y., Ma, Y. M., Maas, F. E., MacKay, I., Maggiora, M., Malde, S., Mao, Y. J., Mao, Z. P., Marcello, S., Meng, Z. X., Messchendorp, J. G., Mezzadri, G., Miao, H., Min, T. J., Mitchell, R. E., Mo, X. H., Moses, B., Muchnoi, N. Yu., Muskalla, J., Nefedov, Y., Nerling, F., Nie, L. S., Nikolaev, I. B., Ning, Z., Nisar, S., Niu, Q. L., Niu, W. D., Niu, Y., Olsen, S. L., Ouyang, Q., Pacetti, S., Pan, X., Pan, Y., Pathak, A., Pei, Y. P., Pelizaeus, M., Peng, H. P., Peng, Y. Y., Peters, K., Ping, J. L., Ping, R. G., Plura, S., Prasad, V., Qi, F. Z., Qi, H., Qi, H. R., Qi, M., Qi, T. Y., Qian, S., Qian, W. B., Qiao, C. F., Qiao, X. K., Qin, J. J., Qin, L. Q., Qin, L. Y., Qin, X. P., Qin, X. S., Qin, Z. H., Qiu, J. F., Qu, Z. H., Redmer, C. F., Ren, K. J., Rivetti, A., Rolo, M., Rong, G., Rosner, Ch., Ruan, M. Q., Ruan, S. N., Salone, N., Sarantsev, A., Schelhaas, Y., Schoenning, K., Scodeggio, M., Shan, K. Y., Shan, W., Shan, X. Y., Shang, Z. J., Shangguan, J. F., Shao, L. G., Shao, M., Shen, C. P., Shen, H. F., Shen, W. H., Shen, X. Y., Shi, B. A., Shi, H., Shi, H. C., Shi, J. L., Shi, J. Y., Shi, Q. Q., Shi, S. Y., Shi, X., Song, J. J., Song, T. Z., Song, W. M., Song, Y. J., Song, Y. X., Sosio, S., Spataro, S., Stieler, F., Su, S. S, Su, Y. J., Sun, G. B., Sun, G. X., Sun, H., Sun, H. K., Sun, J. F., Sun, K., Sun, L., Sun, S. S., Sun, T., Sun, W. Y., Sun, Y., Sun, Y. J., Sun, Y. Z., Sun, Z. Q., Sun, Z. T., Tang, C. J., Tang, G. Y., Tang, J., Tang, M., Tang, Y. A., Tao, L. Y., Tao, Q. T., Tat, M., Teng, J. X., Thoren, V., Tian, W. H., Tian, Y., Tian, Z. F., Uman, I., Wan, Y., Wang, S. J., Wang, B., Wang, B. L., Wang, Bo, Wang, D. Y., Wang, F., Wang, H. J., Wang, J. J., Wang, J. P., Wang, K., Wang, L. L., Wang, M., Wang, N. Y., Wang, S., Wang, T., Wang, T. J., Wang, W., Wang, W. P., Wang, X., Wang, X. F., Wang, X. J., Wang, X. L., Wang, X. N., Wang, Y., Wang, Y. D., Wang, Y. F., Wang, Y. H., Wang, Y. L., Wang, Y. N., Wang, Y. Q., Wang, Yaqian, Wang, Yi, Wang, Z., Wang, Z. L., Wang, Z. Y., Wang, Ziyi, Wei, D. H., Weidner, F., Wen, S. P., Wen, Y. R., Wiedner, U., Wilkinson, G., Wolke, M., Wollenberg, L., Wu, C., Wu, J. F., Wu, L. H., Wu, L. J., Wu, X., Wu, X. H., Wu, Y., Wu, Y. H., Wu, Y. J., Wu, Z., Xia, L., Xian, X. M., Xiang, B. H., Xiang, T., Xiao, D., Xiao, G. Y., Xiao, S. Y., Xiao, Y. L., Xiao, Z. J., Xie, C., Xie, X. H., Xie, Y., Xie, Y. G., Xie, Y. H., Xie, Z. P., Xing, T. Y., Xu, C. F., Xu, C. J., Xu, G. F., Xu, H. Y., Xu, M., Xu, Q. J., Xu, Q. N., Xu, W., Xu, W. L., Xu, X. P., Xu, Y., Xu, Y. C., Xu, Z. S., Yan, F., Yan, L., Yan, W. B., Yan, W. C., Yan, X. Q., Yang, H. J., Yang, H. L., Yang, H. X., Yang, J. H., Yang, T., Yang, Y., Yang, Y. F., Yang, Y. X., Yang, Z. W., Yao, Z. P., Ye, M., Ye, M. H., Yin, J. H., Yin, Junhao, You, Z. Y., Yu, B. X., Yu, C. X., Yu, G., Yu, J. S., Yu, M. C., Yu, T., Yu, X. D., Yu, Y. C., Yuan, C. Z., Yuan, J., Yuan, L., Yuan, S. C., Yuan, Y., Yuan, Z. Y., Yue, C. X., Zafar, A. A., Zeng, F. R., Zeng, S. H., Zeng, X., Zeng, Y., Zeng, Y. J., Zhai, X. Y., Zhai, Y. C., Zhan, Y. H., Zhang, A. Q., Zhang, B. L., Zhang, B. X., Zhang, D. H., Zhang, G. Y., Zhang, H., Zhang, H. C., Zhang, H. H., Zhang, H. Q., Zhang, H. R., Zhang, H. Y., Zhang, J., Zhang, J. J., Zhang, J. L., Zhang, J. Q., Zhang, J. S., Zhang, J. W., Zhang, J. X., Zhang, J. Y., Zhang, J. Z., Zhang, Jianyu, Zhang, L. M., Zhang, Lei, Zhang, P., Zhang, Q. Y., Zhang, R. Y., Zhang, S. H., Zhang, Shulei, Zhang, X. M., Zhang, X. Y, Zhang, X. Y., Zhang, Y., Zhang, Y. T., Zhang, Y. H., Zhang, Y. M., Zhang, Yan, Zhang, Z. D., Zhang, Z. H., Zhang, Z. L., Zhang, Z. Y., Zhang, Z. Z., Zhao, G., Zhao, J. Y., Zhao, J. Z., Zhao, L., Zhao, Lei, Zhao, M. G., Zhao, N., Zhao, R. P., Zhao, S. J., Zhao, Y. B., Zhao, Y. X., Zhao, Z. G., Zhemchugov, A., Zheng, B., Zheng, B. M., Zheng, J. P., Zheng, W. J., Zheng, Y. H., Zhong, B., Zhong, X., Zhou, H., Zhou, J. Y., Zhou, L. P., Zhou, S., Zhou, X., Zhou, X. K., Zhou, X. R., Zhou, X. Y., Zhou, Y. Z., Zhou, Z. C., Zhu, A. N., Zhu, J., Zhu, K., Zhu, K. J., Zhu, K. S., Zhu, L., Zhu, L. X., Zhu, S. H., Zhu, T. J., Zhu, W. D., Zhu, Y. C., Zhu, Z. A., Zou, J. H., and Zu, J.
- Subjects
High Energy Physics - Experiment - Abstract
Using 7.93 ${\rm fb^{-1}}$ of $e^+e^-$ collision data collected at a center-of-mass energy of 3.773 ${\rm GeV}$ with the BESIII detector, we present an analysis of the decay $D^{0} \to \eta \pi^- e^+ \nu_{e}$. The branching fraction of the decay $D^{0} \to a_{0}(980)^{-} e^+ \nu_{e}$ with $a_{0}(980)^{-} \to \eta \pi^{-}$ is measured to be $(0.86\pm0.17_{\text{stat}}\pm0.05_{\text{syst}})\times 10^{-4}$. The decay dynamics of this process is studied with a single-pole parameterization of the hadronic form factor and the Flatt\'e formula describing the $a_0(980)$ line shape in the differential decay rate. The product of the form factor $f^{ a_0}_{+}(0)$ and the Cabibbo-Kobayashi-Maskawa matrix element $|V_{cd}|$ is determined for the first time with the result $f^{ a_0}_+(0)|V_{cd}|=0.126\pm0.013_{\rm stat}\pm0.003_{\rm syst}$.
- Published
- 2024
16. Emergence of steady quantum transport in a superconducting processor
- Author
-
Zhang, Pengfei, Gao, Yu, Xu, Xiansong, Wang, Ning, Dong, Hang, Guo, Chu, Deng, Jinfeng, Zhang, Xu, Chen, Jiachen, Xu, Shibo, Wang, Ke, Wu, Yaozu, Zhang, Chuanyu, Jin, Feitong, Zhu, Xuhao, Zhang, Aosai, Zou, Yiren, Tan, Ziqi, Cui, Zhengyi, Zhu, Zitian, Shen, Fanhao, Li, Tingting, Zhong, Jiarun, Bao, Zehang, Zhao, Liangtian, Hao, Jie, Li, Hekang, Wang, Zhen, Song, Chao, Guo, Qiujiang, Wang, H., and Poletti, Dario
- Subjects
Quantum Physics ,Condensed Matter - Mesoscale and Nanoscale Physics ,Condensed Matter - Statistical Mechanics - Abstract
Non-equilibrium quantum transport is crucial to technological advances ranging from nanoelectronics to thermal management. In essence, it deals with the coherent transfer of energy and (quasi-)particles through quantum channels between thermodynamic baths. A complete understanding of quantum transport thus requires the ability to simulate and probe macroscopic and microscopic physics on equal footing. Using a superconducting quantum processor, we demonstrate the emergence of non-equilibrium steady quantum transport by emulating the baths with qubit ladders and realising steady particle currents between the baths. We experimentally show that the currents are independent of the microscopic details of bath initialisation, and their temporal fluctuations decrease rapidly with the size of the baths, emulating those predicted by thermodynamic baths. The above characteristics are experimental evidence of pure-state statistical mechanics and prethermalisation in non-equilibrium many-body quantum systems. Furthermore, by utilising precise controls and measurements with single-site resolution, we demonstrate the capability to tune steady currents by manipulating the macroscopic properties of the baths, including filling and spectral properties. Our investigation paves the way for a new generation of experimental exploration of non-equilibrium quantum transport in strongly correlated quantum matter., Comment: 7 pages, 4 figures
- Published
- 2024
17. Quantum Homotopy Analysis Method with Secondary Linearization for Nonlinear Partial Differential Equations
- Author
-
Xue, Cheng, Xu, Xiao-Fan, Zhuang, Xi-Ning, Sun, Tai-Ping, Wang, Yun-Jie, Tan, Ming-Yang, Ye, Chuang-Chao, Liu, Huan-Yu, Wu, Yu-Chun, Chen, Zhao-Yun, and Guo, Guo-Ping
- Subjects
Quantum Physics - Abstract
Nonlinear partial differential equations (PDEs) are crucial for modeling complex fluid dynamics and are foundational to many computational fluid dynamics (CFD) applications. However, solving these nonlinear PDEs is challenging due to the vast computational resources they demand, highlighting the pressing need for more efficient computational methods. Quantum computing offers a promising but technically challenging approach to solving nonlinear PDEs. Recently, Liao proposed a framework that leverages quantum computing to accelerate the solution of nonlinear PDEs based on the homotopy analysis method (HAM), a semi-analytical technique that transforms nonlinear PDEs into a series of linear PDEs. However, the no-cloning theorem in quantum computing poses a major limitation, where directly applying quantum simulation to each HAM step results in exponential complexity growth with the HAM truncation order. This study introduces a "secondary linearization" approach that maps the whole HAM process into a system of linear PDEs, allowing for a one-time solution using established quantum PDE solvers. Our method preserves the exponential speedup of quantum linear PDE solvers while ensuring that computational complexity increases only polynomially with the HAM truncation order. We demonstrate the efficacy of our approach by applying it to the Burgers' equation and the Korteweg-de Vries (KdV) equation. Our approach provides a novel pathway for transforming nonlinear PDEs into linear PDEs, with potential applications to fluid dynamics. This work thus lays the foundation for developing quantum algorithms capable of solving the Navier-Stokes equations, ultimately offering a promising route to accelerate their solutions using quantum computing., Comment: 22 pages, 4 figures
- Published
- 2024
18. Movable Antenna-Aided Federated Learning with Over-the-Air Aggregation: Joint Optimization of Positioning, Beamforming, and User Selection
- Author
-
Zhao, Yang, Xiu, Yue, Xu, Minrui, Wang, Ping, and Wei, Ning
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
Federated learning (FL) in wireless computing effectively utilizes communication bandwidth, yet it is vulnerable to errors during the analog aggregation process. While removing users with unfavorable channel conditions can mitigate these errors, it also reduces the available local training data for FL, which in turn hinders the convergence rate of the training process. To tackle this issue, we propose the use of movable antenna (MA) techniques to enhance the degrees of freedom within the channel space, ultimately boosting the convergence speed of FL training. Moreover, we develop a coordinated approach for uplink receiver beamforming, user selection, and MA positioning to optimize the convergence rate of wireless FL training in dynamic wireless environments. This stochastic optimization challenge is reformulated into a mixed-integer programming problem by utilizing the training loss upper bound. We then introduce a penalty dual decomposition (PDD) method to solve the mixed-integer mixed programming problem. Experimental results indicate that incorporating MA techniques significantly accelerates the training convergence of FL and greatly surpasses conventional methods.
- Published
- 2024
19. Non-isometry, State-Dependence and Holography
- Author
-
Antonini, Stefano, Balasubramanian, Vijay, Bao, Ning, Cao, ChunJun, and Chemissany, Wissam
- Subjects
High Energy Physics - Theory ,Quantum Physics - Abstract
We establish an equivalence between non-isometry of quantum codes and state-dependence of operator reconstruction, and discuss implications of this equivalence for holographic duality. Specifically, we define quantitative measures of non-isometry and state-dependence and describe bounds relating these quantities. In the context of holography we show that, assuming known gravitational path integral results for overlaps between semiclassical states, non-isometric bulk-to-boundary maps with a trivial kernel are approximately isometric and bulk reconstruction approximately state-independent. In contrast, non-isometric maps with a non-empty kernel always lead to state-dependent reconstruction. We also show that if a global bulk-to-boundary map is non-isometric, then there exists a region in the bulk which is causally disconnected from the boundary. Finally, we conjecture that, under certain physical assumptions for the definition of the Hilbert space of effective field theory in AdS space, the presence of a global horizon implies a non-isometric global bulk-to-boundary map., Comment: 35 pages, 1 figure + Appendices
- Published
- 2024
20. Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving
- Author
-
Yu, Botao, Baker, Frazier N., Chen, Ziru, Herb, Garrett, Gou, Boyu, Adu-Ampratwum, Daniel, Ning, Xia, and Sun, Huan
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Computational Engineering, Finance, and Science - Abstract
To enhance large language models (LLMs) for chemistry problem solving, several LLM-based agents augmented with tools have been proposed, such as ChemCrow and Coscientist. However, their evaluations are narrow in scope, leaving a large gap in understanding the benefits of tools across diverse chemistry tasks. To bridge this gap, we develop ChemAgent, an enhanced chemistry agent over ChemCrow, and conduct a comprehensive evaluation of its performance on both specialized chemistry tasks and general chemistry questions. Surprisingly, ChemAgent does not consistently outperform its base LLMs without tools. Our error analysis with a chemistry expert suggests that: For specialized chemistry tasks, such as synthesis prediction, we should augment agents with specialized tools; however, for general chemistry questions like those in exams, agents' ability to reason correctly with chemistry knowledge matters more, and tool augmentation does not always help.
- Published
- 2024
21. A Systematic Search for Candidate Supermassive Black Hole Binaries Using Periodic Mid-Infrared Light Curves of Active Galactic Nuclei
- Author
-
Luo, Di, Jiang, Ning, and Liu, Xin
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
Periodic variability in active galactic nuclei (AGNs) is a promising method for studying sub-parsec supermassive black hole binaries (SMBHBs), which are a challenging detection target. While extensive searches have been made in the optical, X-ray and gamma-ray bands, systematic infrared (IR) studies remain limited. Using data from the Wide-field Infrared Survey Explorer (WISE), which provides unique decade-long mid-IR light curves with a six-month cadence, we have conducted the first systematic search for SMBHB candidates based on IR periodicity. Analyzing a parent sample of 48,932 objects selected from about half a million AGNs, we have identified 28 candidate periodic AGNs with periods ranging from 1,268 to 2,437 days (in the observer frame) by fitting their WISE light curves with sinusoidal functions. However, our mock simulation of the parent sample indicates that stochastic variability can actually produce a similar number of periodic sources, underscoring the difficulty in robustly identifying real periodic signals with WISE light curves, given their current sampling. Notably, we found no overlap between our sample and optical periodic sources, which can be explained by a distinct preference for certain periods due to selection bias. By combining archived data from different surveys, we have identified SDSS J140336.43+174136.1 as a candidate exhibiting periodic behavior in both optical and IR bands, a phenomenon that warrants further validation through observational tests. Our results highlight the potential of IR time-domain surveys, including future missions such as the Nancy Grace-Roman Space Telescope, for identifying periodic AGNs, but complementary tests are still needed to determine their physical origins such as SMBHBs., Comment: Accepted for publication in ApJ. 24 pages
- Published
- 2024
22. Thermal Broadening of Phonon Spectral Function in Classical Lattice Models: Projective Truncation Approximation
- Author
-
Jia, Hu-Wei, Liu, Wen-Jun, Wu, Yue-Hong, Ma, Kou-Han, Wang, Lei, and Tong, Ning-Hua
- Subjects
Condensed Matter - Strongly Correlated Electrons - Abstract
Thermal broadening of the quasi-particle peak in the spectral function is an important physical feature in many statistical systems, but difficult to calculate. Within the projective truncation approximation (PTA) of Green's function equation of motion for classical systems, we produce the spectral function with thermal broadened quasi-particles peak using an $H$-expanded basis. We demonstrate this method on two model systems, the one-variable anharmonic oscillator model and the one-dimensional classical $\phi^4$ lattice model. Comparison with exact spectral function and the molecular dynamics simulation results shows that the method is semi-quantitatively accurate. Extension of the $H$-expanded basis to PTA for quantum system is possible., Comment: 20 pages, 14 figures
- Published
- 2024
23. Zero-Shot NAS via the Suppression of Local Entropy Decrease
- Author
-
Wu, Ning, Huang, Han, Xu, Yueting, and Hao, Zhifeng
- Subjects
Computer Science - Machine Learning ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Neural and Evolutionary Computing - Abstract
Architecture performance evaluation is the most time-consuming part of neural architecture search (NAS). Zero-Shot NAS accelerates the evaluation by utilizing zero-cost proxies instead of training. Though effective, existing zero-cost proxies require invoking backpropagations or running networks on input data, making it difficult to further accelerate the computation of proxies. To alleviate this issue, architecture topologies are used to evaluate the performance of networks in this study. We prove that particular architectural topologies decrease the local entropy of feature maps, which degrades specific features to a bias, thereby reducing network performance. Based on this proof, architectural topologies are utilized to quantify the suppression of local entropy decrease (SED) as a data-free and running-free proxy. Experimental results show that SED outperforms most state-of-the-art proxies in terms of architecture selection on five benchmarks, with computation time reduced by three orders of magnitude. We further compare the SED-based NAS with state-of-the-art proxies. SED-based NAS selects the architecture with higher accuracy and fewer parameters in only one second. The theoretical analyses of local entropy and experimental results demonstrate that the suppression of local entropy decrease facilitates selecting optimal architectures in Zero-Shot NAS., Comment: 8 pages, 2 figures. Corrected typos and latex template
- Published
- 2024
24. Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing
- Author
-
Lu, Kaixuan, Zhang, Ruiqian, Huang, Xiao, Xie, Yuxing, Ning, Xiaogang, Zhang, Hanchao, Yuan, Mengke, Zhang, Pan, Wang, Tao, and Liao, Tongkui
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Recent self-supervised learning (SSL) methods have demonstrated impressive results in learning visual representations from unlabeled remote sensing images. However, most remote sensing images predominantly consist of scenographic scenes containing multiple ground objects without explicit foreground targets, which limits the performance of existing SSL methods that focus on foreground targets. This raises the question: Is there a method that can automatically aggregate similar objects within scenographic remote sensing images, thereby enabling models to differentiate knowledge embedded in various geospatial patterns for improved feature representation? In this work, we present the Pattern Integration and Enhancement Vision Transformer (PIEViT), a novel self-supervised learning framework designed specifically for remote sensing imagery. PIEViT utilizes a teacher-student architecture to address both image-level and patch-level tasks. It employs the Geospatial Pattern Cohesion (GPC) module to explore the natural clustering of patches, enhancing the differentiation of individual features. The Feature Integration Projection (FIP) module further refines masked token reconstruction using geospatially clustered patches. We validated PIEViT across multiple downstream tasks, including object detection, semantic segmentation, and change detection. Experiments demonstrated that PIEViT enhances the representation of internal patch features, providing significant improvements over existing self-supervised baselines. It achieves excellent results in object detection, land cover classification, and change detection, underscoring its robustness, generalization, and transferability for remote sensing image interpretation tasks.
- Published
- 2024
25. Strong-to-weak Symmetry Breaking and Entanglement Transitions
- Author
-
Chen, Langxuan, Sun, Ning, and Zhang, Pengfei
- Subjects
Quantum Physics ,Condensed Matter - Statistical Mechanics ,Condensed Matter - Strongly Correlated Electrons - Abstract
When interacting with an environment, the entanglement within quantum many-body systems is rapidly transferred to the entanglement between the system and the bath. For systems with a large local Hilbert space dimension, this leads to a first-order entanglement transition for the reduced density matrix of the system. On the other hand, recent studies have introduced a new paradigm for classifying density matrices, with particular focus on scenarios where a strongly symmetric density matrix undergoes spontaneous symmetry breaking to a weak symmetry phase. This is typically characterized by a finite R\'enyi-2 correlator or a finite Wightman correlator. In this work, we study the entanglement transition from the perspective of strong-to-weak symmetry breaking, using solvable complex Brownian SYK models. We perform analytical calculations for both the early-time and late-time saddles. The results show that while the R\'enyi-2 correlator indicates a transition from symmetric to symmetry-broken phase, the Wightman correlator becomes finite even in the early-time saddle due to the single-replica limit, demonstrating that the first-order transition occurs between a near-symmetric phase and a deeply symmetry-broken phase in the sense of Wightman correlator. Our results provide a novel viewpoint on the entanglement transition under symmetry constraints and can be readily generalized to systems with repeated measurements., Comment: 7 pages, 1 figure
- Published
- 2024
26. Content-Adaptive Rate-Quality Curve Prediction Model in Media Processing System
- Author
-
Yin, Shibo, Zhang, Zhiyu, Ning, Peirong, Chen, Qiubo, Chen, Jing, Zhou, Quan, and Song, Li
- Subjects
Computer Science - Multimedia - Abstract
In streaming media services, video transcoding is a common practice to alleviate bandwidth demands. Unfortunately, traditional methods employing a uniform rate factor (RF) across all videos often result in significant inefficiencies. Content-adaptive encoding (CAE) techniques address this by dynamically adjusting encoding parameters based on video content characteristics. However, existing CAE methods are often tightly coupled with specific encoding strategies, leading to inflexibility. In this paper, we propose a model that predicts both RF-quality and RF-bitrate curves, which can be utilized to derive a comprehensive bitrate-quality curve. This approach facilitates flexible adjustments to the encoding strategy without necessitating model retraining. The model leverages codec features, content features, and anchor features to predict the bitrate-quality curve accurately. Additionally, we introduce an anchor suspension method to enhance prediction accuracy. Experiments confirm that the actual quality metric (VMAF) of the compressed video stays within 1 of the target, achieving an accuracy of 99.14%. By incorporating our quality improvement strategy with the rate-quality curve prediction model, we conducted online A/B tests, obtaining both +0.107% improvements in video views and video completions and +0.064% app duration time. Our model has been deployed on the Xiaohongshu App., Comment: Accepted by IEEE VCIP 2024 (Oral)
- Published
- 2024
27. Integrated Location Sensing and Communication for Ultra-Massive MIMO With Hybrid-Field Beam-Squint Effect
- Author
-
Gao, Zhen, Zhou, Xingyu, Ning, Boyu, Su, Yu, Qin, Tong, and Niyato, Dusit
- Subjects
Electrical Engineering and Systems Science - Signal Processing ,Computer Science - Information Theory - Abstract
The advent of ultra-massive multiple-input-multiple output systems holds great promise for next-generation communications, yet their channels exhibit hybrid far- and near- field beam-squint (HFBS) effect. In this paper, we not only overcome but also harness the HFBS effect to propose an integrated location sensing and communication (ILSC) framework. During the uplink training stage, user terminals (UTs) transmit reference signals for simultaneous channel estimation and location sensing. This stage leverages an elaborately designed hybrid-field projection matrix to overcome the HFBS effect and estimate the channel in compressive manner. Subsequently, the scatterers' locations can be sensed from the spherical wavefront based on the channel estimation results. By treating the sensed scatterers as virtual anchors, we employ a weighted least-squares approach to derive UT' s location. Moreover, we propose an iterative refinement mechanism, which utilizes the accurately estimated time difference of arrival of multipath components to enhance location sensing precision. In the following downlink data transmission stage, we leverage the acquired location information to further optimize the hybrid beamformer, which combines the beam broadening and focusing to mitigate the spectral efficiency degradation resulted from the HFBS effect. Extensive simulation experiments demonstrate that the proposed ILSC scheme has superior location sensing and communication performance than conventional methods., Comment: This paper has been accepted by IEEE JSAC
- Published
- 2024
28. Decoding Report Generators: A Cyclic Vision-Language Adapter for Counterfactual Explanations
- Author
-
Fang, Yingying, Jin, Zihao, Guo, Shaojie, Liu, Jinda, Gao, Yijian, Ning, Junzhi, Yue, Zhiling, Li, Zhi, Walsh, Simon LF, and Yang, Guang
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Despite significant advancements in report generation methods, a critical limitation remains: the lack of interpretability in the generated text. This paper introduces an innovative approach to enhance the explainability of text generated by report generation models. Our method employs cyclic text manipulation and visual comparison to identify and elucidate the features in the original content that influence the generated text. By manipulating the generated reports and producing corresponding images, we create a comparative framework that highlights key attributes and their impact on the text generation process. This approach not only identifies the image features aligned to the generated text but also improves transparency but also provides deeper insights into the decision-making mechanisms of the report generation models. Our findings demonstrate the potential of this method to significantly enhance the interpretability and transparency of AI-generated reports.
- Published
- 2024
29. Audiobox TTA-RAG: Improving Zero-Shot and Few-Shot Text-To-Audio with Retrieval-Augmented Generation
- Author
-
Yang, Mu, Shi, Bowen, Le, Matthew, Hsu, Wei-Ning, and Tjandra, Andros
- Subjects
Electrical Engineering and Systems Science - Audio and Speech Processing ,Computer Science - Sound - Abstract
Current leading Text-To-Audio (TTA) generation models suffer from degraded performance on zero-shot and few-shot settings. It is often challenging to generate high-quality audio for audio events that are unseen or uncommon in the training set. Inspired by the success of Retrieval-Augmented Generation (RAG) in Large Language Model (LLM)-based knowledge-intensive tasks, we extend the TTA process with additional conditioning contexts. We propose Audiobox TTA-RAG, a novel retrieval-augmented TTA approach based on Audiobox, a conditional flow-matching audio generation model. Unlike the vanilla Audiobox TTA solution which generates audio conditioned on text, we augmented the conditioning input with retrieved audio samples that provide additional acoustic information to generate the target audio. Our retrieval method does not require the external database to have labeled audio, offering more practical use cases. To evaluate our proposed method, we curated test sets in zero-shot and few-shot settings. Our empirical results show that the proposed model can effectively leverage the retrieved audio samples and significantly improve zero-shot and few-shot TTA performance, with large margins on multiple evaluation metrics, while maintaining the ability to generate semantically aligned audio for the in-domain setting. In addition, we investigate the effect of different retrieval methods and data sources.
- Published
- 2024
30. Innovative Weight Simulation in Virtual Reality Cube Games: A Pseudo-Haptic Approach
- Author
-
Lim, Woan Ning, Leong, Edric Yi Junn, Lee, Yun Li, and Yap, Kian Meng
- Subjects
Computer Science - Human-Computer Interaction - Abstract
This paper presents an innovative pseudo-haptic model for weight simulation in virtual reality (VR) environments. By integrating visual feedback with voluntary exerted force through a passive haptic glove, the model creates haptic illusions of weight perception. Two VR cube games were developed to evaluate the model's effectiveness. The first game assesses participants' ability to discriminate relative weights, while the second evaluates their capability to estimate absolute weights. Twelve participants, aged 18 to 59, tested the games. Results suggest that the pseudo-haptic model is effective for relative weight discrimination tasks and holds potential for various VR applications. Further research with a larger participant group and more complex scenarios is recommended to refine and validate the model., Comment: Part of proceedings of 6th International Conference AsiaHaptics 2024
- Published
- 2024
31. Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
- Author
-
Liang, Weixin, Yu, Lili, Luo, Liang, Iyer, Srinivasan, Dong, Ning, Zhou, Chunting, Ghosh, Gargi, Lewis, Mike, Yih, Wen-tau, Zettlemoyer, Luke, and Lin, Xi Victoria
- Subjects
Computer Science - Computation and Language - Abstract
The development of large language models (LLMs) has expanded to multi-modal systems capable of processing text, images, and speech within a unified framework. Training these models demands significantly larger datasets and computational resources compared to text-only LLMs. To address the scaling challenges, we introduce Mixture-of-Transformers (MoT), a sparse multi-modal transformer architecture that significantly reduces pretraining computational costs. MoT decouples non-embedding parameters of the model by modality -- including feed-forward networks, attention matrices, and layer normalization -- enabling modality-specific processing with global self-attention over the full input sequence. We evaluate MoT across multiple settings and model scales. In the Chameleon 7B setting (autoregressive text-and-image generation), MoT matches the dense baseline's performance using only 55.8\% of the FLOPs. When extended to include speech, MoT reaches speech performance comparable to the dense baseline with only 37.2\% of the FLOPs. In the Transfusion setting, where text and image are trained with different objectives, a 7B MoT model matches the image modality performance of the dense baseline with one third of the FLOPs, and a 760M MoT model outperforms a 1.4B dense baseline across key image generation metrics. System profiling further highlights MoT's practical benefits, achieving dense baseline image quality in 47.2\% of the wall-clock time and text quality in 75.6\% of the wall-clock time (measured on AWS p4de.24xlarge instances with NVIDIA A100 GPUs).
- Published
- 2024
32. Search for Axions from Magnetic White Dwarfs with Chandra
- Author
-
Ning, Orion, Dessert, Christopher, Hong, Vi, and Safdi, Benjamin R.
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,Astrophysics - Solar and Stellar Astrophysics ,High Energy Physics - Phenomenology - Abstract
Low mass axion-like particles could be produced in abundance within the cores of hot, compact magnetic white dwarf (MWD) stars from electron bremsstrahlung and converted to detectable X-rays in the strong magnetic fields surrounding these systems. In this work, we constrain the existence of such axions from two dedicated Chandra X-ray observations of $\sim$40 ks each in the energy range $\sim$1 - 10 keV towards the magnetic white dwarfs (MWDs) WD 1859+148 and PG 0945+246. We find no evidence for axions, which constrains the axion-electron times axion-photon coupling to $|g_{a\gamma \gamma} g_{aee}| \lesssim 1.54 \times 10^{-25}$ ($3.54 \times 10^{-25}$) GeV$^{-1}$ for PG 0945+246 (WD 1859+148) at 95% confidence for axion masses $m_a \lesssim 10^{-6}$ eV. We find an excess of low-energy X-rays between 1 - 3 keV for WD 1859+148 but determine that the spectral morphology is too soft to arise from axions; instead, the soft X-rays may arise from non-thermal emission in the MWD magnetosphere., Comment: 10 pages, 14 figures, video abstract at https://youtu.be/yfj8JoC1jKQ
- Published
- 2024
33. Automating Exploratory Proteomics Research via Language Models
- Author
-
Ding, Ning, Qu, Shang, Xie, Linhai, Li, Yifei, Liu, Zaoqu, Zhang, Kaiyan, Xiong, Yibai, Zuo, Yuxin, Chen, Zhangren, Hua, Ermo, Lv, Xingtai, Sun, Youbang, Li, Yang, Li, Dong, He, Fuchu, and Zhou, Bowen
- Subjects
Computer Science - Artificial Intelligence ,Quantitative Biology - Quantitative Methods - Abstract
With the development of artificial intelligence, its contribution to science is evolving from simulating a complex problem to automating entire research processes and producing novel discoveries. Achieving this advancement requires both specialized general models grounded in real-world scientific data and iterative, exploratory frameworks that mirror human scientific methodologies. In this paper, we present PROTEUS, a fully automated system for scientific discovery from raw proteomics data. PROTEUS uses large language models (LLMs) to perform hierarchical planning, execute specialized bioinformatics tools, and iteratively refine analysis workflows to generate high-quality scientific hypotheses. The system takes proteomics datasets as input and produces a comprehensive set of research objectives, analysis results, and novel biological hypotheses without human intervention. We evaluated PROTEUS on 12 proteomics datasets collected from various biological samples (e.g. immune cells, tumors) and different sample types (single-cell and bulk), generating 191 scientific hypotheses. These were assessed using both automatic LLM-based scoring on 5 metrics and detailed reviews from human experts. Results demonstrate that PROTEUS consistently produces reliable, logically coherent results that align well with existing literature while also proposing novel, evaluable hypotheses. The system's flexible architecture facilitates seamless integration of diverse analysis tools and adaptation to different proteomics data types. By automating complex proteomics analysis workflows and hypothesis generation, PROTEUS has the potential to considerably accelerate the pace of scientific discovery in proteomics research, enabling researchers to efficiently explore large-scale datasets and uncover biological insights.
- Published
- 2024
34. PACE: Pacing Operator Learning to Accurate Optical Field Simulation for Complicated Photonic Devices
- Author
-
Zhu, Hanqing, Cong, Wenyan, Chen, Guojin, Ning, Shupeng, Chen, Ray T., Gu, Jiaqi, and Pan, David Z.
- Subjects
Computer Science - Machine Learning ,Physics - Optics - Abstract
Electromagnetic field simulation is central to designing, optimizing, and validating photonic devices and circuits. However, costly computation associated with numerical simulation poses a significant bottleneck, hindering scalability and turnaround time in the photonic circuit design process. Neural operators offer a promising alternative, but existing SOTA approaches, NeurOLight, struggle with predicting high-fidelity fields for real-world complicated photonic devices, with the best reported 0.38 normalized mean absolute error in NeurOLight. The inter-plays of highly complex light-matter interaction, e.g., scattering and resonance, sensitivity to local structure details, non-uniform learning complexity for full-domain simulation, and rich frequency information, contribute to the failure of existing neural PDE solvers. In this work, we boost the prediction fidelity to an unprecedented level for simulating complex photonic devices with a novel operator design driven by the above challenges. We propose a novel cross-axis factorized PACE operator with a strong long-distance modeling capacity to connect the full-domain complex field pattern with local device structures. Inspired by human learning, we further divide and conquer the simulation task for extremely hard cases into two progressively easy tasks, with a first-stage model learning an initial solution refined by a second model. On various complicated photonic device benchmarks, we demonstrate one sole PACE model is capable of achieving 73% lower error with 50% fewer parameters compared with various recent ML for PDE solvers. The two-stage setup further advances high-fidelity simulation for even more intricate cases. In terms of runtime, PACE demonstrates 154-577x and 11.8-12x simulation speedup over numerical solver using scipy or highly-optimized pardiso solver, respectively. We open sourced the code and dataset., Comment: Accepeted by Neurips 2024, 21 pages
- Published
- 2024
35. GIS Copilot: Towards an Autonomous GIS Agent for Spatial Analysis
- Author
-
Akinboyewa, Temitope, Li, Zhenlong, Ning, Huan, and Lessani, M. Naser
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Emerging Technologies ,Computer Science - Human-Computer Interaction ,Computer Science - Software Engineering - Abstract
Recent advancements in Generative AI offer promising capabilities for spatial analysis. Despite their potential, the integration of generative AI with established GIS platforms remains underexplored. In this study, we propose a framework for integrating LLMs directly into existing GIS platforms, using QGIS as an example. Our approach leverages the reasoning and programming capabilities of LLMs to autonomously generate spatial analysis workflows and code through an informed agent that has comprehensive documentation of key GIS tools and parameters. The implementation of this framework resulted in the development of a "GIS Copilot" that allows GIS users to interact with QGIS using natural language commands for spatial analysis. The GIS Copilot was evaluated with over 100 spatial analysis tasks with three complexity levels: basic tasks that require one GIS tool and typically involve one data layer to perform simple operations; intermediate tasks involving multi-step processes with multiple tools, guided by user instructions; and advanced tasks which involve multi-step processes that require multiple tools but not guided by user instructions, necessitating the agent to independently decide on and executes the necessary steps. The evaluation reveals that the GIS Copilot demonstrates strong potential in automating foundational GIS operations, with a high success rate in tool selection and code generation for basic and intermediate tasks, while challenges remain in achieving full autonomy for more complex tasks. This study contributes to the emerging vision of Autonomous GIS, providing a pathway for non-experts to engage with geospatial analysis with minimal prior expertise. While full autonomy is yet to be achieved, the GIS Copilot demonstrates significant potential for simplifying GIS workflows and enhancing decision-making processes.
- Published
- 2024
36. Graph-DPEP: Decomposed Plug and Ensemble Play for Few-Shot Document Relation Extraction with Graph-of-Thoughts Reasoning
- Author
-
Zhang, Tao, Yan, Ning, Mortazavi, Masood, Nguyen, Hoang H., Deng, Zhongfen, and Yu, Philip S.
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Information Retrieval - Abstract
Large language models (LLMs) pre-trained on massive corpora have demonstrated impressive few-shot learning capability on many NLP tasks. Recasting an NLP task into a text-to-text generation task is a common practice so that generative LLMs can be prompted to resolve it. However, performing document-level relation extraction (DocRE) tasks with generative LLM models is still challenging due to the structured output format of DocRE, which complicates the conversion to plain text. Limited information available in few-shot samples and prompt instructions induce further difficulties and challenges in relation extraction for mentioned entities in a document. In this paper, we represent the structured output as a graph-style triplet rather than natural language expressions and leverage generative LLMs for the DocRE task. Our approach, the Graph-DPEP framework is grounded in the reasoning behind triplet explanation thoughts presented in natural language. In this framework, we first introduce a ``decomposed-plug" method for performing the generation from LLMs over prompts with type-space decomposition to alleviate the burden of distinguishing all relation types. Second, we employ a verifier for calibrating the generation and identifying overlooked query entity pairs. Third, we develop "ensemble-play", reapplying generation on the entire type list by leveraging the reasoning thoughts embedded in a sub-graph associated with the missing query pair to address the missingness issue. Through extensive comparisons with existing prompt techniques and alternative Language Models (LLMs), our framework demonstrates superior performance on publicly available benchmarks in experiments.
- Published
- 2024
37. Sparse Orthogonal Parameters Tuning for Continual Learning
- Author
-
Ning, Kun-Peng, Ke, Hai-Jian, Liu, Yu-Yang, Yao, Jia-Yu, Tian, Yong-Hong, and Yuan, Li
- Subjects
Computer Science - Machine Learning - Abstract
Continual learning methods based on pre-trained models (PTM) have recently gained attention which adapt to successive downstream tasks without catastrophic forgetting. These methods typically refrain from updating the pre-trained parameters and instead employ additional adapters, prompts, and classifiers. In this paper, we from a novel perspective investigate the benefit of sparse orthogonal parameters for continual learning. We found that merging sparse orthogonality of models learned from multiple streaming tasks has great potential in addressing catastrophic forgetting. Leveraging this insight, we propose a novel yet effective method called SoTU (Sparse Orthogonal Parameters TUning). We hypothesize that the effectiveness of SoTU lies in the transformation of knowledge learned from multiple domains into the fusion of orthogonal delta parameters. Experimental evaluations on diverse CL benchmarks demonstrate the effectiveness of the proposed approach. Notably, SoTU achieves optimal feature representation for streaming data without necessitating complex classifier designs, making it a Plug-and-Play solution.
- Published
- 2024
38. QCG-Rerank: Chunks Graph Rerank with Query Expansion in Retrieval-Augmented LLMs for Tourism Domain
- Author
-
Wei, Qikai, Yang, Mingzhi, Han, Chunlong, Wei, Jingfu, Zhang, Minghao, Shi, Feifei, and Ning, Huansheng
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Retrieval-Augmented Generation (RAG) mitigates the issue of hallucination in Large Language Models (LLMs) by integrating information retrieval techniques. However, in the tourism domain, since the query is usually brief and the content in the database is diverse, existing RAG may contain a significant amount of irrelevant or contradictory information contents after retrieval. To address this challenge, we propose the QCG-Rerank model. This model first performs an initial retrieval to obtain candidate chunks and then enhances semantics by extracting critical information to expand the original query. Next, we utilize the expanded query and candidate chunks to calculate similarity scores as the initial transition probability and construct the chunks graph. Subsequently, We iteratively compute the transition probabilities based on an initial estimate until convergence. The chunks with the highest score are selected and input into the LLMs to generate responses. We evaluate the model on Cultour, IIRC, StrategyQA, HotpotQA, SQuAD, and MuSiQue datasets. The experimental results demonstrate the effectiveness and superiority of the QCG-Rerank method.
- Published
- 2024
39. Enhancing Indoor Mobility with Connected Sensor Nodes: A Real-Time, Delay-Aware Cooperative Perception Approach
- Author
-
Ning, Minghao, Cui, Yaodong, Yang, Yufeng, Huang, Shucheng, Liu, Zhenan, Alghooneh, Ahmad Reza, Hashemi, Ehsan, and Khajepour, Amir
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Robotics - Abstract
This paper presents a novel real-time, delay-aware cooperative perception system designed for intelligent mobility platforms operating in dynamic indoor environments. The system contains a network of multi-modal sensor nodes and a central node that collectively provide perception services to mobility platforms. The proposed Hierarchical Clustering Considering the Scanning Pattern and Ground Contacting Feature based Lidar Camera Fusion improve intra-node perception for crowded environment. The system also features delay-aware global perception to synchronize and aggregate data across nodes. To validate our approach, we introduced the Indoor Pedestrian Tracking dataset, compiled from data captured by two indoor sensor nodes. Our experiments, compared to baselines, demonstrate significant improvements in detection accuracy and robustness against delays. The dataset is available in the repository: https://github.com/NingMingHao/MVSLab-IndoorCooperativePerception
- Published
- 2024
40. Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention
- Author
-
Lv, Xingtai, Ding, Ning, Zhang, Kaiyan, Hua, Ermo, Cui, Ganqu, and Zhou, Bowen
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Improving the effectiveness and efficiency of large language models (LLMs) simultaneously is a critical yet challenging research goal. In this paper, we find that low-rank pre-training, normally considered as efficient methods that will compromise performance, can be scalably effective when reduced parameters are precisely targeted. Specifically, applying the low-dimensional module only to the attention layer -- resolves this issue and enhances both effectiveness and efficiency. We refer to this structure as Low-dimensional Projected Attention (LPA) and provide an explanatory analysis. Through extensive experimentation at parameter scales of 130M, 370M, and scaling up to 3B, we have validated the effectiveness and scalability of LPA. Our results show that LPA model can save up to 12.4% in time while achieving an approximate 5% improvement in test perplexity (ppl) and on downstream tasks compared with the vanilla Transformer., Comment: Accepted to EMNLP 2024 (Main Conference)
- Published
- 2024
41. Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation
- Author
-
Li, Yan, Guo, Weiwei, Yang, Xue, Liao, Ning, Zhang, Shaofeng, Yu, Yi, Yu, Wenxian, and Yan, Junchi
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
In recent years, aerial object detection has been increasingly pivotal in various earth observation applications. However, current algorithms are limited to detecting a set of pre-defined object categories, demanding sufficient annotated training samples, and fail to detect novel object categories. In this paper, we put forth a novel formulation of the aerial object detection problem, namely open-vocabulary aerial object detection (OVAD), which can detect objects beyond training categories without costly collecting new labeled data. We propose CastDet, a CLIP-activated student-teacher detection framework that serves as the first OVAD detector specifically designed for the challenging aerial scenario, where objects often exhibit weak appearance features and arbitrary orientations. Our framework integrates a robust localization teacher along with several box selection strategies to generate high-quality proposals for novel objects. Additionally, the RemoteCLIP model is adopted as an omniscient teacher, which provides rich knowledge to enhance classification capabilities for novel categories. A dynamic label queue is devised to maintain high-quality pseudo-labels during training. By doing so, the proposed CastDet boosts not only novel object proposals but also classification. Furthermore, we extend our approach from horizontal OVAD to oriented OVAD with tailored algorithm designs to effectively manage bounding box representation and pseudo-label generation. Extensive experiments for both tasks on multiple existing aerial object detection datasets demonstrate the effectiveness of our approach. The code is available at https://github.com/lizzy8587/CastDet.
- Published
- 2024
42. Mixing angle of $K_1(1270/1400)$ and the $K\bar K_1(1400)$ molecular interpretation of $\eta_1(1855)$
- Author
-
Liu, Zheng-Shu, Chen, Xu-Liang, Lian, Ding-Kun, Li, Ning, and Chen, Wei
- Subjects
High Energy Physics - Phenomenology ,High Energy Physics - Experiment - Abstract
Due to the SU(3) symmetry breaking effect, the axial-vector kaons $K_1(1270)$ and $K_1(1400)$ are established to be mixtures of two P-wave $K_{1A}\left( {^3{P_1}} \right)$ and $K_{1B}\left( {^1{P_1}} \right)$ states. In QCD sum rules, we propose a new construction of the $K_1$ current operators and calculate the two-point correlation functions by including the next-to-leading order four-quark condensates. The mixing angle is determined as $\theta = \left( {46.95_{ - 0.23}^{ + 0.25}} \right)^\circ$ by reproducing the masses of $K_1(1270)$ and $K_1(1400)$. We further compose the $K\bar K_1\left( {1270} \right)$ and $K\bar K_1\left( {1400} \right)$ interpolating currents with exotic quantum numbers $J^{PC}=1^{-+}$ to investigate the possible molecular interpretation of the recently observed ${\eta _1}(1855)$ state. We calculate the correlation functions and perform the QCD sum rule analyses for these two molecular systems. However, the spectral functions are found to be negative in physical regions so that they are not able to provide reliable investigations of the $K\bar K_1$ molecular states., Comment: 10 pages, 9 figures. More references added, some typos are corrected
- Published
- 2024
43. DeMod: A Holistic Tool with Explainable Detection and Personalized Modification for Toxicity Censorship
- Author
-
Li, Yaqiong, Zhang, Peng, Gu, Hansu, Lu, Tun, Qiao, Siyuan, Shu, Yubo, Shao, Yiyang, and Gu, Ning
- Subjects
Computer Science - Human-Computer Interaction ,Computer Science - Artificial Intelligence ,Computer Science - Social and Information Networks - Abstract
Although there have been automated approaches and tools supporting toxicity censorship for social posts, most of them focus on detection. Toxicity censorship is a complex process, wherein detection is just an initial task and a user can have further needs such as rationale understanding and content modification. For this problem, we conduct a needfinding study to investigate people's diverse needs in toxicity censorship and then build a ChatGPT-based censorship tool named DeMod accordingly. DeMod is equipped with the features of explainable Detection and personalized Modification, providing fine-grained detection results, detailed explanations, and personalized modification suggestions. We also implemented the tool and recruited 35 Weibo users for evaluation. The results suggest DeMod's multiple strengths like the richness of functionality, the accuracy of censorship, and ease of use. Based on the findings, we further propose several insights into the design of content censorship systems.
- Published
- 2024
44. Dynamic Supervised Principal Component Analysis for Classification
- Author
-
Ouyang, Wenbo, Wu, Ruiyang, Hao, Ning, and Zhang, Hao Helen
- Subjects
Statistics - Methodology - Abstract
This paper introduces a novel framework for dynamic classification in high dimensional spaces, addressing the evolving nature of class distributions over time or other index variables. Traditional discriminant analysis techniques are adapted to learn dynamic decision rules with respect to the index variable. In particular, we propose and study a new supervised dimension reduction method employing kernel smoothing to identify the optimal subspace, and provide a comprehensive examination of this approach for both linear discriminant analysis and quadratic discriminant analysis. We illustrate the effectiveness of the proposed methods through numerical simulations and real data examples. The results show considerable improvements in classification accuracy and computational efficiency. This work contributes to the field by offering a robust and adaptive solution to the challenges of scalability and non-staticity in high-dimensional data classification.
- Published
- 2024
45. LLM-based Framework for Bearing Fault Diagnosis
- Author
-
Tao, Laifa, Liu, Haifei, Ning, Guoao, Cao, Wenyan, Huang, Bohao, and Lu, Chen
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
Accurately diagnosing bearing faults is crucial for maintaining the efficient operation of rotating machinery. However, traditional diagnosis methods face challenges due to the diversification of application environments, including cross-condition adaptability, small-sample learning difficulties, and cross-dataset generalization. These challenges have hindered the effectiveness and limited the application of existing approaches. Large language models (LLMs) offer new possibilities for improving the generalization of diagnosis models. However, the integration of LLMs with traditional diagnosis techniques for optimal generalization remains underexplored. This paper proposed an LLM-based bearing fault diagnosis framework to tackle these challenges. First, a signal feature quantification method was put forward to address the issue of extracting semantic information from vibration data, which integrated time and frequency domain feature extraction based on a statistical analysis framework. This method textualized time-series data, aiming to efficiently learn cross-condition and small-sample common features through concise feature selection. Fine-tuning methods based on LoRA and QLoRA were employed to enhance the generalization capability of LLMs in analyzing vibration data features. In addition, the two innovations (textualizing vibration features and fine-tuning pre-trained models) were validated by single-dataset cross-condition and cross-dataset transfer experiment with complete and limited data. The results demonstrated the ability of the proposed framework to perform three types of generalization tasks simultaneously. Trained cross-dataset models got approximately a 10% improvement in accuracy, proving the adaptability of LLMs to input patterns. Ultimately, the results effectively enhance the generalization capability and fill the research gap in using LLMs for bearing fault diagnosis., Comment: 25 pages, 11 figures
- Published
- 2024
46. Holographic homogeneous superfluid on the sphere
- Author
-
Gao, Meng, Ning, Zhuan, Tian, Yu, and Zhang, Hongbao
- Subjects
High Energy Physics - Theory - Abstract
In this paper, we extend the study of holographic superfluids from planar topology to spherical topology, inspired by recent studies on Bose-Einstein condensation (BEC) superfluid on shell-shaped geometry. We mainly focus on superfluid phase transition and Quasi Normal Modes (QNMs). It turns out that the critical temperature for the superfluid phase transition on the sphere is higher than that in the planar case. We investigated four different solutions in the backgrounds of large black hole and small black hole. The calculation of free energy selects the most stable solution. Finally, after calculating the quasi-normal modes and their dynamic behavior, we obtained three different channels similar to the planar superfluid case, along with the "first" hydrodynamic excitation, which may be associated with Berezinskii-Kosterlitz-Thoules (BKT) transition., Comment: 18 pages, 14 figures
- Published
- 2024
47. Pre-trained Molecular Language Models with Random Functional Group Masking
- Author
-
Peng, Tianhao, Li, Yuchen, Li, Xuhong, Bian, Jiang, Xie, Zeke, Sui, Ning, Mumtaz, Shahid, Xu, Yanwu, Kong, Linghe, and Xiong, Haoyi
- Subjects
Quantitative Biology - Biomolecules ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning ,Physics - Chemical Physics - Abstract
Recent advancements in computational chemistry have leveraged the power of trans-former-based language models, such as MoLFormer, pre-trained using a vast amount of simplified molecular-input line-entry system (SMILES) sequences, to understand and predict molecular properties and activities, a critical step in fields like drug discovery and materials science. To further improve performance, researchers have introduced graph neural networks with graph-based molecular representations, such as GEM, incorporating the topology, geometry, 2D or even 3D structures of molecules into pre-training. While most of molecular graphs in existing studies were automatically converted from SMILES sequences, it is to assume that transformer-based language models might be able to implicitly learn structure-aware representations from SMILES sequences. In this paper, we propose \ours{} -- a SMILES-based \underline{\em M}olecular \underline{\em L}anguage \underline{\em M}odel, which randomly masking SMILES subsequences corresponding to specific molecular \underline{\em F}unctional \underline{\em G}roups to incorporate structure information of atoms during the pre-training phase. This technique aims to compel the model to better infer molecular structures and properties, thus enhancing its predictive capabilities. Extensive experimental evaluations across 11 benchmark classification and regression tasks in the chemical domain demonstrate the robustness and superiority of \ours{}. Our findings reveal that \ours{} outperforms existing pre-training models, either based on SMILES or graphs, in 9 out of the 11 downstream tasks, ranking as a close second in the remaining ones., Comment: Under review
- Published
- 2024
48. Infinite-Resolution Integral Noise Warping for Diffusion Models
- Author
-
Deng, Yitong, Lin, Winnie, Li, Lingxiao, Smirnov, Dmitriy, Burgert, Ryan, Yu, Ning, Dedun, Vincent, and Taghavi, Mohammad H.
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Graphics ,Computer Science - Machine Learning - Abstract
Adapting pretrained image-based diffusion models to generate temporally consistent videos has become an impactful generative modeling research direction. Training-free noise-space manipulation has proven to be an effective technique, where the challenge is to preserve the Gaussian white noise distribution while adding in temporal consistency. Recently, Chang et al. (2024) formulated this problem using an integral noise representation with distribution-preserving guarantees, and proposed an upsampling-based algorithm to compute it. However, while their mathematical formulation is advantageous, the algorithm incurs a high computational cost. Through analyzing the limiting-case behavior of their algorithm as the upsampling resolution goes to infinity, we develop an alternative algorithm that, by gathering increments of multiple Brownian bridges, achieves their infinite-resolution accuracy while simultaneously reducing the computational cost by orders of magnitude. We prove and experimentally validate our theoretical claims, and demonstrate our method's effectiveness in real-world applications. We further show that our method readily extends to the 3-dimensional space.
- Published
- 2024
49. GarmentLab: A Unified Simulation and Benchmark for Garment Manipulation
- Author
-
Lu, Haoran, Wu, Ruihai, Li, Yitong, Li, Sijie, Zhu, Ziyu, Ning, Chuanruo, Shen, Yan, Luo, Longzan, Chen, Yuanpei, and Dong, Hao
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Human-Computer Interaction - Abstract
Manipulating garments and fabrics has long been a critical endeavor in the development of home-assistant robots. However, due to complex dynamics and topological structures, garment manipulations pose significant challenges. Recent successes in reinforcement learning and vision-based methods offer promising avenues for learning garment manipulation. Nevertheless, these approaches are severely constrained by current benchmarks, which offer limited diversity of tasks and unrealistic simulation behavior. Therefore, we present GarmentLab, a content-rich benchmark and realistic simulation designed for deformable object and garment manipulation. Our benchmark encompasses a diverse range of garment types, robotic systems and manipulators. The abundant tasks in the benchmark further explores of the interactions between garments, deformable objects, rigid bodies, fluids, and human body. Moreover, by incorporating multiple simulation methods such as FEM and PBD, along with our proposed sim-to-real algorithms and real-world benchmark, we aim to significantly narrow the sim-to-real gap. We evaluate state-of-the-art vision methods, reinforcement learning, and imitation learning approaches on these tasks, highlighting the challenges faced by current algorithms, notably their limited generalization capabilities. Our proposed open-source environments and comprehensive analysis show promising boost to future research in garment manipulation by unlocking the full potential of these methods. We guarantee that we will open-source our code as soon as possible. You can watch the videos in supplementary files to learn more about the details of our work. Our project page is available at: https://garmentlab.github.io/, Comment: NeurIPS 2024
- Published
- 2024
50. Bidirectional Optimization onto Thermoelectric Performance via Hydrostatic-Pressure in Chalcopyrite AgXTe2 (X=In, Ga)
- Author
-
Guo, Siqi, Yue, Jincheng, Zheng, Jiongzhi, Zhang, Hui, Wang, Ning, Li, Junda, Liu, Yanhui, and Cui, Tian
- Subjects
Condensed Matter - Materials Science - Abstract
Pressure tuning has emerged as a powerful strategy for manipulating the thermoelectric properties of materials by inducing structural and electronic modifications. Herein, we systematically investigate the transport properties and thermoelectric performance concerning lattice distortions induced by hydrostatic pressure in Ag-based chalcopyrite AgXTe2 (X=In, Ga). The findings reveal that the lattice distortion in AgXTe2 exhibits distinct behaviors under lattice compression, diverging from trends observed at ambient pressure. Importantly, the hydrostatic pressure breaks the phenomenally negative correlation between thermal conductivity and lattice distortion. Pressure-induced softening of low-frequency acoustic phonons broadens the low-energy phonon spectrum, enhancing interactions between acoustic and optical phonons. Such broadening substantially increases the number of available three-phonon scattering channels, resulting in a marked reduction in thermal conductivity. Meanwhile, we establish a macroscopic connection between metavalent bonding and anharmonicity, providing an indirect explanation for lattice anharmonicity through pressure-driven transferred charge. Additionally, the applied pressure achieves a notable net increase in the power factor despite the strong coupling of electrical transport parameters, which underscores the potential for bidirectional optimization of transport properties in AgXTe2. As a result, the maximum ZT value of AgInTe2 is nearly doubled, demonstrating that pressure modulation is a powerful strategy for enhancing thermoelectric performance. Our work not only establishes the link between pressure, lattice dynamics, and thermoelectric properties within chalcopyrite AgXTe2, but also inspires the exploration of pressure-related optimization strategies for conventional thermoelectric materials.
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.