36,466 results on '"Zhang, Liang"'
Search Results
2. HierPromptLM: A Pure PLM-based Framework for Representation Learning on Heterogeneous Text-rich Networks
- Author
-
Zhu, Qiuyu, Zhang, Liang, Xu, Qianxiong, and Long, Cheng
- Subjects
Computer Science - Machine Learning - Abstract
Representation learning on heterogeneous text-rich networks (HTRNs), which consist of multiple types of nodes and edges with each node associated with textual information, is essential for various real-world applications. Given the success of pretrained language models (PLMs) in processing text data, recent efforts have focused on integrating PLMs into HTRN representation learning. These methods typically handle textual and structural information separately, using both PLMs and heterogeneous graph neural networks (HGNNs). However, this separation fails to capture the critical interactions between these two types of information within HTRNs. Additionally, it necessitates an extra alignment step, which is challenging due to the fundamental differences between distinct embedding spaces generated by PLMs and HGNNs. To deal with it, we propose HierPromptLM, a novel pure PLM-based framework that seamlessly models both text data and graph structures without the need for separate processing. Firstly, we develop a Hierarchical Prompt module that employs prompt learning to integrate text data and heterogeneous graph structures at both the node and edge levels, within a unified textual space. Building upon this foundation, we further introduce two innovative HTRN-tailored pretraining tasks to fine-tune PLMs for representation learning by emphasizing the inherent heterogeneity and interactions between textual and structural information within HTRNs. Extensive experiments on two real-world HTRN datasets demonstrate HierPromptLM outperforms state-of-the-art methods, achieving significant improvements of up to 6.08% for node classification and 10.84% for link prediction.
- Published
- 2025
3. Science objectives of the Einstein Probe mission
- Author
-
Yuan, Weimin, Dai, Lixin, Feng, Hua, Jin, Chichuan, Jonker, Peter, Kuulkers, Erik, Liu, Yuan, Nandra, Kirpal, O'Brien, Paul, Piro, Luigi, Rau, Arne, Rea, Nanda, Sanders, Jeremy, Tao, Lian, Wang, Junfeng, Wu, Xuefeng, Zhang, Bing, Zhang, Shuangnan, Ai, Shunke, Buchner, Johannes, Bulbul, Esra, Chen, Hechao, Chen, Minghua, Chen, Yong, Chen, Yu-Peng, Coleiro, Alexis, Zelati, Francesco Coti, Dai, Zigao, Fan, Xilong, Fan, Zhou, Friedrich, Susanne, Gao, He, Ge, Chong, Ge, Mingyu, Geng, Jinjun, Ghirlanda, Giancarlo, Gianfagna, Giulia, Gou, Lijun, Guillot, Sébastien, Hou, Xian, Hu, Jingwei, Huang, Yongfeng, Ji, Long, Jia, Shumei, Komossa, S., Kong, Albert K. H., Lan, Lin, Li, An, Li, Ang, Li, Chengkui, Li, Dongyue, Li, Jian, Li, Zhaosheng, Ling, Zhixing, Liu, Ang, Liu, Jinzhong, Liu, Liangduan, Liu, Zhu, Luo, Jiawei, Ma, Ruican, Maggi, Pierre, Maitra, Chandreyee, Marino, Alessio, Ng, Stephen Chi-Yung, Pan, Haiwu, Rukdee, Surangkhana, Soria, Roberto, Sun, Hui, Tam, Pak-Hin Thomas, Thakur, Aishwarya Linesh, Tian, Hui, Troja, Eleonora, Wang, Wei, Wang, Xiangyu, Wang, Yanan, Wei, Junjie, Wen, Sixiang, Wu, Jianfeng, Wu, Ting, Xiao, Di, Xu, Dong, Xu, Renxin, Xu, Yanjun, Xu, Yu, Yang, Haonan, You, Bei, Yu, Heng, Yu, Yunwei, Zhang, Binbin, Zhang, Chen, Zhang, Guobao, Zhang, Liang, Zhang, Wenda, Zhang, Yu, Zhou, Ping, and Zou, Zecheng
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
The Einstein Probe (EP) is an interdisciplinary mission of time-domain and X-ray astronomy. Equipped with a wide-field lobster-eye X-ray focusing imager, EP will discover cosmic X-ray transients and monitor the X-ray variability of known sources in 0.5-4 keV, at a combination of detecting sensitivity and cadence that is not accessible to the previous and current wide-field monitoring missions. EP can perform quick characterisation of transients or outbursts with a Wolter-I X-ray telescope onboard. In this paper, the science objectives of the Einstein Probe mission are presented. EP is expected to enlarge the sample of previously known or predicted but rare types of transients with a wide range of timescales. Among them, fast extragalactic transients will be surveyed systematically in soft X-rays, which include {\gamma}-ray bursts and their variants, supernova shock breakouts, and the predicted X-ray transients associated with binary neutron star mergers. EP will detect X-ray tidal disruption events and outbursts from active galactic nuclei, possibly at an early phase of the flares for some. EP will monitor the variability and outbursts of X-rays from white dwarfs, neutron stars and black holes in our and neighbouring galaxies at flux levels fainter than those detectable by the current instruments, and is expected to discover new objects. A large sample of stellar X-ray flares will also be detected and characterised. In the era of multi-messenger astronomy, EP has the potential of detecting the possible X-ray counterparts of gravitational wave events, neutrino sources, and ultra-high energy {\gamma}-ray and cosmic ray sources. EP is expected to help advance the studies of extreme objects/phenomena and their underlying physical processes revealed in the dynamic X-ray universe, as well as studies in other areas of X-ray astronomy., Comment: 67 pages, 24 figures, accepted for publication in SCIENCE CHINA Physics, Mechanics & Astronomy
- Published
- 2025
- Full Text
- View/download PDF
4. First Token Probability Guided RAG for Telecom Question Answering
- Author
-
Chen, Tingwei, Chen, Jiayi, Zhao, Zijian, Chen, Haolong, Zhang, Liang, and Zhu, Guangxu
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Large Language Models (LLMs) have garnered significant attention for their impressive general-purpose capabilities. For applications requiring intricate domain knowledge, Retrieval-Augmented Generation (RAG) has shown a distinct advantage in incorporating domain-specific information into LLMs. However, existing RAG research has not fully addressed the challenges of Multiple Choice Question Answering (MCQA) in telecommunications, particularly in terms of retrieval quality and mitigating hallucinations. To tackle these challenges, we propose a novel first token probability guided RAG framework. This framework leverages confidence scores to optimize key hyperparameters, such as chunk number and chunk window size, while dynamically adjusting the context. Our method starts by retrieving the most relevant chunks and generates a single token as the potential answer. The probabilities of all options are then normalized to serve as confidence scores, which guide the dynamic adjustment of the context. By iteratively optimizing the hyperparameters based on these confidence scores, we can continuously improve RAG performance. We conducted experiments to validate the effectiveness of our framework, demonstrating its potential to enhance accuracy in domain-specific MCQA tasks.
- Published
- 2025
5. Generation and Acceleration of Isolated-Attosecond Electron Bunch in a Hollow-Channel Plasma Wakefield
- Author
-
Zhang, Liang-Qi, Si, Mei-Yu, Yu, Tong-Pu, Bi, Yuan-Jie, and Huang, Yong-Sheng
- Subjects
Physics - Plasma Physics ,Physics - Accelerator Physics - Abstract
We propose a novel scheme for generating and accelerating simultaneously a dozen-GeV isolated attosecond electron bunch from an electron beam-driven hollow-channel plasma target. During the beam-target interaction, transverse oscillations of plasma electrons are induced, and subsequently, a radiative wakefield is generated. Meanwhile, a large number of plasma electrons of close to the speed of light are injected transversely from the position of the weaker radiative wakefield (e.g., the half-periodic node of the radiative wakefield) and converge towards the center of the hollow channel, forming an isolated attosecond electron bunch. Then, the attosecond electron bunch is significantly accelerated to high energies by the radiative wakefield. It is demonstrated theoretically and numerically that this scheme can efficiently generate an isolated attosecond electron bunch with a charge of more than 2 nC, a peak energy up to 13 GeV of more than 2 times that of the driving electron beam, a peak divergence angle of less than 5 mmrad, a duration of 276 as, and an energy conversion efficiency of 36.7% as well as a high stability as compared with the laser-beam drive case. Such an isolated attosecond electron bunch in the range of GeV would provide critical applications in ultrafast physics and high energy physics, etc.
- Published
- 2024
6. Data sharing in the metaverse with key abuse resistance based on decentralized CP-ABE
- Author
-
Zhang, Liang, Ou, Zhanrong, Hu, Changhui, Kan, Haibin, and Zhang, Jiheng
- Subjects
Computer Science - Cryptography and Security - Abstract
Data sharing is ubiquitous in the metaverse, which adopts blockchain as its foundation. Blockchain is employed because it enables data transparency, achieves tamper resistance, and supports smart contracts. However, securely sharing data based on blockchain necessitates further consideration. Ciphertext-policy attribute-based encryption (CP-ABE) is a promising primitive to provide confidentiality and fine-grained access control. Nonetheless, authority accountability and key abuse are critical issues that practical applications must address. Few studies have considered CP-ABE key confidentiality and authority accountability simultaneously. To our knowledge, we are the first to fill this gap by integrating non-interactive zero-knowledge (NIZK) proofs into CP-ABE keys and outsourcing the verification process to a smart contract. To meet the decentralization requirement, we incorporate a decentralized CP-ABE scheme into the proposed data sharing system. Additionally, we provide an implementation based on smart contract to determine whether an access control policy is satisfied by a set of CP-ABE keys. We also introduce an open incentive mechanism to encourage honest participation in data sharing. Hence, the key abuse issue is resolved through the NIZK proof and the incentive mechanism. We provide a theoretical analysis and conduct comprehensive experiments to demonstrate the feasibility and efficiency of the data sharing system. Based on the proposed accountable approach, we further illustrate an application in GameFi, where players can play to earn or contribute to an accountable DAO, fostering a thriving metaverse ecosystem.
- Published
- 2024
- Full Text
- View/download PDF
7. BA-ORABE: Blockchain-Based Auditable Registered Attribute-Based Encryption With Reliable Outsourced Decryption
- Author
-
Cai, Dongliang, Chen, Borui, Zhang, Liang, and Kan, Haibin
- Subjects
Computer Science - Cryptography and Security - Abstract
Attribute-based encryption (ABE) is a generalization of public-key encryption that enables fine-grained access control in cloud services. Recently, Hohenberger et al. (Eurocrypt 2023) introduced the notion of registered ABE, which is an ABE scheme without a trusted central authority. Instead, users generate their own public/secret keys and then register their keys and attributes with a key curator. The key curator is a transparent and untrusted entity and its behavior needs to be audited for malicious registration. In addition, pairing-based registered ABE still suffers the heavy decryption overhead like ABE. A general approach to address this issue is to outsource decryption to a decryption cloud service (DCS).In this work, we propose BA-ORABE, the first fully auditable registered ABE with reliable outsourced decryption scheme based on blockchain. First, we utilize a verifiable tag mechanism to achieve verifiability of ciphertext transformation, and the exemptibility which enables the honest DCS to escape from wrong claims is guaranteed by zero knowledge fraud proof under optimistic assumption. Additionally, our system achieves fairness and decentralized outsourcing to protect the interests of all parties and the registration and outsourcing process are transparent and fully auditable through blockchain. Finally, we give security analysis, implement and evaluate our scheme on Ethereum to demonstrate its feasibility and efficiency, and show its advantages in real application of decentralized finance., Comment: 15pages,add application
- Published
- 2024
8. LV-CadeNet: Long View Feature Convolution-Attention Fusion Encoder-Decoder Network for Clinical MEG Spike Detection
- Author
-
Xiao, Kuntao, Wang, Xiongfei, Teng, Pengfei, Sun, Yi, Yang, Wanli, Zhang, Liang, Dong, Hanyang, Luan, Guoming, and Sheng, Shurong
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,I.4.6 ,I.5.1 ,J.3 - Abstract
It is widely acknowledged that the epileptic foci can be pinpointed by source localizing interictal epileptic discharges (IEDs) via Magnetoencephalography (MEG). However, manual detection of IEDs, which appear as spikes in MEG data, is extremely labor intensive and requires considerable professional expertise, limiting the broader adoption of MEG technology. Numerous studies have focused on automatic detection of MEG spikes to overcome this challenge, but these efforts often validate their models on synthetic datasets with balanced positive and negative samples. In contrast, clinical MEG data is highly imbalanced, raising doubts on the real-world efficacy of these models. To address this issue, we introduce LV-CadeNet, a Long View feature Convolution-Attention fusion Encoder-Decoder Network, designed for automatic MEG spike detection in real-world clinical scenarios. Beyond addressing the disparity between training data distribution and clinical test data through semi-supervised learning, our approach also mimics human specialists by constructing long view morphological input data. Moreover, we propose an advanced convolution-attention module to extract temporal and spatial features from the input data. LV-CadeNet significantly improves the accuracy of MEG spike detection, boosting it from 42.31\% to 54.88\% on a novel clinical dataset sourced from Sanbo Brain Hospital Capital Medical University. This dataset, characterized by a highly imbalanced distribution of positive and negative samples, accurately represents real-world clinical scenarios.
- Published
- 2024
9. The classification and formation rate of $\mathbf{Swift/BAT}$ gamma-ray bursts
- Author
-
Luo, Juan-Juan, Zhang, Liang, Zhang, Li-Yun, Huang, Yong-Feng, Lin, Jia-Quan, Lu, Jun-Wang, and Dong, Xiao-Fei
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,High Energy Physics - Phenomenology - Abstract
Gamma-ray bursts (GRBs) are usually classified into long/short categories according to their durations, but controversy still exists in this aspect. Here we re-examine the long/short classification of GRBs and further compare the cosmological distribution and evolution of each potential subclass. A large number of $Swift/BAT$ GRBs are analyzed in this study. The Gaussian mixture model is used to fit the duration distribution as well as the joint distribution of duration and hardness ratio, and the Akaike and Bayesian information criteria are adopted to assess the goodness of fit. It is found that three Gaussian components can better fit both the univariate and bivariate distributions, indicating that there are three subclasses in the $Swift/BAT$ GRBs, namely short, intermediate, and long subclasses. The non-parametric Efron-Petrosian and Lynden-Bell's $c^{-}$ methods are used to derive the luminosity function and formation rate from the truncated data of bursts with known redshift in each subclass. It is found that the luminosity distributions and birth rates of the three subclasses are different, further supporting the existence of the intermediate subclass in the $Swift/BAT$ GRBs., Comment: 11 pages, 3 figures, 2 tables
- Published
- 2024
10. Adaptive extended Kalman filter and point ahead angle prediction in the detection of gravitational waves in space
- Author
-
Yang, Jinke, Xie, Yong, Tang, Wenlin, Liang, Xindong, Zhang, Liang, Cui, Zhao, Wang, Xue, Li, Haojie, Jia, Jianjun, and Lau, Yun Kau
- Subjects
Astrophysics - Instrumentation and Methods for Astrophysics - Abstract
In the detection of gravitational waves in space, during the science phase of the mission, the point ahead angle mechanism (PAAM) serves to steer a laser beam to compensate for the angle generated by the relative motion of the two spacecrafts (SCs) during the approximately 10 seconds of flight time a laser beam will take from one SC to reach a distant SC of three million kilometers away. The common practice for pointing stability control of a laser beam is to first do a coarse tracking by the PAAM to steer a laser beam to compensate for the relative motion between two SCs, to be followed by a fine pointing stability control. In the present work, by exploiting the near-circular orbit structure of individual SC in the triangular constellation, the feasibility of inserting an adaptive Kalman filter (AEKF) into the PAAM control loop is investigated. By adopting a colored measurement noise model that closely resembles the prospective on orbit situation, numerical simulation suggests that the dynamic range of the PAAM may be reduced to the level of nano-radians using the prediction of the pointing head angle (PAA) by the AEKF. This will cut down on the TTL coupling noise and the position noise budget allocated to the PAAM. This in turn reduces the dynamic range of the fine pointing control and leaves room to improve its accuracy, thereby offers the prospect of reduction of the position noise budget allocated to the laser pointing instability as a whole., Comment: 29 pages, 23 figures, published in PRD
- Published
- 2024
- Full Text
- View/download PDF
11. Channel-Adaptive Wireless Image Semantic Transmission with Learnable Prompts
- Author
-
Zhang, Liang, Huang, Danlan, Zhou, Xinyi, Ding, Feng, Wu, Sheng, and Wei, Zhiqing
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing - Abstract
Recent developments in Deep learning based Joint Source-Channel Coding (DeepJSCC) have demonstrated impressive capabilities within wireless semantic communications system. However, existing DeepJSCC methodologies exhibit limited generalization ability across varying channel conditions, necessitating the preparation of multiple models. Optimal performance is only attained when the channel status during testing aligns precisely with the training channel status, which is very inconvenient for real-life applications. In this paper, we introduce a novel DeepJSCC framework, termed Prompt JSCC (PJSCC), which incorporates a learnable prompt to implicitly integrate the physical channel state into the transmission system. Specifically, the Channel State Prompt (CSP) module is devised to generate prompts based on diverse SNR and channel distribution models. Through the interaction of latent image features with channel features derived from the CSP module, the DeepJSCC process dynamically adapts to varying channel conditions without necessitating retraining. Comparative analyses against leading DeepJSCC methodologies and traditional separate coding approaches reveal that the proposed PJSCC achieves optimal image reconstruction performance across different SNR settings and various channel models, as assessed by Peak Signal-to-Noise Ratio (PSNR) and Learning-based Perceptual Image Patch Similarity (LPIPS) metrics. Furthermore, in real-world scenarios, PJSCC shows excellent memory efficiency and scalability, rendering it readily deployable on resource-constrained platforms to facilitate semantic communications.
- Published
- 2024
12. A Multi-Scale Spatial-Temporal Network for Wireless Video Transmission
- Author
-
Zhou, Xinyi, Huang, Danlan, Qi, Zhixin, Zhang, Liang, and Jiang, Ting
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing - Abstract
Deep joint source-channel coding (DeepJSCC) has shown promise in wireless transmission of text, speech, and images within the realm of semantic communication. However, wireless video transmission presents greater challenges due to the difficulty of extracting and compactly representing both spatial and temporal features, as well as its significant bandwidth and computational resource requirements. In response, we propose a novel video DeepJSCC (VDJSCC) approach to enable end-to-end video transmission over a wireless channel. Our approach involves the design of a multi-scale vision Transformer encoder and decoder to effectively capture spatial-temporal representations over long-term frames. Additionally, we propose a dynamic token selection module to mask less semantically important tokens from spatial or temporal dimensions, allowing for content-adaptive variable-length video coding by adjusting the token keep ratio. Experimental results demonstrate the effectiveness of our VDJSCC approach compared to digital schemes that use separate source and channel codes, as well as other DeepJSCC schemes, in terms of reconstruction quality and bandwidth reduction., Comment: 2024 IEEE Global Communications Conference (GLOBECOM)
- Published
- 2024
13. Invar-RAG: Invariant LLM-aligned Retrieval for Better Generation
- Author
-
Liu, Ziwei, Zhang, Liang, Li, Qian, Wu, Jianghua, and Zhu, Guangxu
- Subjects
Computer Science - Information Retrieval - Abstract
Retrieval-augmented generation (RAG) has shown impressive capability in providing reliable answer predictions and addressing hallucination problems. A typical RAG implementation uses powerful retrieval models to extract external information and large language models (LLMs) to generate answers. In contrast, recent LLM-based retrieval has gained attention for its substantial improvements in information retrieval (IR) due to the LLMs' semantic understanding capability. However, directly applying LLM to RAG systems presents challenges. This may cause feature locality problems as massive parametric knowledge can hinder effective usage of global information across the corpus; for example, an LLM-based retriever often inputs document summaries instead of full documents. Moreover, various pre-trained tasks in LLMs introduce variance, further weakening performance as a retriever. To address these issues, we propose a novel two-stage fine-tuning architecture called Invar-RAG. In the retrieval stage, an LLM-based retriever is constructed by integrating LoRA-based representation learning to tackle feature locality issues. To enhance retrieval performance, we develop two patterns (invariant and variant patterns) and an invariance loss to reduce LLM variance. In the generation stage, a refined fine-tuning method is employed to improve LLM accuracy in generating answers based on retrieved information. Experimental results show that Invar-RAG significantly outperforms existing baselines across three open-domain question answering (ODQA) datasets. Code is available in the Supplementary Material for reproducibility.
- Published
- 2024
14. Attribute-Based Encryption With Payable Outsourced Decryption Using Blockchain and Responsive Zero Knowledge Proof
- Author
-
Cai, Dongliang, Chen, Borui, Zhang, Liang, Li, Kexin, and Kan, Haibin
- Subjects
Computer Science - Cryptography and Security - Abstract
Attribute-Based Encryption (ABE) is a promising solution for access control in cloud services. However, the heavy decryption overhead hinders its widespread adoption. A general approach to address this issue is to outsource decryption to decryption cloud service(DCS). Existing schemes have utilized various methods to enable users to verify outsourced results; however, they lack an effective mechanism to achieve exemptibility which enables the honest DCS to escape from wrong claims. And it is impractical to assume that the DCS will provide free services. In this paper, we propose a blockchain-based payable outsourced decryption ABE scheme that achieves both verifiability and exemptibility without adding redundant information to ABE ciphertext. We use zero-knowledge proof to verify outsourced results on blockchain and introduce an optional single-round challenge game under optimistic assumption to address the high cost of proof generation. Moreover, our system achieves fairness and decentralized outsourcing to protect the interests of all parties. Finally, we implement and evaluate our scheme on Ethereum to demonstrate its feasibility and efficiency, the gas usage in attribute numbers from 5 to 60 is 11$\times$ to 140$\times$ in the happy case and 4$\times$ to 55$\times$ in the challenge case lower than the scheme of Ge et al. (TDSC'23)., Comment: 12 pages, 5 figures
- Published
- 2024
15. MuCol Milestone Report No. 5: Preliminary Parameters
- Author
-
Accettura, Carlotta, Adrian, Simon, Agarwal, Rohit, Ahdida, Claudia, Aimé, Chiara, Aksoy, Avni, Alberghi, Gian Luigi, Alden, Siobhan, Alfonso, Luca, Amapane, Nicola, Amorim, David, Andreetto, Paolo, Anulli, Fabio, Appleby, Rob, Apresyan, Artur, Asadi, Pouya, Mahmoud, Mohammed Attia, Auchmann, Bernhard, Back, John, Badea, Anthony, Bae, Kyu Jung, Bahng, E. J., Balconi, Lorenzo, Balli, Fabrice, Bandiera, Laura, Barbagallo, Carmelo, Barlow, Roger, Bartoli, Camilla, Bartosik, Nazar, Barzi, Emanuela, Batsch, Fabian, Bauce, Matteo, Begel, Michael, Berg, J. Scott, Bersani, Andrea, Bertarelli, Alessandro, Bertinelli, Francesco, Bertolin, Alessandro, Bhat, Pushpalatha, Bianchi, Clarissa, Bianco, Michele, Bishop, William, Black, Kevin, Boattini, Fulvio, Bogacz, Alex, Bonesini, Maurizio, Bordini, Bernardo, de Sousa, Patricia Borges, Bottaro, Salvatore, Bottura, Luca, Boyd, Steven, Breschi, Marco, Broggi, Francesco, Brunoldi, Matteo, Buffat, Xavier, Buonincontri, Laura, Burrows, Philip Nicholas, Burt, Graeme Campbell, Buttazzo, Dario, Caiffi, Barbara, Calatroni, Sergio, Calviani, Marco, Calzaferri, Simone, Calzolari, Daniele, Cantone, Claudio, Capdevilla, Rodolfo, Carli, Christian, Carrelli, Carlo, Casaburo, Fausto, Casarsa, Massimo, Castelli, Luca, Catanesi, Maria Gabriella, Cavallucci, Lorenzo, Cavoto, Gianluca, Celiberto, Francesco Giovanni, Celona, Luigi, Cemmi, Alessia, Ceravolo, Sergio, Cerri, Alessandro, Cerutti, Francesco, Cesarini, Gianmario, Cesarotti, Cari, Chancé, Antoine, Charitonidis, Nikolaos, Chiesa, Mauro, Chiggiato, Paolo, Ciccarella, Vittoria Ludovica, Puviani, Pietro Cioli, Colaleo, Anna, Colao, Francesco, Collamati, Francesco, Costa, Marco, Craig, Nathaniel, Curtin, David, Damerau, Heiko, Da Molin, Giacomo, D'Angelo, Laura, Dasu, Sridhara, de Blas, Jorge, De Curtis, Stefania, De Gersem, Herbert, Delahaye, Jean-Pierre, Del Moro, Tommaso, Denisov, Dmitri, Denizli, Haluk, Dermisek, Radovan, Valdor, Paula Desiré, Desponds, Charlotte, Di Luzio, Luca, Di Meco, Elisa, Diociaiuti, Eleonora, Di Petrillo, Karri Folan, Di Sarcina, Ilaria, Dorigo, Tommaso, Dreimanis, Karlis, Pree, Tristan du, Yildiz, Hatice Duran, Edgecock, Thomas, Fabbri, Siara, Fabbrichesi, Marco, Farinon, Stefania, Ferrand, Guillaume, Somoza, Jose Antonio Ferreira, Fieg, Max, Filthaut, Frank, Fox, Patrick, Franceschini, Roberto, Ximenes, Rui Franqueira, Gallinaro, Michele, Garcia-Sciveres, Maurice, Garcia-Tabares, Luis, Gargiulo, Ruben, Garion, Cedric, Garzelli, Maria Vittoria, Gast, Marco, Generoso, Lisa, Gerber, Cecilia E., Giambastiani, Luca, Gianelle, Alessio, Gianfelice-Wendt, Eliana, Gibson, Stephen, Gilardoni, Simone, Giove, Dario Augusto, Giovinco, Valentina, Giraldin, Carlo, Glioti, Alfredo, Gorzawski, Arkadiusz, Greco, Mario, Grojean, Christophe, Grudiev, Alexej, Gschwendtner, Edda, Gueli, Emanuele, Guilhaudin, Nicolas, Han, Chengcheng, Han, Tao, Hauptman, John Michael, Herndon, Matthew, Hillier, Adrian D, Hillman, Micah, Holmes, Tova Ray, Homiller, Samuel, Jana, Sudip, Jindariani, Sergo, Johannesson, Sofia, Johnson, Benjamin, Jones, Owain Rhodri, Jurj, Paul-Bogdan, Kahn, Yonatan, Kamath, Rohan, Kario, Anna, Karpov, Ivan, Kelliher, David, Kilian, Wolfgang, Kitano, Ryuichiro, Kling, Felix, Kolehmainen, Antti, Kong, K. C., Kosse, Jaap, Krintiras, Georgios, Krizka, Karol, Kumar, Nilanjana, Kvikne, Erik, Kyle, Robert, Laface, Emanuele, Lane, Kenneth, Latina, Andrea, Lechner, Anton, Lee, Junghyun, Lee, Lawrence, Lee, Seh Wook, Lefevre, Thibaut, Leonardi, Emanuele, Lerner, Giuseppe, Li, Peiran, Li, Qiang, Li, Tong, Li, Wei, Lindroos, Mats, Lipton, Ronald, Liu, Da, Liu, Miaoyuan, Liu, Zhen, Voti, Roberto Li, Lombardi, Alessandra, Lomte, Shivani, Long, Kenneth, Longo, Luigi, Lorenzo, José, Losito, Roberto, Low, Ian, Lu, Xianguo, Lucchesi, Donatella, Luo, Tianhuan, Lupato, Anna, Ma, Yang, Machida, Shinji, Madlener, Thomas, Magaletti, Lorenzo, Maggi, Marcello, Durand, Helene Mainaud, Maltoni, Fabio, Manczak, Jerzy Mikolaj, Mandurrino, Marco, Marchand, Claude, Mariani, Francesco, Marin, Stefano, Mariotto, Samuele, Martin-Haugh, Stewart, Masullo, Maria Rosaria, Mauro, Giorgio Sebastiano, Mazzolari, Andrea, Mękała, Krzysztof, Mele, Barbara, Meloni, Federico, Meng, Xiangwei, Mentink, Matthias, Métral, Elias, Miceli, Rebecca, Milas, Natalia, Mohammadi, Abdollah, Moll, Dominik, Montella, Alessandro, Morandin, Mauro, Morrone, Marco, Mulder, Tim, Musenich, Riccardo, Nardecchia, Marco, Nardi, Federico, Nenna, Felice, Neuffer, David, Newbold, David, Novelli, Daniel, Olvegård, Maja, Onel, Yasar, Orestano, Domizia, Osborne, John, Otten, Simon, Torres, Yohan Mauricio Oviedo, Paesani, Daniele, Griso, Simone Pagan, Pagani, Davide, Pal, Kincso, Palmer, Mark, Pampaloni, Alessandra, Panci, Paolo, Pani, Priscilla, Papaphilippou, Yannis, Paparella, Rocco, Paradisi, Paride, Passeri, Antonio, Pasternak, Jaroslaw, Pastrone, Nadia, Pellecchia, Antonello, Piccinini, Fulvio, Piekarz, Henryk, Pieloni, Tatiana, Plouin, Juliette, Portone, Alfredo, Potamianos, Karolos, Potdevin, Joséphine, Prestemon, Soren, Puig, Teresa, Qiang, Ji, Quettier, Lionel, Rabemananjara, Tanjona Radonirina, Radicioni, Emilio, Radogna, Raffaella, Rago, Ilaria Carmela, Ratkus, Andris, Resseguie, Elodie, Reuter, Juergen, Ribani, Pier Luigi, Riccardi, Cristina, Ricciardi, Stefania, Robens, Tania, Robert, Youri, Rogers, Chris, Rojo, Juan, Romagnoni, Marco, Ronald, Kevin, Rosser, Benjamin, Rossi, Carlo, Rossi, Lucio, Rozanov, Leo, Ruhdorfer, Maximilian, Ruiz, Richard, Saini, Saurabh, Sala, Filippo, Salierno, Claudia, Salmi, Tiina, Salvini, Paola, Salvioni, Ennio, Sammut, Nicholas, Santini, Carlo, Saputi, Alessandro, Sarra, Ivano, Scarantino, Giuseppe, Schneider-Muntau, Hans, Schulte, Daniel, Scifo, Jessica, Sen, Tanaji, Senatore, Carmine, Senol, Abdulkadir, Sertore, Daniele, Sestini, Lorenzo, Rêgo, Ricardo César Silva, Simone, Federica Maria, Skoufaris, Kyriacos, Sorbello, Gino, Sorbi, Massimo, Sorti, Stefano, Soubirou, Lisa, Spataro, David, Queiroz, Farinaldo S., Stamerra, Anna, Stapnes, Steinar, Stark, Giordon, Statera, Marco, Stechauner, Bernd Michael, Su, Shufang, Su, Wei, Sun, Xiaohu, Sytov, Alexei, Tang, Jian, Tang, Jingyu, Taylor, Rebecca, Kate, Herman Ten, Testoni, Pietro, Thiele, Leonard Sebastian, Garcia, Rogelio Tomas, Topp-Mugglestone, Max, Torims, Toms, Torre, Riccardo, Tortora, Luca, Tortora, Ludovico, Trifinopoulos, Sokratis, Udongwo, Sosoho-Abasi, Vai, Ilaria, Valente, Riccardo Umberto, van Rienen, Ursula, Van Weelderen, Rob, Vanwelde, Marion, Velev, Gueorgui, Venditti, Rosamaria, Vendrasco, Adam, Verna, Adriano, Vernassa, Gianluca, Verweij, Arjan, Verwilligen, Piet, Villamizar, Yoxara, Vittorio, Ludovico, Vitulo, Paolo, Vojskovic, Isabella, Wang, Dayong, Wang, Lian-Tao, Wang, Xing, Wendt, Manfred, Widorski, Markus, Wozniak, Mariusz, Wu, Yongcheng, Wulzer, Andrea, Xie, Keping, Yang, Yifeng, Yap, Yee Chinn, Yonehara, Katsuya, Yoo, Hwi Dong, You, Zhengyun, Zanetti, Marco, Zaza, Angela, Zhang, Liang, Zhu, Ruihu, Zlobin, Alexander, Zuliani, Davide, and Zurita, José Francisco
- Subjects
Physics - Accelerator Physics - Abstract
This document is comprised of a collection of updated preliminary parameters for the key parts of the muon collider. The updated preliminary parameters follow on from the October 2023 Tentative Parameters Report. Particular attention has been given to regions of the facility that are believed to hold greater technical uncertainty in their design and that have a strong impact on the cost and power consumption of the facility. The data is collected from a collaborative spreadsheet and transferred to overleaf.
- Published
- 2024
- Full Text
- View/download PDF
16. Exploring Optimal Transport-Based Multi-Grained Alignments for Text-Molecule Retrieval
- Author
-
Min, Zijun, Liu, Bingshuai, Zhang, Liang, Song, Jia, Su, Jinsong, He, Song, and Bo, Xiaochen
- Subjects
Computer Science - Information Retrieval ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Quantitative Biology - Biomolecules - Abstract
The field of bioinformatics has seen significant progress, making the cross-modal text-molecule retrieval task increasingly vital. This task focuses on accurately retrieving molecule structures based on textual descriptions, by effectively aligning textual descriptions and molecules to assist researchers in identifying suitable molecular candidates. However, many existing approaches overlook the details inherent in molecule sub-structures. In this work, we introduce the Optimal TRansport-based Multi-grained Alignments model (ORMA), a novel approach that facilitates multi-grained alignments between textual descriptions and molecules. Our model features a text encoder and a molecule encoder. The text encoder processes textual descriptions to generate both token-level and sentence-level representations, while molecules are modeled as hierarchical heterogeneous graphs, encompassing atom, motif, and molecule nodes to extract representations at these three levels. A key innovation in ORMA is the application of Optimal Transport (OT) to align tokens with motifs, creating multi-token representations that integrate multiple token alignments with their corresponding motifs. Additionally, we employ contrastive learning to refine cross-modal alignments at three distinct scales: token-atom, multitoken-motif, and sentence-molecule, ensuring that the similarities between correctly matched text-molecule pairs are maximized while those of unmatched pairs are minimized. To our knowledge, this is the first attempt to explore alignments at both the motif and multi-token levels. Experimental results on the ChEBI-20 and PCdes datasets demonstrate that ORMA significantly outperforms existing state-of-the-art (SOTA) models., Comment: BIBM 2024 Regular Paper
- Published
- 2024
17. X-ray and Radio Campaign of the Z-source GX 340+0 II: the X-ray polarization in the normal branch
- Author
-
Bhargava, Yash, Russell, Thomas D., Ng, Mason, Balasubramanian, Arvind, Zhang, Liang, Ravi, Swati, Jadoliya, Vishal, Bhattacharyya, Sudip, Pahari, Mayukh, Homan, Jeroen, Marshall, Herman L., Chakrabarty, Deepto, Carotenuto, Francesco, and Kaushik, Aman
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
We present the first X-ray polarization measurement of the neutron star low-mass X-ray binary and Z-source, GX 340$+$0, in the normal branch (NB) using a 200 ks observation with the Imaging X-ray Polarimetric Explorer (IXPE). This observation was performed in 2024 August. Along with IXPE, we also conducted simultaneous observations with NICER, AstroSat, Insight-HXMT, ATCA, and GMRT to investigate the broadband spectral and timing properties in the X-ray and radio wavelengths. During the campaign, the source traced a complete Z-track during the IXPE observation but spent most of the time in the NB. We measure X-ray polarization degree (PD) of $1.22\pm0.25\%$ in the 2-8 keV energy band with a polarization angle (PA) of $38\pm6^\circ$. The PD in the NB is observed to be weaker than in the horizontal branch (HB) but aligned in the same direction. The PD of the source exhibits a marginal increase with energy while the PA shows no energy dependence. The joint spectro-polarimetric modeling is consistent with the observed X-ray polarization originating from a single spectral component from the blackbody, the Comptonized emission, or reflection feature, while the disk emission does not contribute towards the X-ray polarization. GMRT observations at 1.26 GHz during HB had a tentative detection at 4.5$\pm$0.7 mJy while ATCA observations a day later during the NB detected the source at 0.70$\pm$0.05 mJy and 0.59$\pm$0.05 mJy in the 5.5 & 9 GHz bands, respectively, suggesting an evolving jet structure depending on the Z-track position., Comment: 17 pages, 5 figures, 4 tables; Submitted to ApJ
- Published
- 2024
18. Towards Cross-Modal Text-Molecule Retrieval with Better Modality Alignment
- Author
-
Song, Jia, Zhuang, Wanru, Lin, Yujie, Zhang, Liang, Li, Chunyan, Su, Jinsong, He, Song, and Bo, Xiaochen
- Subjects
Computer Science - Information Retrieval - Abstract
Cross-modal text-molecule retrieval model aims to learn a shared feature space of the text and molecule modalities for accurate similarity calculation, which facilitates the rapid screening of molecules with specific properties and activities in drug design. However, previous works have two main defects. First, they are inadequate in capturing modality-shared features considering the significant gap between text sequences and molecule graphs. Second, they mainly rely on contrastive learning and adversarial training for cross-modality alignment, both of which mainly focus on the first-order similarity, ignoring the second-order similarity that can capture more structural information in the embedding space. To address these issues, we propose a novel cross-modal text-molecule retrieval model with two-fold improvements. Specifically, on the top of two modality-specific encoders, we stack a memory bank based feature projector that contain learnable memory vectors to extract modality-shared features better. More importantly, during the model training, we calculate four kinds of similarity distributions (text-to-text, text-to-molecule, molecule-to-molecule, and molecule-to-text similarity distributions) for each instance, and then minimize the distance between these similarity distributions (namely second-order similarity losses) to enhance cross-modal alignment. Experimental results and analysis strongly demonstrate the effectiveness of our model. Particularly, our model achieves SOTA performance, outperforming the previously-reported best result by 6.4%., Comment: BIBM 2024 regular paper
- Published
- 2024
19. On the Crucial Role of Initialization for Matrix Factorization
- Author
-
Li, Bingcong, Zhang, Liang, Mokhtari, Aryan, and He, Niao
- Subjects
Computer Science - Machine Learning ,Electrical Engineering and Systems Science - Signal Processing ,Mathematics - Optimization and Control - Abstract
This work revisits the classical low-rank matrix factorization problem and unveils the critical role of initialization in shaping convergence rates for such nonconvex and nonsmooth optimization. We introduce Nystrom initialization, which significantly improves the global convergence of Scaled Gradient Descent (ScaledGD) in both symmetric and asymmetric matrix factorization tasks. Specifically, we prove that ScaledGD with Nystrom initialization achieves quadratic convergence in cases where only linear rates were previously known. Furthermore, we extend this initialization to low-rank adapters (LoRA) commonly used for finetuning foundation models. Our approach, NoRA, i.e., LoRA with Nystrom initialization, demonstrates superior performance across various downstream tasks and model scales, from 1B to 7B parameters, in large language and diffusion models.
- Published
- 2024
20. Implicit Regularization of Sharpness-Aware Minimization for Scale-Invariant Problems
- Author
-
Li, Bingcong, Zhang, Liang, and He, Niao
- Subjects
Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
Sharpness-aware minimization (SAM) improves generalization of various deep learning tasks. Motivated by popular architectures such as LoRA, we explore the implicit regularization of SAM for scale-invariant problems involving two groups of variables. Instead of focusing on commonly used sharpness, this work introduces a concept termed balancedness, defined as the difference between the squared norm of two variables. This allows us to depict richer global behaviors of SAM. In particular, our theoretical and empirical findings reveal that i) SAM promotes balancedness; and ii) the regularization on balancedness is data-responsive -- outliers have stronger impact. The latter coincides with empirical observations that SAM outperforms SGD in the presence of outliers. Leveraging the implicit regularization, we develop a resource-efficient SAM variant, balancedness-aware regularization (BAR), tailored for scale-invariant problems such as finetuning language models with LoRA. BAR saves 95% computational overhead of SAM, with enhanced test performance across various tasks on RoBERTa, GPT2, and OPT-1.3B., Comment: NeurIPS 2024
- Published
- 2024
21. A Systematic Assessment of OpenAI o1-Preview for Higher Order Thinking in Education
- Author
-
Latif, Ehsan, Zhou, Yifan, Guo, Shuchen, Gao, Yizhu, Shi, Lehong, Nayaaba, Matthew, Lee, Gyeonggeon, Zhang, Liang, Bewersdorff, Arne, Fang, Luyang, Yang, Xiantong, Zhao, Huaqin, Jiang, Hanqi, Lu, Haoran, Li, Jiaxi, Yu, Jichao, You, Weihang, Liu, Zhengliang, Liu, Vincent Shung, Wang, Hui, Wu, Zihao, Lu, Jin, Dou, Fei, Ma, Ping, Liu, Ninghao, Liu, Tianming, and Zhai, Xiaoming
- Subjects
Computer Science - Computers and Society ,Computer Science - Artificial Intelligence - Abstract
As artificial intelligence (AI) continues to advance, it demonstrates capabilities comparable to human intelligence, with significant potential to transform education and workforce development. This study evaluates OpenAI o1-preview's ability to perform higher-order cognitive tasks across 14 dimensions, including critical thinking, systems thinking, computational thinking, design thinking, metacognition, data literacy, creative thinking, abstract reasoning, quantitative reasoning, logical reasoning, analogical reasoning, and scientific reasoning. We used validated instruments like the Ennis-Weir Critical Thinking Essay Test and the Biological Systems Thinking Test to compare the o1-preview's performance with human performance systematically. Our findings reveal that o1-preview outperforms humans in most categories, achieving 150% better results in systems thinking, computational thinking, data literacy, creative thinking, scientific reasoning, and abstract reasoning. However, compared to humans, it underperforms by around 25% in logical reasoning, critical thinking, and quantitative reasoning. In analogical reasoning, both o1-preview and humans achieved perfect scores. Despite these strengths, the o1-preview shows limitations in abstract reasoning, where human psychology students outperform it, highlighting the continued importance of human oversight in tasks requiring high-level abstraction. These results have significant educational implications, suggesting a shift toward developing human skills that complement AI, such as creativity, abstract reasoning, and critical thinking. This study emphasizes the transformative potential of AI in education and calls for a recalibration of educational goals, teaching methods, and curricula to align with an AI-driven world., Comment: An assessment of OpenAI o1-Preview for Higher Order Thinking in Education
- Published
- 2024
22. A practical applicable quantum-classical hybrid ant colony algorithm for the NISQ era
- Author
-
Qiu, Qian, Zhang, Liang, Wu, Mohan, Sun, Qichun, Li, Xiaogang, Li, Da-Chuang, and Xu, Hua
- Subjects
Quantum Physics ,Computer Science - Neural and Evolutionary Computing - Abstract
Quantum ant colony optimization (QACO) has drew much attention since it combines the advantages of quantum computing and ant colony optimization (ACO) algorithm overcoming some limitations of the traditional ACO algorithm. However,due to the hardware resource limitations of currently available quantum computers, the practical application of the QACO is still not realized. In this paper, we developed a quantum-classical hybrid algorithm by combining the clustering algorithm with QACO algorithm.This extended QACO can handle large-scale optimization problems with currently available quantum computing resource. We have tested the effectiveness and performance of the extended QACO algorithm with the Travelling Salesman Problem (TSP) as benchmarks, and found the algorithm achieves better performance under multiple diverse datasets. In addition, we investigated the noise impact on the extended QACO and evaluated its operation possibility on current available noisy intermediate scale quantum(NISQ) devices. Our work shows that the combination of the clustering algorithm with QACO effectively improved its problem solving scale, which makes its practical application possible in current NISQ era of quantum computing., Comment: arXiv admin note: substantial text overlap with arXiv:2403.00367
- Published
- 2024
23. One2set + Large Language Model: Best Partners for Keyphrase Generation
- Author
-
Shao, Liangying, Zhang, Liang, Peng, Minlong, Ma, Guoqi, Yue, Hao, Sun, Mingming, and Su, Jinsong
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Keyphrase generation (KPG) aims to automatically generate a collection of phrases representing the core concepts of a given document. The dominant paradigms in KPG include one2seq and one2set. Recently, there has been increasing interest in applying large language models (LLMs) to KPG. Our preliminary experiments reveal that it is challenging for a single model to excel in both recall and precision. Further analysis shows that: 1) the one2set paradigm owns the advantage of high recall, but suffers from improper assignments of supervision signals during training; 2) LLMs are powerful in keyphrase selection, but existing selection methods often make redundant selections. Given these observations, we introduce a generate-then-select framework decomposing KPG into two steps, where we adopt a one2set-based model as generator to produce candidates and then use an LLM as selector to select keyphrases from these candidates. Particularly, we make two important improvements on our generator and selector: 1) we design an Optimal Transport-based assignment strategy to address the above improper assignments; 2) we model the keyphrase selection as a sequence labeling task to alleviate redundant selections. Experimental results on multiple benchmark datasets show that our framework significantly surpasses state-of-the-art models, especially in absent keyphrase prediction., Comment: Accepted by EMNLP 2024 Main Conference
- Published
- 2024
24. Exploring Semantic Clustering in Deep Reinforcement Learning for Video Games
- Author
-
Zhang, Liang, Lieffers, Justin, and Pyarelal, Adarsh
- Subjects
Computer Science - Artificial Intelligence - Abstract
In this paper, we investigate the semantic clustering properties of deep reinforcement learning (DRL) for video games, enriching our understanding of the internal dynamics of DRL and advancing its interpretability. In this context, semantic clustering refers to the inherent capacity of neural networks to internally group video inputs based on semantic similarity. To achieve this, we propose a novel DRL architecture that integrates a semantic clustering module featuring both feature dimensionality reduction and online clustering. This module seamlessly integrates into the DRL training pipeline, addressing instability issues observed in previous t-SNE-based analysis methods and eliminating the necessity for extensive manual annotation of semantic analysis. Through experiments, we validate the effectiveness of the proposed module and the semantic clustering properties in DRL for video games. Additionally, based on these properties, we introduce new analytical methods to help understand the hierarchical structure of policies and the semantic distribution within the feature space.
- Published
- 2024
25. Geometry of the comptonization region of MAXI J1348$-$630 through type-C quasi-periodic oscillations with NICER
- Author
-
Alabarta, Kevin, Méndez, Mariano, García, Federico, Altamirano, Diego, Zhang, Yuexin, Zhang, Liang, Russell, David M., and König, Ole
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
We use the rms and lag spectra of the type-C quasi-periodic oscillation (QPO) to study the properties of the Comptonisation region (aka corona) during the low/hard and hard-intermediate states of the main outburst and reflare of MAXI J1348$-$630. We simultaneously fit the time-averaged energy spectrum of the source and the fractional rms and phase-lag spectra of the QPO with the time-dependent Comptonization model vKompth. The data can be explained by two physically connected coronae interacting with the accretion disc via a feedback loop of X-ray photons. The best-fitting model consists of a corona of $\sim$10$^3$ km located at the inner edge of the disc and a second corona of $\sim$10$^4$ km horizontally extended and covering the inner parts of the accretion disc. The properties of both coronae during the reflare are similar to those during the low/hard state of the main outburst, reinforcing the idea that both the outburst and the reflare are driven by the same physical mechanisms. We combine our results for the type-C QPO with those from previous work focused on the study of type-A and type-B QPOs with the same model to study the evolution of the geometry of the corona through the whole outburst, including the reflare of MAXI J1348$-$630. Finally, we show that the sudden increase in the phase-lag frequency spectrum and the sharp drop in the coherence function previously observed in MAXI J1348$-$630 are due to the type-C QPO during the decay of the outburst and can be explained in terms of the geometry of the coronae., Comment: 18 pages, 8 figures, 1 table. Submitted to ApJ
- Published
- 2024
26. Data Augmentation for Sparse Multidimensional Learning Performance Data Using Generative AI
- Author
-
Zhang, Liang, Lin, Jionghao, Sabatini, John, Borchers, Conrad, Weitekamp, Daniel, Cao, Meng, Hollander, John, Hu, Xiangen, and Graesser, Arthur C.
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Learning performance data describe correct and incorrect answers or problem-solving attempts in adaptive learning, such as in intelligent tutoring systems (ITSs). Learning performance data tend to be highly sparse (80\%\(\sim\)90\% missing observations) in most real-world applications due to adaptive item selection. This data sparsity presents challenges to using learner models to effectively predict future performance explore new hypotheses about learning. This article proposes a systematic framework for augmenting learner data to address data sparsity in learning performance data. First, learning performance is represented as a three-dimensional tensor of learners' questions, answers, and attempts, capturing longitudinal knowledge states during learning. Second, a tensor factorization method is used to impute missing values in sparse tensors of collected learner data, thereby grounding the imputation on knowledge tracing tasks that predict missing performance values based on real observations. Third, a module for generating patterns of learning is used. This study contrasts two forms of generative Artificial Intelligence (AI), including Generative Adversarial Networks (GANs) and Generate Pre-Trained Transformers (GPT) to generate data associated with different clusters of learner data. We tested this approach on an adult literacy dataset from AutoTutor lessons developed for Adult Reading Comprehension (ARC). We found that: (1) tensor factorization improved the performance in tracing and predicting knowledge mastery compared with other knowledge tracing techniques without data augmentation, showing higher relative fidelity for this imputation method, and (2) the GAN-based simulation showed greater overall stability and less statistical bias based on a divergence evaluation with varying simulation sample sizes compared to GPT.
- Published
- 2024
27. Cycle Pixel Difference Network for Crisp Edge Detection
- Author
-
Liu, Changsong, Zhang, Wei, Liu, Yanyan, Li, Mingyang, Li, Wenlin, Fan, Yimeng, Bai, Xiangnan, and Zhang, Liang
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
Edge detection, as a fundamental task in computer vision, has garnered increasing attention. The advent of deep learning has significantly advanced this field. However, recent deep learning-based methods generally face two significant issues: 1) reliance on large-scale pre-trained weights, and 2) generation of thick edges. We construct a U-shape encoder-decoder model named CPD-Net that successfully addresses these two issues simultaneously. In response to issue 1), we propose a novel cycle pixel difference convolution (CPDC), which effectively integrates edge prior knowledge with modern convolution operations, consequently successfully eliminating the dependence on large-scale pre-trained weights. As for issue 2), we construct a multi-scale information enhancement module (MSEM) and a dual residual connection-based (DRC) decoder to enhance the edge location ability of the model, thereby generating crisp and clean contour maps. Comprehensive experiments conducted on four standard benchmarks demonstrate that our method achieves competitive performance on the BSDS500 dataset (ODS=0.813 and AC=0.352), NYUD-V2 (ODS=0.760 and AC=0.223), BIPED dataset (ODS=0.898 and AC=0.426), and CID (ODS=0.59). Our approach provides a novel perspective for addressing these challenges in edge detection.
- Published
- 2024
28. mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding
- Author
-
Hu, Anwen, Xu, Haiyang, Zhang, Liang, Ye, Jiabo, Yan, Ming, Zhang, Ji, Jin, Qin, Huang, Fei, and Zhou, Jingren
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Multimodel Large Language Models(MLLMs) have achieved promising OCR-free Document Understanding performance by increasing the supported resolution of document images. However, this comes at the cost of generating thousands of visual tokens for a single document image, leading to excessive GPU memory and slower inference times, particularly in multi-page document comprehension. In this work, to address these challenges, we propose a High-resolution DocCompressor module to compress each high-resolution document image into 324 tokens, guided by low-resolution global visual features. With this compression module, to strengthen multi-page document comprehension ability and balance both token efficiency and question-answering performance, we develop the DocOwl2 under a three-stage training framework: Single-image Pretraining, Multi-image Continue-pretraining, and Multi-task Finetuning. DocOwl2 sets a new state-of-the-art across multi-page document understanding benchmarks and reduces first token latency by more than 50%, demonstrating advanced capabilities in multi-page questioning answering, explanation with evidence pages, and cross-page structure understanding. Additionally, compared to single-image MLLMs trained on similar data, our DocOwl2 achieves comparable single-page understanding performance with less than 20% of the visual tokens. Our codes, models, and data are publicly available at https://github.com/X-PLUG/mPLUG-DocOwl/tree/main/DocOwl2., Comment: 15 pages, 7 figures
- Published
- 2024
29. An Adaptive Hot Ranking Algorithm for Popular Item Recommendation in the Express Industry
- Author
-
Li, Bohan, Zeng, Qingwei, Ren, Pan, Chen, Huan, Geng, Yankun, Zhang, Liang-Jie, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Xu, Ruifeng, editor, Chen, Huan, editor, Wu, Yirui, editor, and Zhang, Liang-Jie, editor
- Published
- 2025
- Full Text
- View/download PDF
30. A Survey on Anomaly Detection with Few-Shot Learning
- Author
-
Chen, Junyang, Wang, Changbo, Hong, Yifan, Mi, Rui, Zhang, Liang-Jie, Wu, Yirui, Wang, Huan, Zhou, Yue, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Xu, Ruifeng, editor, Chen, Huan, editor, Wu, Yirui, editor, and Zhang, Liang-Jie, editor
- Published
- 2025
- Full Text
- View/download PDF
31. A Review of Link Prediction on Heterogeneous Networks
- Author
-
Mi, Rui, Wang, Changbo, Zhang, Liang-Jie, Wu, Yirui, Chen, Junyang, Wang, Huan, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Xu, Ruifeng, editor, Chen, Huan, editor, Wu, Yirui, editor, and Zhang, Liang-Jie, editor
- Published
- 2025
- Full Text
- View/download PDF
32. A Paradigm Shift to Causal Model-Driven Decision-Making With Generative AI
- Author
-
He, Sheng, Ning, Yishuang, Zhang, Liang-Jie, Lei, Kai, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Pan, Xiuqin, editor, Huang, Mengxing, editor, Zhang, Jiajia, editor, Chen, Junyang, editor, and Zhang, Liang-Jie, editor
- Published
- 2025
- Full Text
- View/download PDF
33. Accelerating Blockchain Application Development: Integrating Blockchain as a Service Within Low-Code Platforms
- Author
-
He, Sheng, Huang, Qinglin, Jiao, Shaoshuai, Lin, Zepeng, Lin, Jinxuan, Ren, Jun, Xiong, Dengbin, Zhang, Liang-Jie, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Feng, Jun, editor, He, Songlin, editor, and Zhang, Liang-Jie, editor
- Published
- 2025
- Full Text
- View/download PDF
34. COSIS: An AI-Enabled Digital Transformation Framework Integrating Large Language Models and Key Performance Indicators
- Author
-
Zhang, Liang-Jie, Chen, Huan, He, Sheng, Li, Changhu, Chen, Junyang, Zhang, Haodi, Du, Wenfeng, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, He, Sheng, editor, and Zhang, Liang-Jie, editor
- Published
- 2025
- Full Text
- View/download PDF
35. Engineered melatonin-pretreated plasma exosomes repair traumatic spinal cord injury by regulating miR-138-5p/SOX4 axis mediated microglia polarization.
- Author
-
Chen, Hao, Sun, Huihui, Yang, Yaqing, Wang, Pingchuan, Chen, Xizhao, Yin, Junxiang, Li, Aoying, Zhang, Liang, Cai, Jun, Huang, Jijun, Zhang, Shengfei, Zhang, Zhiqiang, Feng, Xinmin, Yin, Jian, Wang, Yongxiang, Xiong, Wu, and Wan, Bowen
- Subjects
Engineered melatonin-pretreated plasma exosomes ,MiR-138-5p/SOX4 axis ,Spinal cord injury - Abstract
BACKGROUND: Neuroinflammation plays a crucial role in the repair of spinal cord injury (SCI), with microglia, pivotal in neuroinflammation, driving either degeneration or recovery in this pathological process. Recently, plasma-derived exosomes (denoted Exos) have presented a high capacity for promoting functional recovery of SCI through the anti-inflammatory effects, and pretreated exosomes are associated with better outcomes. Thus, we aimed to explore whether melatonin-pretreated plasma-derived exosomes (denoted MExo) could exert superior effects on SCI, and attempted to elucidate the potential mechanisms. METHODS: Electron microscopy, nanoparticle tracking analysis, and western blot were applied to delineate the distinctions between Exos and MExos. To assess their therapeutic potentials, we established a contusion SCI rat model, complemented by a battery of in vitro experiments comparing both groups. Subsequently, a miRNA microarray analysis was conducted, followed by a series of rescue experiments to elucidate the specific role of miRNAs in MExos. To further delve into the molecular mechanisms involved, we employed western blot analysis and the luciferase reporter gene assay. RESULTS: Melatonin promoted the release of exosome from plasma, concurrently amplifying their anti-inflammatory properties. Furthermore, it was discerned that MExos facilitated a transition in microglia polarization from M1 to M2 phenotype, a phenomenon more pronounced than that observed with Exos. In an endeavor to elucidate this variance, we scrutinized miRNAs exhibiting elevated expression levels in MExos, pinpointing miR-138-5p as a pivotal element in this dynamic. Following this, an in-depth investigation into the role of miR-138-5p was undertaken, which uncovered its efficacy in driving phenotypic alterations within microglia. The analysis of downstream genes targeted by miR-138-5p revealed that it exerted a negative regulatory influence on SOX4, which was found to obstruct the generation of M2-type microglia and the secretion of anti-inflammatory cytokines, thereby partially elucidating the mechanism behind miR-138-5ps regulation of microglia polarization. CONCLUSIONS: We innovatively observed that melatonin enhanced the anti-inflammatory function of Exos, which further decreased the expression of SOX4 by delivering miR-138-5p. This inhibition promoted the conversion of M1 microglia to M2 microglia, thus offering a viable option for the treatment of SCI. THE TRANSLATIONAL POTENTIAL OF THIS ARTICLE: This study highlights that melatonin enhances the anti-inflammatory function of Exos through delivery of miR-138-5p. Activation of miR-138-5p/SOX4 axis by engineered melatonin-pretreated plasma exosomes may be a potential target for SCI treatment.
- Published
- 2024
36. Modeling Reference-dependent Choices with Graph Neural Networks
- Author
-
Zhang, Liang, Liu, Guannan, Wu, Junjie, and Tan, Yong
- Subjects
Computer Science - Machine Learning ,Computer Science - Computers and Society - Abstract
While the classic Prospect Theory has highlighted the reference-dependent and comparative nature of consumers' product evaluation processes, few models have successfully integrated this theoretical hypothesis into data-driven preference quantification, particularly in the realm of recommender systems development. To bridge this gap, we propose a new research problem of modeling reference-dependent preferences from a data-driven perspective, and design a novel deep learning-based framework named Attributed Reference-dependent Choice Model for Recommendation (ArcRec) to tackle the inherent challenges associated with this problem. ArcRec features in building a reference network from aggregated historical purchase records for instantiating theoretical reference points, which is then decomposed into product attribute specific sub-networks and represented through Graph Neural Networks. In this way, the reference points of a consumer can be encoded at the attribute-level individually from her past experiences but also reflect the crowd influences. ArcRec also makes novel contributions to quantifying consumers' reference-dependent preferences using a deep neural network-based utility function that integrates both interest-inspired and price-inspired preferences, with their complex interaction effects captured by an attribute-aware price sensitivity mechanism. Most importantly, ArcRec introduces a novel Attribute-level Willingness-To-Pay measure to the reference-dependent utility function, which captures a consumer's heterogeneous salience of product attributes via observing her attribute-level price tolerance to a product. Empirical evaluations on both synthetic and real-world online shopping datasets demonstrate ArcRec's superior performances over fourteen state-of-the-art baselines.
- Published
- 2024
37. Microsatellite-based real-time quantum key distribution
- Author
-
Li, Yang, Cai, Wen-Qi, Ren, Ji-Gang, Wang, Chao-Ze, Yang, Meng, Zhang, Liang, Wu, Hui-Ying, Chang, Liang, Wu, Jin-Cai, Jin, Biao, Xue, Hua-Jian, Li, Xue-Jiao, Liu, Hui, Yu, Guang-Wen, Tao, Xue-Ying, Chen, Ting, Liu, Chong-Fei, Luo, Wen-Bin, Zhou, Jie, Yong, Hai-Lin, Li, Yu-Huai, Li, Feng-Zhi, Jiang, Cong, Chen, Hao-Ze, Wu, Chao, Tong, Xin-Hai, Xie, Si-Jiang, Zhou, Fei, Liu, Wei-Yue, Liu, Nai-Le, Li, Li, Xu, Feihu, Cao, Yuan, Yin, Juan, Shu, Rong, Wang, Xiang-Bin, Zhang, Qiang, Wang, Jian-Yu, Liao, Sheng-Kai, Peng, Cheng-Zhi, and Pan, Jian-Wei
- Subjects
Quantum Physics - Abstract
A quantum network provides an infrastructure connecting quantum devices with revolutionary computing, sensing, and communication capabilities. As the best-known application of a quantum network, quantum key distribution (QKD) shares secure keys guaranteed by the laws of quantum mechanics. A quantum satellite constellation offers a solution to facilitate the quantum network on a global scale. The Micius satellite has verified the feasibility of satellite quantum communications, however, scaling up quantum satellite constellations is challenging, requiring small lightweight satellites, portable ground stations and real-time secure key exchange. Here we tackle these challenges and report the development of a quantum microsatellite capable of performing space-to-ground QKD using portable ground stations. The quantum microsatellite features a payload weighing approximately 23 kg, while the portable ground station weighs about 100 kg. These weights represent reductions by more than an order and two orders of magnitude, respectively, compared to the Micius satellite. Additionally, we multiplex bidirectional satellite-ground optical communication with quantum communication, enabling key distillation and secure communication in real-time. Using the microsatellite and the portable ground stations, we demonstrate satellite-based QKD with multiple ground stations and achieve the sharing of up to 0.59 million bits of secure keys during a single satellite pass. The compact quantum payload can be readily assembled on existing space stations or small satellites, paving the way for a satellite-constellation-based quantum and classical network for widespread real-life applications., Comment: 40 pages, 8 figures
- Published
- 2024
38. Unraveling the hybrid origins of the X-ray non-thermal emission from IGR J17091-3624
- Author
-
Lin, Zikun, Wang, Yanan, del Palacio, Santiago, Méndez, Mariano, Zhang, Shuang-Nan, Russell, Thomas D., Ji, Long, Zhang, Jin, Zhang, Liang, Altamirano, Diego, and Liu, Jifeng
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
We present a comprehensive study based on multi-wavelength observations from the NuSTAR, NICER, Swift, Fermi, NEOWISE, and ATCA telescopes during the 2022 outburst of the black hole X-ray binary IGR J17091-3624. Our investigation concentrates on the heartbeat-like variability in the X-ray emission, with the aim of using it as a tool to unravel the origin of the non-thermal emission during the heartbeat state. Through X-ray timing and spectral analysis, we observe that the heartbeat-like variability correlates with changes in the disk temperature, supporting the disk radiation pressure instability scenario. Moreover, in addition to a Comptonization component, our time-averaged and phase-resolved spectroscopy reveal the presence of a power-law component that varies independently from the disk component. Combined with the radio to X-ray spectral energy distribution fitting, our results suggest that the power-law component could originate from synchrotron self-Compton radiation in the jet, which requires a strong magnetic field of about $B = (0.3$-$3.5)\times10^6$ G. Additionally, assuming that IGR J17091-3624 and GRS 1915+105 share the same radio-X-ray correlation coefficient during both the hard and the heartbeat states, we obtain a distance of $13.7\pm2.3$ kpc for IGR J17091-3624., Comment: 19 pages, 11 figures, 3 tables; accepted for publication in ApJ
- Published
- 2024
39. Quantum Long Short-Term Memory for Drug Discovery
- Author
-
Zhang, Liang, Xu, Yin, Wu, Mohan, Wang, Liang, and Xu, Hua
- Subjects
Quantum Physics ,Computer Science - Machine Learning ,Quantitative Biology - Biomolecules - Abstract
Quantum computing combined with machine learning (ML) is an extremely promising research area, with numerous studies demonstrating that quantum machine learning (QML) is expected to solve scientific problems more effectively than classical ML. In this work, we successfully apply QML to drug discovery, showing that QML can significantly improve model performance and achieve faster convergence compared to classical ML. Moreover, we demonstrate that the model accuracy of the QML improves as the number of qubits increases. We also introduce noise to the QML model and find that it has little effect on our experimental conclusions, illustrating the high robustness of the QML model. This work highlights the potential application of quantum computing to yield significant benefits for scientific advancement as the qubit quantity increase and quality improvement in the future.
- Published
- 2024
40. Generative Adversarial Networks for Imputing Sparse Learning Performance
- Author
-
Zhang, Liang, Yeasin, Mohammed, Lin, Jionghao, Havugimana, Felix, and Hu, Xiangen
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Learning performance data, such as correct or incorrect responses to questions in Intelligent Tutoring Systems (ITSs) is crucial for tracking and assessing the learners' progress and mastery of knowledge. However, the issue of data sparsity, characterized by unexplored questions and missing attempts, hampers accurate assessment and the provision of tailored, personalized instruction within ITSs. This paper proposes using the Generative Adversarial Imputation Networks (GAIN) framework to impute sparse learning performance data, reconstructed into a three-dimensional (3D) tensor representation across the dimensions of learners, questions and attempts. Our customized GAIN-based method computational process imputes sparse data in a 3D tensor space, significantly enhanced by convolutional neural networks for its input and output layers. This adaptation also includes the use of a least squares loss function for optimization and aligns the shapes of the input and output with the dimensions of the questions-attempts matrices along the learners' dimension. Through extensive experiments on six datasets from various ITSs, including AutoTutor, ASSISTments and MATHia, we demonstrate that the GAIN approach generally outperforms existing methods such as tensor factorization and other generative adversarial network (GAN) based approaches in terms of imputation accuracy. This finding enhances comprehensive learning data modeling and analytics in AI-based education.
- Published
- 2024
41. Phase-resolved Spectroscopy of Low-frequency Quasi-periodic Oscillations from the Newly Discovered Black Hole X-ray Binary Swift J1727.8-1613
- Author
-
Shui, Qing-Cang, Zhang, Shu, Peng, Jiang-Qiang, Zhang, Shuang-Nan, Chen, Yu-Peng, Ji, Long, Kong, Ling-Da, Feng, Hua, Yu, Zhuo-Li, Wang, Peng-Ju, Chang, Zhi, Yin, Hong-Xing, Qu, Jin-Lu, Tao, Lian, Ge, Ming-Yu, Zhang, Liang, and Li, Jian
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
Low-frequency quasi-periodic oscillations (LFQPOs) are commonly observed in X-ray light curves of black hole X-ray binaries (BHXRBs); however, their origin remains a topic of debate. In order to thoroughly investigate variations in spectral properties on the QPO timescale, we utilized the Hilbert-Huang transform technique to conduct phase-resolved spectroscopy across a broad energy band for LFQPOs in the newly discovered BHXRB Swift J1727.8-1613. This is achieved through quasi-simultaneous observations from Neutron star Interior Composition ExploreR (NICER), Nuclear Spectroscopic Telescope ARray (NuSTAR), and Hard X-ray Modulation Telescope (Insight-HXMT). Our analysis reveals that both the non-thermal and disk-blackbody components exhibit variations on the QPO timescale, with the former dominating the QPO variability. For the spectral parameters, we observe modulation of the disk temperature, spectral indices, and reflection fraction with the QPO phase with high statistical significance (>5\sigma). Notably, the variation in the disk temperature is found to precede the variations in the non-thermal and disk fluxes by ~0.4-0.5 QPO cycles. We suggest that these findings offer further evidence that the type-C QPO variability is a result of geometric effects of the accretion flow., Comment: Accepted for pulication in The Astrophysical Journal
- Published
- 2024
42. HHGT: Hierarchical Heterogeneous Graph Transformer for Heterogeneous Graph Representation Learning
- Author
-
Zhu, Qiuyu, Zhang, Liang, Xu, Qianxiong, Liu, Kaijun, Long, Cheng, and Wang, Xiaoyang
- Subjects
Computer Science - Machine Learning ,Computer Science - Databases - Abstract
Despite the success of Heterogeneous Graph Neural Networks (HGNNs) in modeling real-world Heterogeneous Information Networks (HINs), challenges such as expressiveness limitations and over-smoothing have prompted researchers to explore Graph Transformers (GTs) for enhanced HIN representation learning. However, research on GT in HINs remains limited, with two key shortcomings in existing work: (1) A node's neighbors at different distances in HINs convey diverse semantics. Unfortunately, existing methods ignore such differences and uniformly treat neighbors within a given distance in a coarse manner, which results in semantic confusion. (2) Nodes in HINs have various types, each with unique semantics. Nevertheless, existing methods mix nodes of different types during neighbor aggregation, hindering the capture of proper correlations between nodes of diverse types. To bridge these gaps, we design an innovative structure named (k,t)-ring neighborhood, where nodes are initially organized by their distance, forming different non-overlapping k-ring neighborhoods for each distance. Within each k-ring structure, nodes are further categorized into different groups according to their types, thus emphasizing the heterogeneity of both distances and types in HINs naturally. Based on this structure, we propose a novel Hierarchical Heterogeneous Graph Transformer (HHGT) model, which seamlessly integrates a Type-level Transformer for aggregating nodes of different types within each k-ring neighborhood, followed by a Ring-level Transformer for aggregating different k-ring neighborhoods in a hierarchical manner. Extensive experiments are conducted on downstream tasks to verify HHGT's superiority over 14 baselines, with a notable improvement of up to 24.75% in NMI and 29.25% in ARI for node clustering task on the ACM dataset compared to the best baseline.
- Published
- 2024
43. A blazar in the epoch of reionization
- Author
-
Banados, Eduardo, Momjian, Emmanuel, Connor, Thomas, Belladitta, Silvia, Decarli, Roberto, Mazzucchelli, Chiara, Venemans, Bram P., Walter, Fabian, Wang, Feige, Xie, Zhang-Liang, Barth, Aaron J., Eilers, Anna-Christina, Fan, Xiaohui, Khusanova, Yana, Schindler, Jan-Torge, Stern, Daniel, Yang, Jinyi, Andika, Irham Taufik, Carilli, Chris, Farina, Emanuele P., Fabian, Andrew, Hennawi, Joseph F., Pensabene, Antonio, and Rojas-Ruiz, Sofia
- Subjects
Astrophysics - Astrophysics of Galaxies ,Astrophysics - High Energy Astrophysical Phenomena - Abstract
Relativistic jets are thought to play a crucial role in the formation and evolution of massive galaxies and supermassive black holes. Blazars, which are quasars with jets aligned along our line of sight, provide insights into the jetted population and have been observed up to redshifts of z=6.1. Here, we report the discovery and multi-wavelength characterization of the blazar VLASS J041009.05-013919.88 at z=7 (Universe's age ~750 Myr), powered by a ~7x10^8 Msun black hole. The presence of this high-redshift blazar implies a large population of similar but unaligned jetted sources in the early Universe. Our findings suggest two possible scenarios: in one, the jet in J0410-0139 is intrinsically low-power but appears highly luminous due to relativistic beaming, suggesting that most UV-bright quasars at this redshift host jets. Alternatively, if J0410-0139 represents an intrinsically powerful radio source, there should be hundreds to thousands of radio-quiet quasars at z~7 with properties similar to J0410-0139, a prediction in tension with observed quasar densities based on their UV luminosity function. These results support the hypothesis that rapid black hole growth in the early Universe may be driven by jet-enhanced or obscured super-Eddington accretion, potentially playing a key role in forming massive black holes during the epoch of reionization., Comment: Updated to match accepted/final version in Nature Astronomy
- Published
- 2024
- Full Text
- View/download PDF
44. A timing view of the additional high-energy spectral component discovered in the black hole candidate Swift J1727.8-1613
- Author
-
Yang, Zi-Xu, Zhang, Liang, Zhang, Shuang-Nan, Tao, L., Zhang, Shu, Ma, Ruican, Bu, Qingcui, Huang, Yue, Liu, He-Xin, Yu, Wei, Xiao, Guang C., Wang, Peng-Ju, Feng, Hua, Song, Li-Ming, Ma, Xiang, Ge, Mingyu, Zhao, QingChang, and Qu, J. L.
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
We present an energy-dependent analysis for the type-C quasi-periodic oscillations (QPOs) observed in the black hole X-ray binary Swift J1727.8-1613 using Insight-HXMT observations. We find that the QPO fractional rms at energies above 40 keV is significantly higher than that below 20 keV. This is the first report of a high energy (HE)-rms excess in the rms spectrum of a black hole X-ray binary. In the high energy band, an extra hard component is observed in additional to the standard thermal Comptonization component at similar energy band. The value of the QPO HE-rms excess is not only correlated with the disk parameters and the photon index of the standard Comptonization component, but also exhibits a moderate positive correlation with the flux of the additional hard spectral component. No features in the QPO phase-lag spectra are seen corresponding to the additional hard component. We propose that the additional hard component in the spectrum may originate from jet emission and the associated QPO HE-rms excess can be explained by the precession of the jet base.
- Published
- 2024
45. DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition
- Author
-
Wang, Qi, Xu, Zhou, Lin, Yuming, Ye, Jingtao, Li, Hongsheng, Zhu, Guangming, Shah, Syed Afaq Ali, Bennamoun, Mohammed, and Zhang, Liang
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Neuromorphic sensors, specifically event cameras, revolutionize visual data acquisition by capturing pixel intensity changes with exceptional dynamic range, minimal latency, and energy efficiency, setting them apart from conventional frame-based cameras. The distinctive capabilities of event cameras have ignited significant interest in the domain of event-based action recognition, recognizing their vast potential for advancement. However, the development in this field is currently slowed by the lack of comprehensive, large-scale datasets, which are critical for developing robust recognition frameworks. To bridge this gap, we introduces DailyDVS-200, a meticulously curated benchmark dataset tailored for the event-based action recognition community. DailyDVS-200 is extensive, covering 200 action categories across real-world scenarios, recorded by 47 participants, and comprises more than 22,000 event sequences. This dataset is designed to reflect a broad spectrum of action types, scene complexities, and data acquisition diversity. Each sequence in the dataset is annotated with 14 attributes, ensuring a detailed characterization of the recorded actions. Moreover, DailyDVS-200 is structured to facilitate a wide range of research paths, offering a solid foundation for both validating existing approaches and inspiring novel methodologies. By setting a new benchmark in the field, we challenge the current limitations of neuromorphic data processing and invite a surge of new approaches in event-based action recognition techniques, which paves the way for future explorations in neuromorphic computing and beyond. The dataset and source code are available at https://github.com/QiWang233/DailyDVS-200., Comment: Accepted to ECCV 2024
- Published
- 2024
46. Scaling Data-Driven Building Energy Modelling using Large Language Models
- Author
-
Khadka, Sunil and Zhang, Liang
- Subjects
Computer Science - Software Engineering ,Computer Science - Artificial Intelligence - Abstract
Building Management System (BMS) through a data-driven method always faces data and model scalability issues. We propose a methodology to tackle the scalability challenges associated with the development of data-driven models for BMS by using Large Language Models (LLMs). LLMs' code generation adaptability can enable broader adoption of BMS by "automating the automation," particularly the data handling and data-driven modeling processes. In this paper, we use LLMs to generate code that processes structured data from BMS and build data-driven models for BMS's specific requirements. This eliminates the need for manual data and model development, reducing the time, effort, and cost associated with this process. Our hypothesis is that LLMs can incorporate domain knowledge about data science and BMS into data processing and modeling, ensuring that the data-driven modeling is automated for specific requirements of different building types and control objectives, which also improves accuracy and scalability. We generate a prompt template following the framework of Machine Learning Operations so that the prompts are designed to systematically generate Python code for data-driven modeling. Our case study indicates that bi-sequential prompting under the prompt template can achieve a high success rate of code generation and code accuracy, and significantly reduce human labor costs.
- Published
- 2024
47. Towards Better Graph-based Cross-document Relation Extraction via Non-bridge Entity Enhancement and Prediction Debiasing
- Author
-
Yue, Hao, Lai, Shaopeng, Yang, Chengyi, Zhang, Liang, Yao, Junfeng, and Su, Jinsong
- Subjects
Computer Science - Computation and Language - Abstract
Cross-document Relation Extraction aims to predict the relation between target entities located in different documents. In this regard, the dominant models commonly retain useful information for relation prediction via bridge entities, which allows the model to elaborately capture the intrinsic interdependence between target entities. However, these studies ignore the non-bridge entities, each of which co-occurs with only one target entity and offers the semantic association between target entities for relation prediction. Besides, the commonly-used dataset--CodRED contains substantial NA instances, leading to the prediction bias during inference. To address these issues, in this paper, we propose a novel graph-based cross-document RE model with non-bridge entity enhancement and prediction debiasing. Specifically, we use a unified entity graph to integrate numerous non-bridge entities with target entities and bridge entities, modeling various associations between them, and then use a graph recurrent network to encode this graph. Finally, we introduce a novel debiasing strategy to calibrate the original prediction distribution. Experimental results on the closed and open settings show that our model significantly outperforms all baselines, including the GPT-3.5-turbo and InstructUIE, achieving state-of-the-art performance. Particularly, our model obtains 66.23% and 55.87% AUC points in the official leaderboard\footnote{\url{https://codalab.lisn.upsaclay.fr/competitions/3770#results}} under the two settings, respectively, ranking the first place in all submissions since December 2023. Our code is available at https://github.com/DeepLearnXMU/CoRE-NEPD., Comment: Accepted to ACL 2024 Findings
- Published
- 2024
48. Integrating Attentional Factors and Spacing in Logistic Knowledge Tracing Models to Explore the Impact of Training Sequences on Category Learning
- Author
-
Cao, Meng, Pavlik Jr., Philip I., Chu, Wei, and Zhang, Liang
- Subjects
Computer Science - Computers and Society ,Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
In category learning, a growing body of literature has increasingly focused on exploring the impacts of interleaving in contrast to blocking. The sequential attention hypothesis posits that interleaving draws attention to the differences between categories while blocking directs attention toward similarities within categories. Although a recent study underscores the joint influence of memory and attentional factors on sequencing effects, there remains a scarcity of effective computational models integrating both attentional and memory considerations to comprehensively understand the effect of training sequences on students' performance. This study introduces a novel integration of attentional factors and spacing into the logistic knowledge tracing (LKT) models to monitor students' performance across different training sequences (interleaving and blocking). Attentional factors were incorporated by recording the counts of comparisons between adjacent trials, considering whether they belong to the same or different category. Several features were employed to account for temporal spacing. We used cross-validations to test the model fit and predictions on the learning session and posttest. Our findings reveal that incorporating both attentional factors and spacing features in the Additive Factors Model (AFM) significantly enhances its capacity to capture the effects of interleaving and blocking and demonstrates superior predictive accuracy for students' learning outcomes. By bridging the gap between attentional factors and memory processes, our computational approach offers a more comprehensive framework for understanding and predicting category learning outcomes in educational settings., Comment: 7 pages, 3 figures, Educational Data Mining 2024
- Published
- 2024
49. SPL: A Socratic Playground for Learning Powered by Large Language Model
- Author
-
Zhang, Liang, Lin, Jionghao, Kuang, Ziyi, Xu, Sheng, and Hu, Xiangen
- Subjects
Computer Science - Artificial Intelligence - Abstract
Dialogue-based Intelligent Tutoring Systems (ITSs) have significantly advanced adaptive and personalized learning by automating sophisticated human tutoring strategies within interactive dialogues. However, replicating the nuanced patterns of expert human communication remains a challenge in Natural Language Processing (NLP). Recent advancements in NLP, particularly Large Language Models (LLMs) such as OpenAI's GPT-4, offer promising solutions by providing human-like and context-aware responses based on extensive pre-trained knowledge. Motivated by the effectiveness of LLMs in various educational tasks (e.g., content creation and summarization, problem-solving, and automated feedback provision), our study introduces the Socratic Playground for Learning (SPL), a dialogue-based ITS powered by the GPT-4 model, which employs the Socratic teaching method to foster critical thinking among learners. Through extensive prompt engineering, SPL can generate specific learning scenarios and facilitates efficient multi-turn tutoring dialogues. The SPL system aims to enhance personalized and adaptive learning experiences tailored to individual needs, specifically focusing on improving critical thinking skills. Our pilot experimental results from essay writing tasks demonstrate SPL has the potential to improve tutoring interactions and further enhance dialogue-based ITS functionalities. Our study, exemplified by SPL, demonstrates how LLMs enhance dialogue-based ITSs and expand the accessibility and efficacy of educational technologies.
- Published
- 2024
50. Learning to utilize image second-order derivative information for crisp edge detection
- Author
-
Liu, Changsong, Fan, Yimeng, Li, Mingyang, Zhang, Wei, Liu, Yanyan, Li, Yuming, Li, Wenlin, and Zhang, Liang
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Edge detection is a fundamental task in computer vision. It has made great progress under the development of deep convolutional neural networks (DCNNs), some of which have achieved a beyond human-level performance. However, recent top-performing edge detection methods tend to generate thick and noisy edge lines. In this work, we solve this problem from two aspects: (1) the lack of prior knowledge regarding image edges, and (2) the issue of imbalanced pixel distribution. We propose a second-order derivative-based multi-scale contextual enhancement module (SDMCM) to help the model locate true edge pixels accurately by introducing the edge prior knowledge. We also construct a hybrid focal loss function (HFL) to alleviate the imbalanced distribution issue. In addition, we employ the conditionally parameterized convolution (CondConv) to develop a novel boundary refinement module (BRM), which can further refine the final output edge maps. In the end, we propose a U-shape network named LUS-Net which is based on the SDMCM and BRM for crisp edge detection. We perform extensive experiments on three standard benchmarks, and the experiment results illustrate that our method can predict crisp and clean edge maps and achieves state-of-the-art performance on the BSDS500 dataset (ODS=0.829), NYUD-V2 dataset (ODS=0.768), and BIPED dataset (ODS=0.903).
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.