98,586 results on '"Chu, P."'
Search Results
2. Investigating Sequencing as a Means to Computational Thinking in Young Learners
- Author
-
Kristina M. Tank, Anne Ottenbreit-Leftwich, Tamara J. Moore, Sohheon Yang, Zarina Wafula, Jiyoung Kim, Bárbara Fagundes, and Lin Chu
- Abstract
Within the field of K-2 computer science (CS) education, unplugged computational thinking (CT) activities have been suggested as beneficial for younger students and shown to impact young students' skills and motivation to learn about CS. This study sought to examine how children demonstrate CT competencies in unplugged sequencing tasks and how children use manipulatives to solve unplugged sequencing tasks. This case study approach examined two unplugged sequencing tasks for six children ranging from ages five to eight (pre-kindergarten to 2nd grade). Children showed evidence of several CT competencies during the sequencing tasks: (1) pattern recognition, (2) algorithms and procedures, (3) problem decomposition, and (4) debugging. The strategies and use of manipulatives to showcase CT competencies seemed to evolve in complexity based on age and developmental levels. Taking into account children's abilities to demonstrate CT competencies, this study suggests that sequencing is a developmentally appropriate entry point for young children to begin engaging in other CT competencies. In addition, these unplugged sequencing tasks can also be easily integrated into other activities commonly experienced in early childhood classrooms.
- Published
- 2024
3. 'So Hard, but so Rewarding:' How School System Leaders Are Scaling up Strategic School Staffing Models
- Author
-
Arizona State University (ASU), Center on Reinventing Public Education (CRPE), Lisa Chu, Lydia Rainey, and Steven Weiner
- Abstract
Innovative staffing models are promising, but challenging to scale up. What does the work of leading strategic staffing involve, and what could make scaling up easier? This report digs deep into the many challenges system leaders face when scaling up innovative staffing solutions. These leaders are trying to address longstanding teacher shortages and retention challenges by rethinking everything, including who they hire and how they design the job, provide support, build trust, and uproot old assumptions about the teaching role. The early results are promising: these leaders report fewer vacancies, higher staff satisfaction, and improved student learning experiences. This work is "so hard, but so rewarding"--and it could be much more manageable if policymakers, technical assistance providers, and researchers stepped up to share the load.
- Published
- 2024
4. Prioritize Denoising Steps on Diffusion Model Preference Alignment via Explicit Denoised Distribution Estimation
- Author
-
Shi, Dingyuan, Wang, Yong, Li, Hangyu, and Chu, Xiangxiang
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Diffusion models have shown remarkable success in text-to-image generation, making alignment methods for these models increasingly important. A key challenge is the sparsity of preference labels, which are typically available only at the terminal of denoising trajectories. This raises the issue of how to assign credit across denoising steps based on these sparse labels. In this paper, we propose Denoised Distribution Estimation (DDE), a novel method for credit assignment. Unlike previous approaches that rely on auxiliary models or hand-crafted schemes, DDE derives its strategy more explicitly. The proposed DDE directly estimates the terminal denoised distribution from the perspective of each step. It is equipped with two estimation strategies and capable of representing the entire denoising trajectory with a single model inference. Theoretically and empirically, we show that DDE prioritizes optimizing the middle part of the denoising trajectory, resulting in a novel and effective credit assignment scheme. Extensive experiments demonstrate that our approach achieves superior performance, both quantitatively and qualitatively.
- Published
- 2024
5. MMGenBench: Evaluating the Limits of LMMs from the Text-to-Image Generation Perspective
- Author
-
Huang, Hailang, Wang, Yong, Huang, Zixuan, Li, Huaqiu, Huang, Tongwen, Chu, Xiangxiang, and Zhang, Richong
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Large Multimodal Models (LMMs) have demonstrated remarkable capabilities. While existing benchmarks for evaluating LMMs mainly focus on image comprehension, few works evaluate them from the image generation perspective. To address this issue, we propose a straightforward automated evaluation pipeline. Specifically, this pipeline requires LMMs to generate an image-prompt from a given input image. Subsequently, it employs text-to-image generative models to create a new image based on these generated prompts. Finally, we evaluate the performance of LMMs by comparing the original image with the generated one. Furthermore, we introduce MMGenBench-Test, a comprehensive benchmark developed to evaluate LMMs across 13 distinct image patterns, and MMGenBench-Domain, targeting the performance evaluation of LMMs within the generative image domain. A thorough evaluation involving over 50 popular LMMs demonstrates the effectiveness and reliability in both the pipeline and benchmark. Our observations indicate that numerous LMMs excelling in existing benchmarks fail to adequately complete the basic tasks, related to image understanding and description. This finding highlights the substantial potential for performance improvement in current LMMs and suggests avenues for future model optimization. Concurrently, our pipeline facilitates the efficient assessment of LMMs performance across diverse domains by using solely image inputs., Comment: This project is available at: https://github.com/lerogo/MMGenBench
- Published
- 2024
6. Leveraging Prior Experience: An Expandable Auxiliary Knowledge Base for Text-to-SQL
- Author
-
Chu, Zhibo, Wang, Zichong, and Qin, Qitao
- Subjects
Computer Science - Computation and Language - Abstract
Large Language Models (LLMs) exhibit impressive problem-solving skills across many tasks, but they still underperform compared to humans in various downstream applications, such as text-to-SQL. On the BIRD benchmark leaderboard, human performance achieves an accuracy of 92.96\%, whereas the top-performing method reaches only 72.39\%. Notably, these state-of-the-art (SoTA) methods predominantly rely on in-context learning to simulate human-like reasoning. However, they overlook a critical human skill: continual learning. Inspired by the educational practice of maintaining mistake notebooks during our formative years, we propose LPE-SQL (Leveraging Prior Experience: An Expandable Auxiliary Knowledge Base for Text-to-SQL), a novel framework designed to augment LLMs by enabling continual learning without requiring parameter fine-tuning. LPE-SQL consists of four modules that \textbf{i)} retrieve relevant entries, \textbf{ii)} efficient sql generation, \textbf{iii)} generate the final result through a cross-consistency mechanism and \textbf{iv)} log successful and failed tasks along with their reasoning processes or reflection-generated tips. Importantly, the core module of LPE-SQL is the fourth one, while the other modules employ foundational methods, allowing LPE-SQL to be easily integrated with SoTA technologies to further enhance performance. Our experimental results demonstrate that this continual learning approach yields substantial performance gains, with the smaller Llama-3.1-70B model with surpassing the performance of the larger Llama-3.1-405B model using SoTA methods.
- Published
- 2024
7. Large-scale cross-modality pretrained model enhances cardiovascular state estimation and cardiomyopathy detection from electrocardiograms: An AI system development and multi-center validation study
- Author
-
Ding, Zhengyao, Hu, Yujian, Xu, Youyao, Zhao, Chengchen, Li, Ziyu, Mao, Yiheng, Li, Haitao, Li, Qian, Wang, Jing, Chen, Yue, Chen, Mengjia, Wang, Longbo, Chu, Xuesen, Pan, Weichao, Liu, Ziyi, Wu, Fei, Zhang, Hongkun, Chen, Ting, and Huang, Zhengxing
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Cardiovascular diseases (CVDs) present significant challenges for early and accurate diagnosis. While cardiac magnetic resonance imaging (CMR) is the gold standard for assessing cardiac function and diagnosing CVDs, its high cost and technical complexity limit accessibility. In contrast, electrocardiography (ECG) offers promise for large-scale early screening. This study introduces CardiacNets, an innovative model that enhances ECG analysis by leveraging the diagnostic strengths of CMR through cross-modal contrastive learning and generative pretraining. CardiacNets serves two primary functions: (1) it evaluates detailed cardiac function indicators and screens for potential CVDs, including coronary artery disease, cardiomyopathy, pericarditis, heart failure and pulmonary hypertension, using ECG input; and (2) it enhances interpretability by generating high-quality CMR images from ECG data. We train and validate the proposed CardiacNets on two large-scale public datasets (the UK Biobank with 41,519 individuals and the MIMIC-IV-ECG comprising 501,172 samples) as well as three private datasets (FAHZU with 410 individuals, SAHZU with 464 individuals, and QPH with 338 individuals), and the findings demonstrate that CardiacNets consistently outperforms traditional ECG-only models, substantially improving screening accuracy. Furthermore, the generated CMR images provide valuable diagnostic support for physicians of all experience levels. This proof-of-concept study highlights how ECG can facilitate cross-modal insights into cardiac function assessment, paving the way for enhanced CVD screening and diagnosis at a population level., Comment: 23 pages, 8 figures
- Published
- 2024
8. Heuristic-Free Multi-Teacher Learning
- Author
-
Nguyen, Huy Thong, Chu, En-Hung, Melvix, Lenord, Jiao, Jazon, Wen, Chunglin, and Louie, Benjamin
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition - Abstract
We introduce Teacher2Task, a novel framework for multi-teacher learning that eliminates the need for manual aggregation heuristics. Existing multi-teacher methods typically rely on such heuristics to combine predictions from multiple teachers, often resulting in sub-optimal aggregated labels and the propagation of aggregation errors. Teacher2Task addresses these limitations by introducing teacher-specific input tokens and reformulating the training process. Instead of relying on aggregated labels, the framework transforms the training data, consisting of ground truth labels and annotations from N teachers, into N+1 distinct tasks: N auxiliary tasks that predict the labeling styles of the N individual teachers, and one primary task that focuses on the ground truth labels. This approach, drawing upon principles from multiple learning paradigms, demonstrates strong empirical results across a range of architectures, modalities, and tasks.
- Published
- 2024
9. OrigamiPlot: An R Package and Shiny Web App Enhanced Visualizations for Multivariate Data
- Author
-
Lu, Yiwen, Tong, Jiayi, Lei, Yuqing, Sutton, Alex J., Chu, Haitao, Levine, Lisa D., Lumley, Thomas, Asch, David A., Duan, Rui, Schmid, Christopher H., and Chen, Yong
- Subjects
Computer Science - Human-Computer Interaction ,Statistics - Methodology - Abstract
We introduce OrigamiPlot, an open-source R package and Shiny web application designed to enhance the visualization of multivariate data. This package implements the origami plot, a novel visualization technique proposed by Duan et al. in 2023, which improves upon traditional radar charts by ensuring that the area of the connected region is invariant to the ordering of attributes, addressing a key limitation of radar charts. The software facilitates multivariate decision-making by supporting comparisons across multiple objects and attributes, offering customizable features such as auxiliary axes and weighted attributes for enhanced clarity. Through the R package and user-friendly Shiny interface, researchers can efficiently create and customize plots without requiring extensive programming knowledge. Demonstrated using network meta-analysis as a real-world example, OrigamiPlot proves to be a versatile tool for visualizing multivariate data across various fields. This package opens new opportunities for simplifying decision-making processes with complex data.
- Published
- 2024
10. Light Cone Distribution Amplitude for the $\Lambda$ Baryon from Lattice QCD
- Author
-
Chu, Min-Huan, Bai, Haoyang, Hua, Jun, Liang, Jian, Ji, Xiangdong, Schafer, Andreas, Su, Yushan, Wang, Wei, Yang, Yi-Bo, Zeng, Jun, Zhang, Jian-Hui, and Zhang, Qi-An
- Subjects
High Energy Physics - Lattice - Abstract
We calculate the leading-twist light-cone distribution amplitudes of the light $\Lambda$ baryon using lattice methods within the framework of large momentum effective theory. Our numerical computations are conducted employing $N_f=2+1$ stout smeared clover fermions and a Symanzik gauge action on a lattice with spacing $a=0.077\;\rm{fm}$, and a pion mass of 303 MeV. To approach the large momentum regime, we simulate the equal-time correlations with the hadron momentum $P^z = \{2.52, 3.02, 3.52\}$ GeV. By investigating the potential analytic characteristics of the baryon quasi-distribution amplitude in coordinate space, we validate these findings through our lattice calculations. After renormalization and extrapolation, we present results for the three-dimensional distribution of momentum fractions for the two light quarks. Based on these findings the paper briefly discusses the phenomenological impact on weak decays of $\Lambda_b$, and outlines potential systematic uncertainties that can be improved in the future. This work lays the theoretical foundation for accessing baryon LCDAs from lattice QCD.
- Published
- 2024
11. VMGNet: A Low Computational Complexity Robotic Grasping Network Based on VMamba with Multi-Scale Feature Fusion
- Author
-
Jin, Yuhao, Gao, Qizhong, Zhu, Xiaohui, Yue, Yong, Lim, Eng Gee, Chen, Yuqing, Wong, Prudence, and Chu, Yijie
- Subjects
Computer Science - Robotics ,Computer Science - Computer Vision and Pattern Recognition - Abstract
While deep learning-based robotic grasping technology has demonstrated strong adaptability, its computational complexity has also significantly increased, making it unsuitable for scenarios with high real-time requirements. Therefore, we propose a low computational complexity and high accuracy model named VMGNet for robotic grasping. For the first time, we introduce the Visual State Space into the robotic grasping field to achieve linear computational complexity, thereby greatly reducing the model's computational cost. Meanwhile, to improve the accuracy of the model, we propose an efficient and lightweight multi-scale feature fusion module, named Fusion Bridge Module, to extract and fuse information at different scales. We also present a new loss function calculation method to enhance the importance differences between subtasks, improving the model's fitting ability. Experiments show that VMGNet has only 8.7G Floating Point Operations and an inference time of 8.1 ms on our devices. VMGNet also achieved state-of-the-art performance on the Cornell and Jacquard public datasets. To validate VMGNet's effectiveness in practical applications, we conducted real grasping experiments in multi-object scenarios, and VMGNet achieved an excellent performance with a 94.4% success rate in real-world grasping tasks. The video for the real-world robotic grasping experiments is available at https://youtu.be/S-QHBtbmLc4.
- Published
- 2024
12. On the Incorporation of Stability Constraints into Sequential Operational Scheduling
- Author
-
Xu, Wangkun, Chu, Zhongda, Capitanescu, Florin, and Teng, Fei
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
With the increasing penetration of Inverter-Based Resources (IBRs), power system stability constraints must be incorporated into the operational framework, transforming it into stability-constrained optimization. Currently, there exist parallel research efforts on developing the stability constraints within DC power flow-based unit commitment (UC) and AC Optimal Power Flow (OPF). However, few studies discuss how including such constraints can interact with each other and eventually impact grid stability. In this context, this work simulates a realistic power system decision making framework and provides a thorough analysis on the necessity of incorporating frequency nadir and small signal stability constraints into these sequentially connected two operation stages. The simulation results demonstrate that including both stability constraints in the UC is essential to maintain power system stability, while the inclusion in AC OPF can further improve the stability index.
- Published
- 2024
13. Revealing Structure and Symmetry of Nonlinearity in Natural and Engineering Flows
- Author
-
Yeung, Brandon, Chu, Tianyi, and Schmidt, Oliver T.
- Subjects
Physics - Fluid Dynamics ,Nonlinear Sciences - Chaotic Dynamics - Abstract
Energy transfer across scales is fundamental in fluid dynamics, linking large-scale flow motions to small-scale turbulent structures in engineering and natural environments. Triadic interactions among three wave components form complex networks across scales, challenging understanding and model reduction. We introduce Triadic Orthogonal Decomposition (TOD), a method that identifies coherent flow structures optimally capturing spectral momentum transfer, quantifies their coupling and energy exchange in an energy budget bispectrum, and reveals the regions where they interact. TOD distinguishes three components--a momentum recipient, donor, and catalyst--and recovers laws governing pairwise, six-triad, and global triad conservation. Applied to unsteady cylinder wake and wind turbine wake data, TOD reveals networks of triadic interactions with forward and backward energy transfer across frequencies and scales., Comment: BY and TC are equal contributors to this work and designated as co-first authors
- Published
- 2024
14. Space-time structure and particle-fluid duality of solutions for Boltzmann equation with hard potentials
- Author
-
Lin, Yu-Chu, Wang, Haitao, and Wu, Kung-Chien
- Subjects
Mathematics - Analysis of PDEs - Abstract
We study the quantitative pointwise behavior of solutions to the Boltzmann equation for hard potentials and Maxwellian molecules, which generalize the hard sphere case introduced by Liu-Yu in 2004 (Comm. Pure Appl. Math. 57:1543-1608, 2004). The large time behavior of the solution is dominated by fluid structures, similar to the hard sphere case. However, unlike hard sphere, the spatial decay here depends on the potential power $\gamma$ and the initial velocity weight. A key challenge in this problem is the loss of velocity weight in linear estimates, which makes standard nonlinear iteration infeasible. To address this, we develop an Enhanced Mixture Lemma, demonstrating that mixing the transport and gain parts of the linearized collision operator can generate arbitrary-order regularity and decay in both space and velocity variables. This allows us to decompose the linearized solution into fluid (arbitrary regularity and velocity decay) and particle (rapid space-time decay, but with loss of velocity decay) parts, making it possible to solve the nonlinear problem through this particle-fluid duality., Comment: 37 pages
- Published
- 2024
15. JailbreakLens: Interpreting Jailbreak Mechanism in the Lens of Representation and Circuit
- Author
-
He, Zeqing, Wang, Zhibo, Chu, Zhixuan, Xu, Huiyu, Zheng, Rui, Ren, Kui, and Chen, Chun
- Subjects
Computer Science - Cryptography and Security - Abstract
Despite the outstanding performance of Large language models (LLMs) in diverse tasks, they are vulnerable to jailbreak attacks, wherein adversarial prompts are crafted to bypass their security mechanisms and elicit unexpected responses.Although jailbreak attacks are prevalent, the understanding of their underlying mechanisms remains limited. Recent studies have explain typical jailbreaking behavior (e.g., the degree to which the model refuses to respond) of LLMs by analyzing the representation shifts in their latent space caused by jailbreak prompts or identifying key neurons that contribute to the success of these attacks. However, these studies neither explore diverse jailbreak patterns nor provide a fine-grained explanation from the failure of circuit to the changes of representational, leaving significant gaps in uncovering the jailbreak mechanism. In this paper, we propose JailbreakLens, an interpretation framework that analyzes jailbreak mechanisms from both representation (which reveals how jailbreaks alter the model's harmfulness perception) and circuit perspectives (which uncovers the causes of these deceptions by identifying key circuits contributing to the vulnerability), tracking their evolution throughout the entire response generation process. We then conduct an in-depth evaluation of jailbreak behavior on four mainstream LLMs under seven jailbreak strategies. Our evaluation finds that jailbreak prompts amplify components that reinforce affirmative responses while suppressing those that produce refusal. Although this manipulation shifts model representations toward safe clusters to deceive the LLM, leading it to provide detailed responses instead of refusals, it still produce abnormal activation which can be caught in the circuit analysis., Comment: 18 pages, 10 figures
- Published
- 2024
16. BioNeMo Framework: a modular, high-performance library for AI model development in drug discovery
- Author
-
John, Peter St., Lin, Dejun, Binder, Polina, Greaves, Malcolm, Shah, Vega, John, John St., Lange, Adrian, Hsu, Patrick, Illango, Rajesh, Ramanathan, Arvind, Anandkumar, Anima, Brookes, David H, Busia, Akosua, Mahajan, Abhishaike, Malina, Stephen, Prasad, Neha, Sinai, Sam, Edwards, Lindsay, Gaudelet, Thomas, Regep, Cristian, Steinegger, Martin, Rost, Burkhard, Brace, Alexander, Hippe, Kyle, Naef, Luca, Kamata, Keisuke, Armstrong, George, Boyd, Kevin, Cao, Zhonglin, Chou, Han-Yi, Chu, Simon, Costa, Allan dos Santos, Darabi, Sajad, Dawson, Eric, Didi, Kieran, Fu, Cong, Geiger, Mario, Gill, Michelle, Hsu, Darren, Kaushik, Gagan, Korshunova, Maria, Kothen-Hill, Steven, Lee, Youhan, Liu, Meng, Livne, Micha, McClure, Zachary, Mitchell, Jonathan, Moradzadeh, Alireza, Mosafi, Ohad, Nashed, Youssef, Paliwal, Saee, Peng, Yuxing, Rabhi, Sara, Ramezanghorbani, Farhad, Reidenbach, Danny, Ricketts, Camir, Roland, Brian, Shah, Kushal, Shimko, Tyler, Sirelkhatim, Hassan, Srinivasan, Savitha, Stern, Abraham C, Toczydlowska, Dorota, Veccham, Srimukh Prasad, Venanzi, Niccolò Alberto Elia, Vorontsov, Anton, Wilber, Jared, Wilkinson, Isabel, Wong, Wei Jing, Xue, Eva, Ye, Cory, Yu, Xin, Zhang, Yang, Zhou, Guoqing, Zandstein, Becca, Dallago, Christian, Trentini, Bruno, Kucukbenli, Emine, Rvachov, Timur, Calleja, Eddie, Israeli, Johnny, Clifford, Harry, Haukioja, Risto, Haemel, Nicholas, Tretina, Kyle, Tadimeti, Neha, and Costa, Anthony B
- Subjects
Computer Science - Machine Learning ,Quantitative Biology - Biomolecules - Abstract
Artificial Intelligence models encoding biology and chemistry are opening new routes to high-throughput and high-quality in-silico drug development. However, their training increasingly relies on computational scale, with recent protein language models (pLM) training on hundreds of graphical processing units (GPUs). We introduce the BioNeMo Framework to facilitate the training of computational biology and chemistry AI models across hundreds of GPUs. Its modular design allows the integration of individual components, such as data loaders, into existing workflows and is open to community contributions. We detail technical features of the BioNeMo Framework through use cases such as pLM pre-training and fine-tuning. On 256 NVIDIA A100s, BioNeMo Framework trains a three billion parameter BERT-based pLM on over one trillion tokens in 4.2 days. The BioNeMo Framework is open-source and free for everyone to use.
- Published
- 2024
17. Domain Adaptation-based Edge Computing for Cross-Conditions Fault Diagnosis
- Author
-
Wang, Yanzhi, Wang, Chu, Wu, Jinhong, Yu, Ziyang, and Zhou, Qi
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Artificial Intelligence ,Computer Science - Software Engineering - Abstract
Fault diagnosis technology supports the healthy operation of mechanical equipment. However, the variations conditions during the operation of mechanical equipment lead to significant disparities in data distribution, posing challenges to fault diagnosis. Furthermore, when deploying applications, traditional methods often encounter issues such as latency and data security. Therefore, conducting fault diagnosis and deploying application methods under cross-operating conditions holds significant value. This paper proposes a domain adaptation-based lightweight fault diagnosis framework for edge computing scenarios. Incorporating the local maximum mean discrepancy into knowledge transfer aligns the feature distributions of different domains in a high-dimensional feature space, to discover a common feature space across domains. The acquired fault diagnosis expertise from the cloud-model is transferred to the lightweight edge-model using adaptation knowledge transfer methods. While ensuring real-time diagnostic capabilities, accurate fault diagnosis is achieved across working conditions. We conducted validation experiments on the NVIDIA Jetson Xavier NX kit. In terms of diagnostic performance, the proposed method significantly improved diagnostic accuracy, with average increases of 34.44% and 17.33% compared to the comparison method, respectively. Regarding lightweight effectiveness, proposed method achieved an average inference speed increase of 80.47%. Additionally, compared to the cloud-model, the parameter count of the edge-model decreased by 96.37%, while the Flops decreased by 83.08%., Comment: 28 pages, 11 figures
- Published
- 2024
18. Explanation for Trajectory Planning using Multi-modal Large Language Model for Autonomous Driving
- Author
-
Yamazaki, Shota, Zhang, Chenyu, Nanri, Takuya, Shigekane, Akio, Wang, Siyuan, Nishiyama, Jo, Chu, Tao, and Yokosawa, Kohei
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Robotics - Abstract
End-to-end style autonomous driving models have been developed recently. These models lack interpretability of decision-making process from perception to control of the ego vehicle, resulting in anxiety for passengers. To alleviate it, it is effective to build a model which outputs captions describing future behaviors of the ego vehicle and their reason. However, the existing approaches generate reasoning text that inadequately reflects the future plans of the ego vehicle, because they train models to output captions using momentary control signals as inputs. In this study, we propose a reasoning model that takes future planning trajectories of the ego vehicle as inputs to solve this limitation with the dataset newly collected., Comment: Accepted and presented at ECCV 2024 2nd Workshop on Vision-Centric Autonomous Driving (VCAD) on September 30, 2024. 13 pages, 5 figures
- Published
- 2024
19. WavChat: A Survey of Spoken Dialogue Models
- Author
-
Ji, Shengpeng, Chen, Yifu, Fang, Minghui, Zuo, Jialong, Lu, Jingyu, Wang, Hanting, Jiang, Ziyue, Zhou, Long, Liu, Shujie, Cheng, Xize, Yang, Xiaoda, Wang, Zehan, Yang, Qian, Li, Jian, Jiang, Yidi, He, Jingzhen, Chu, Yunfei, Xu, Jin, and Zhao, Zhou
- Subjects
Electrical Engineering and Systems Science - Audio and Speech Processing ,Computer Science - Computation and Language ,Computer Science - Machine Learning ,Computer Science - Multimedia ,Computer Science - Sound - Abstract
Recent advancements in spoken dialogue models, exemplified by systems like GPT-4o, have captured significant attention in the speech domain. Compared to traditional three-tier cascaded spoken dialogue models that comprise speech recognition (ASR), large language models (LLMs), and text-to-speech (TTS), modern spoken dialogue models exhibit greater intelligence. These advanced spoken dialogue models not only comprehend audio, music, and other speech-related features, but also capture stylistic and timbral characteristics in speech. Moreover, they generate high-quality, multi-turn speech responses with low latency, enabling real-time interaction through simultaneous listening and speaking capability. Despite the progress in spoken dialogue systems, there is a lack of comprehensive surveys that systematically organize and analyze these systems and the underlying technologies. To address this, we have first compiled existing spoken dialogue systems in the chronological order and categorized them into the cascaded and end-to-end paradigms. We then provide an in-depth overview of the core technologies in spoken dialogue models, covering aspects such as speech representation, training paradigm, streaming, duplex, and interaction capabilities. Each section discusses the limitations of these technologies and outlines considerations for future research. Additionally, we present a thorough review of relevant datasets, evaluation metrics, and benchmarks from the perspectives of training and evaluating spoken dialogue systems. We hope this survey will contribute to advancing both academic research and industrial applications in the field of spoken dialogue systems. The related material is available at https://github.com/jishengpeng/WavChat., Comment: 60 papes, working in progress
- Published
- 2024
20. One or two poles for the $\Xi(1820)$?
- Author
-
Molina, R., Liang, Wei-Hong, Xiao, Chu-Wen, Sun, Zhi-Feng, and Oset, E.
- Subjects
High Energy Physics - Phenomenology - Abstract
In this talk, we present a new interpretation for the recently observed $\Xi(1820)$ resonance. We recall that the chiral unitary approach for the interaction of pseudoscalar mesons with the baryons of the decuplet predicts two states for the $\Xi(1820)$ resonance, one with a narrow width and the other one with a large width. We contrast this fact with the recent BESIII measurement of the $K^- \Lambda$ mass distribution in the $\psi(3686)$ decay to $K^- \Lambda \bar\Xi^+ $, which demands a width much larger than the average of the PDG, and show how the consideration of the two $\Xi(1820)$ states provides a natural explanation to this apparent contradiction., Comment: Proceeding of the QNP 2024 Conference. 6 pages, 3 figures, 1 table. arXiv admin note: substantial text overlap with arXiv:2309.03618
- Published
- 2024
21. OpenLS-DGF: An Adaptive Open-Source Dataset Generation Framework for Machine Learning Tasks in Logic Synthesis
- Author
-
Ni, Liwei, Wang, Rui, Liu, Miao, Meng, Xingyu, Lin, Xiaoze, Liu, Junfeng, Luo, Guojie, Chu, Zhufei, Qian, Weikang, Yang, Xiaoyan, Xie, Biwei, Li, Xingquan, and Li, Huawei
- Subjects
Computer Science - Artificial Intelligence - Abstract
This paper introduces OpenLS-DGF, an adaptive logic synthesis dataset generation framework, to enhance machine learning~(ML) applications within the logic synthesis process. Previous dataset generation flows were tailored for specific tasks or lacked integrated machine learning capabilities. While OpenLS-DGF supports various machine learning tasks by encapsulating the three fundamental steps of logic synthesis: Boolean representation, logic optimization, and technology mapping. It preserves the original information in both Verilog and machine-learning-friendly GraphML formats. The verilog files offer semi-customizable capabilities, enabling researchers to insert additional steps and incrementally refine the generated dataset. Furthermore, OpenLS-DGF includes an adaptive circuit engine that facilitates the final dataset management and downstream tasks. The generated OpenLS-D-v1 dataset comprises 46 combinational designs from established benchmarks, totaling over 966,000 Boolean circuits. OpenLS-D-v1 supports integrating new data features, making it more versatile for new challenges. This paper demonstrates the versatility of OpenLS-D-v1 through four distinct downstream tasks: circuit classification, circuit ranking, quality of results (QoR) prediction, and probability prediction. Each task is chosen to represent essential steps of logic synthesis, and the experimental results show the generated dataset from OpenLS-DGF achieves prominent diversity and applicability. The source code and datasets are available at https://github.com/Logic-Factory/ACE/blob/master/OpenLS-DGF/readme.md., Comment: 14 pages
- Published
- 2024
22. Estimating unknown parameters in differential equations with a reinforcement learning based PSO method
- Author
-
Sun, Wenkui, Fan, Xiaoya, Jia, Lijuan, Chu, Tinyi, Yau, Shing-Tung, Wu, Rongling, and Wang, Zhong
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Differential equations offer a foundational yet powerful framework for modeling interactions within complex dynamic systems and are widely applied across numerous scientific fields. One common challenge in this area is estimating the unknown parameters of these dynamic relationships. However, traditional numerical optimization methods rely on the selection of initial parameter values, making them prone to local optima. Meanwhile, deep learning and Bayesian methods require training models on specific differential equations, resulting in poor versatility. This paper reformulates the parameter estimation problem of differential equations as an optimization problem by introducing the concept of particles from the particle swarm optimization algorithm. Building on reinforcement learning-based particle swarm optimization (RLLPSO), this paper proposes a novel method, DERLPSO, for estimating unknown parameters of differential equations. We compared its performance on three typical ordinary differential equations with the state-of-the-art methods, including the RLLPSO algorithm, traditional numerical methods, deep learning approaches, and Bayesian methods. The experimental results demonstrate that our DERLPSO consistently outperforms other methods in terms of performance, achieving an average Mean Square Error of 1.13e-05, which reduces the error by approximately 4 orders of magnitude compared to other methods. Apart from ordinary differential equations, our DERLPSO also show great promise for estimating unknown parameters of partial differential equations. The DERLPSO method proposed in this paper has high accuracy, is independent of initial parameter values, and possesses strong versatility and stability. This work provides new insights into unknown parameter estimation for differential equations.
- Published
- 2024
23. Scale Contrastive Learning with Selective Attentions for Blind Image Quality Assessment
- Author
-
Huang, Zihao, Li, Xudong, Fu, Bohan, Chu, Xiaohui, Li, Ke, Shen, Yunhang, and Zhang, Yan
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Blind image quality assessment (BIQA) serves as a fundamental task in computer vision, yet it often fails to consistently align with human subjective perception. Recent advances show that multi-scale evaluation strategies are promising due to their ability to replicate the hierarchical structure of human vision. However, the effectiveness of these strategies is limited by a lack of understanding of how different image scales influence perceived quality. This paper addresses two primary challenges: the significant redundancy of information across different scales, and the confusion caused by combining features from these scales, which may vary widely in quality. To this end, a new multi-scale BIQA framework is proposed, namely Contrast-Constrained Scale-Focused IQA Framework (CSFIQA). CSFIQA features a selective focus attention mechanism to minimize information redundancy and highlight critical quality-related information. Additionally, CSFIQA includes a scale-level contrastive learning module equipped with a noise sample matching mechanism to identify quality discrepancies across the same image content at different scales. By exploring the intrinsic relationship between image scales and the perceived quality, the proposed CSFIQA achieves leading performance on eight benchmark datasets, e.g., achieving SRCC values of 0.967 (versus 0.947 in CSIQ) and 0.905 (versus 0.876 in LIVEC).
- Published
- 2024
24. Emergence of steady quantum transport in a superconducting processor
- Author
-
Zhang, Pengfei, Gao, Yu, Xu, Xiansong, Wang, Ning, Dong, Hang, Guo, Chu, Deng, Jinfeng, Zhang, Xu, Chen, Jiachen, Xu, Shibo, Wang, Ke, Wu, Yaozu, Zhang, Chuanyu, Jin, Feitong, Zhu, Xuhao, Zhang, Aosai, Zou, Yiren, Tan, Ziqi, Cui, Zhengyi, Zhu, Zitian, Shen, Fanhao, Li, Tingting, Zhong, Jiarun, Bao, Zehang, Zhao, Liangtian, Hao, Jie, Li, Hekang, Wang, Zhen, Song, Chao, Guo, Qiujiang, Wang, H., and Poletti, Dario
- Subjects
Quantum Physics ,Condensed Matter - Mesoscale and Nanoscale Physics ,Condensed Matter - Statistical Mechanics - Abstract
Non-equilibrium quantum transport is crucial to technological advances ranging from nanoelectronics to thermal management. In essence, it deals with the coherent transfer of energy and (quasi-)particles through quantum channels between thermodynamic baths. A complete understanding of quantum transport thus requires the ability to simulate and probe macroscopic and microscopic physics on equal footing. Using a superconducting quantum processor, we demonstrate the emergence of non-equilibrium steady quantum transport by emulating the baths with qubit ladders and realising steady particle currents between the baths. We experimentally show that the currents are independent of the microscopic details of bath initialisation, and their temporal fluctuations decrease rapidly with the size of the baths, emulating those predicted by thermodynamic baths. The above characteristics are experimental evidence of pure-state statistical mechanics and prethermalisation in non-equilibrium many-body quantum systems. Furthermore, by utilising precise controls and measurements with single-site resolution, we demonstrate the capability to tune steady currents by manipulating the macroscopic properties of the baths, including filling and spectral properties. Our investigation paves the way for a new generation of experimental exploration of non-equilibrium quantum transport in strongly correlated quantum matter., Comment: 7 pages, 4 figures
- Published
- 2024
25. Rigorous enclosure of Lyapunov exponents of stochastic flows
- Author
-
Breden, Maxime, Chu, Hugo, Lamb, Jeroen S. W., and Rasmussen, Martin
- Subjects
Mathematics - Dynamical Systems ,Mathematics - Numerical Analysis ,Mathematics - Probability ,37M25, 37H15, 65P30, 60J22, 65G20 - Abstract
We develop a powerful and general method to provide arbitrarily accurate rigorous upper and lower bounds for Lyapunov exponents of stochastic flows. Our approach is based on computer-assisted tools, the adjoint method and established results on the ergodicity of diffusion processes. We do not require any structural assumptions on the stochastic system and work under mild hypoellipticity conditions outside of perturbative regimes. Therefore, our method allows for the treatment of systems that were so far inaccessible from existing mathematical tools. We demonstrate our method to exhibit the chaotic nature of three non-Hamiltonian systems. Finally, we show that our approach is robust to continuation methods to produce bounds on Lyapunov exponents for large parameter regions.
- Published
- 2024
26. Return-forecasting and Volatility-forecasting Power of On-chain Activities in the Cryptocurrency Market
- Author
-
Chi, Yeguang, Qionghua, Chu, and Hao, Wenyan
- Subjects
Economics - Econometrics - Abstract
We investigate the return-forecasting and volatility-forecasting power of intraday on-chain flow data for BTC, ETH, and USDT, and the associated option strategies. First, we find that USDT net inflow into cryptocurrency exchanges positively forecasts future returns of both BTC and ETH, with the strongest effect at the 1-hour frequency. Second, we find that ETH net inflow into cryptocurrency exchanges negatively forecasts future returns of ETH. Third, we find that BTC net inflow into cryptocurrency exchanges does not significantly forecast future returns of BTC. Finally, we confirm that selling 0DTE ETH call options is a profitable trading strategy when the net inflow into cryptocurrency exchanges is high. Our study lends new insights into the emerging literature that studies the on-chain activities and their asset-pricing impact in the cryptocurrency market.
- Published
- 2024
27. A Survey of AI-Related Cyber Security Risks and Countermeasures in Mobility-as-a-Service
- Author
-
Chu, Kai-Fung, Yuan, Haiyue, Yuan, Jinsheng, Guo, Weisi, Balta-Ozkan, Nazmiye, and Li, Shujun
- Subjects
Computer Science - Cryptography and Security - Abstract
Mobility-as-a-Service (MaaS) integrates different transport modalities and can support more personalisation of travellers' journey planning based on their individual preferences, behaviours and wishes. To fully achieve the potential of MaaS, a range of AI (including machine learning and data mining) algorithms are needed to learn personal requirements and needs, to optimise journey planning of each traveller and all travellers as a whole, to help transport service operators and relevant governmental bodies to operate and plan their services, and to detect and prevent cyber attacks from various threat actors including dishonest and malicious travellers and transport operators. The increasing use of different AI and data processing algorithms in both centralised and distributed settings opens the MaaS ecosystem up to diverse cyber and privacy attacks at both the AI algorithm level and the connectivity surfaces. In this paper, we present the first comprehensive review on the coupling between AI-driven MaaS design and the diverse cyber security challenges related to cyber attacks and countermeasures. In particular, we focus on how current and emerging AI-facilitated privacy risks (profiling, inference, and third-party threats) and adversarial AI attacks (evasion, extraction, and gamification) may impact the MaaS ecosystem. These risks often combine novel attacks (e.g., inverse learning) with traditional attack vectors (e.g., man-in-the-middle attacks), exacerbating the risks for the wider participation actors and the emergence of new business models.
- Published
- 2024
- Full Text
- View/download PDF
28. Inversion-based Latent Bayesian Optimization
- Author
-
Chu, Jaewon, Park, Jinyoung, Lee, Seunghun, and Kim, Hyunwoo J.
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Latent Bayesian optimization (LBO) approaches have successfully adopted Bayesian optimization over a continuous latent space by employing an encoder-decoder architecture to address the challenge of optimization in a high dimensional or discrete input space. LBO learns a surrogate model to approximate the black-box objective function in the latent space. However, we observed that most LBO methods suffer from the `misalignment problem`, which is induced by the reconstruction error of the encoder-decoder architecture. It hinders learning an accurate surrogate model and generating high-quality solutions. In addition, several trust region-based LBO methods select the anchor, the center of the trust region, based solely on the objective function value without considering the trust region`s potential to enhance the optimization process. To address these issues, we propose Inversion-based Latent Bayesian Optimization (InvBO), a plug-and-play module for LBO. InvBO consists of two components: an inversion method and a potential-aware trust region anchor selection. The inversion method searches the latent code that completely reconstructs the given target data. The potential-aware trust region anchor selection considers the potential capability of the trust region for better local optimization. Experimental results demonstrate the effectiveness of InvBO on nine real-world benchmarks, such as molecule design and arithmetic expression fitting tasks. Code is available at https://github.com/mlvlab/InvBO., Comment: Accepted to NeurIPS 2024
- Published
- 2024
29. Internal Solitary Wave Generation Using A Jet-Array Wavemaker
- Author
-
Chu, Jen-Ping, Luhar, Mitul, and Lynett, Partrick
- Subjects
Physics - Fluid Dynamics - Abstract
This paper evaluates the experimental generation of internal solitary waves (ISWs) in a miscible two-layer system with a free surface using a jet-array wavemaker (JAW). Unlike traditional gate-release experiments, the JAW system generates ISWs by forcing a prescribed vertical distribution of mass flux. Experiments examine three different layer-depth ratios, with ISW amplitudes up to the maximum allowed by the extended Korteweg-de Vries (eKdV) solution. Phase speeds and wave profiles are captured via planar laser-induced fluorescence and the velocity field is measured synchronously using particle imaging velocimetry. Measured properties are directly compared with the eKdV predictions. As expected, small- and intermediate-amplitude waves match well with the corresponding eKdV solutions, with errors in amplitude and phase speed below 10%. For large waves with amplitudes approaching the maximum allowed by the eKdV solution, the phase speed and the velocity profiles resemble the eKdV solution while the wave profiles are distorted following the trough. This can potentially be attributed to Kelvin-Helmholtz instabilities forming at the pycnocline. Larger errors are generally observed when the local Richardson number at the JAW inlet exceeds the threshold for instability., Comment: 25 pages, 11 figures
- Published
- 2024
30. OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
- Author
-
Huang, Siming, Cheng, Tianhao, Liu, J. K., Hao, Jiaran, Song, Liuyihan, Xu, Yang, Yang, J., Liu, J. H., Zhang, Chenchen, Chai, Linzheng, Yuan, Ruifeng, Zhang, Zhaoxiang, Fu, Jie, Liu, Qian, Zhang, Ge, Wang, Zili, Qi, Yuan, Xu, Yinghui, and Chu, Wei
- Subjects
Computer Science - Computation and Language ,Computer Science - Programming Languages - Abstract
Large language models (LLMs) for code have become indispensable in various domains, including code generation, reasoning tasks and agent systems. While open-access code LLMs are increasingly approaching the performance levels of proprietary models, high-quality code LLMs suitable for rigorous scientific investigation, particularly those with reproducible data processing pipelines and transparent training protocols, remain limited. The scarcity is due to various challenges, including resource constraints, ethical considerations, and the competitive advantages of keeping models advanced. To address the gap, we introduce OpenCoder, a top-tier code LLM that not only achieves performance comparable to leading models but also serves as an "open cookbook" for the research community. Unlike most prior efforts, we release not only model weights and inference code, but also the reproducible training data, complete data processing pipeline, rigorous experimental ablation results, and detailed training protocols for open scientific research. Through this comprehensive release, we identify the key ingredients for building a top-tier code LLM: (1) code optimized heuristic rules for data cleaning and methods for data deduplication, (2) recall of text corpus related to code and (3) high-quality synthetic data in both annealing and supervised fine-tuning stages. By offering this level of openness, we aim to broaden access to all aspects of a top-tier code LLM, with OpenCoder serving as both a powerful model and an open foundation to accelerate research, and enable reproducible advancements in code AI.
- Published
- 2024
31. Lee Bounds with a Continuous Treatment in Sample Selection
- Author
-
Lee, Ying-Ying and Liu, Chu-An
- Subjects
Economics - Econometrics - Abstract
Sample selection problems arise when treatment affects both the outcome and the researcher's ability to observe it. This paper generalizes Lee (2009) bounds for the average treatment effect of a binary treatment to a continuous/multivalued treatment. We evaluate the Job Crops program to study the causal effect of training hours on wages. To identify the average treatment effect of always-takers who are selected regardless of the treatment values, we assume that if a subject is selected at some sufficient treatment values, then it remains selected at all treatment values. For example, if program participants are employed with one month of training, then they remain employed with any training hours. This sufficient treatment values assumption includes the monotone assumption on the treatment effect on selection as a special case. We further allow the conditional independence assumption and subjects with different pretreatment covariates to have different sufficient treatment values. The estimation and inference theory utilize the orthogonal moment function and cross-fitting for double debiased machine learning.
- Published
- 2024
32. Error Controlled Cubic Spline Interpolation in CNC Technology
- Author
-
Ze-Wei, Cai, Wei, Zheng, Yi-Ru, Bao, Hai-Long, Chu, and Meng, Wu
- Subjects
Computer Science - Computational Geometry - Abstract
Traditional CNC technology mostly uses the method of increasing the degree of interpolation polynomial when constructing $C^2$ continuous NURBS curves, but this often leads to the appearance of Runge phenomenon in interpolation curves. Alternatively,the method of adding boundary conditions at the endpoints can often make it difficult to control the error range of the interpolation curve. This article presents a $C^2$ continuous cubic B-spline curve interpolation method,which achieves $C^2$ continuity of the interpolation curve when the interpolation polynomial is cubic. At the same time, this article also studies the corresponding error control methods., Comment: in Chinese language
- Published
- 2024
33. Pedestrian Volume Prediction Using a Diffusion Convolutional Gated Recurrent Unit Model
- Author
-
Dong, Yiwei, Chu, Tingjin, Zhang, Lele, Ghaderi, Hadi, and Yang, Hanfang
- Subjects
Computer Science - Machine Learning ,Statistics - Applications - Abstract
Effective models for analysing and predicting pedestrian flow are important to ensure the safety of both pedestrians and other road users. These tools also play a key role in optimising infrastructure design and geometry and supporting the economic utility of interconnected communities. The implementation of city-wide automatic pedestrian counting systems provides researchers with invaluable data, enabling the development and training of deep learning applications that offer better insights into traffic and crowd flows. Benefiting from real-world data provided by the City of Melbourne pedestrian counting system, this study presents a pedestrian flow prediction model, as an extension of Diffusion Convolutional Grated Recurrent Unit (DCGRU) with dynamic time warping, named DCGRU-DTW. This model captures the spatial dependencies of pedestrian flow through the diffusion process and the temporal dependency captured by Gated Recurrent Unit (GRU). Through extensive numerical experiments, we demonstrate that the proposed model outperforms the classic vector autoregressive model and the original DCGRU across multiple model accuracy metrics.
- Published
- 2024
34. Gradient Methods with Online Scaling
- Author
-
Gao, Wenzhi, Chu, Ya-Chi, Ye, Yinyu, and Udell, Madeleine
- Subjects
Mathematics - Optimization and Control ,Computer Science - Machine Learning - Abstract
We introduce a framework to accelerate the convergence of gradient-based methods with online learning. The framework learns to scale the gradient at each iteration through an online learning algorithm and provably accelerates gradient-based methods asymptotically. In contrast with previous literature, where convergence is established based on worst-case analysis, our framework provides a strong convergence guarantee with respect to the optimal scaling matrix for the iteration trajectory. For smooth strongly convex optimization, our results provide an $O(\kappa^\star \log(1/\varepsilon)$) complexity result, where $\kappa^\star$ is the condition number achievable by the optimal preconditioner, improving on the previous $O(\sqrt{n}\kappa^\star \log(1/\varepsilon))$ result. In particular, a variant of our method achieves superlinear convergence on convex quadratics. For smooth convex optimization, we show for the first time that the widely-used hypergradient descent heuristic improves on the convergence of gradient descent.
- Published
- 2024
35. Technical Report for SoccerNet Challenge 2022 -- Replay Grounding Task
- Author
-
Chen, Shimin, Li, Wei, Chu, Jiaming, Chen, Chen, Zhang, Chen, and Guo, Yandong
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
In order to make full use of video information, we transform the replay grounding problem into a video action location problem. We apply a unified network Faster-TAD proposed by us for temporal action detection to get the results of replay grounding. Finally, by observing the data distribution of the training data, we refine the output of the model to get the final submission.
- Published
- 2024
36. From Flip FET to Flip 3D Integration (F3D): Maximizing the Scaling Potential of Wafer Both Sides Beyond Conventional 3D Integration
- Author
-
Wu, Heng, Lu, Haoran, Peng, Wanyue, Xu, Ziqiao, Chu, Yanbang, Sun, Jiacheng, Zhou, Falong, Wu, Jack, Zhang, Lijie, Bu, Weihai, Kang, Jin, Li, Ming, Lin, Yibo, Wang, Runsheng, Zhang, Xin, and Huang, Ru
- Subjects
Condensed Matter - Mesoscale and Nanoscale Physics - Abstract
In this work, we proposed a new 3D integration technology: the Flip 3D integration (F3D), consisting of the 3D transistor stacking, the 3D dual-sided interconnects, the 3D die-to-die stacking and the dual-sided Monolithic 3D (M3D). Based on a 32-bit FFET RISCV core, besides the scaling benefits of the Flip FET (FFET), the dual-sided signal routing shows even more routing flexibility with 6.8% area reduction and 5.9% EDP improvement. Novel concepts of Multi-Flipping processes (Double Flips and Triple Flips) were proposed to relax the thermal budget constraints in the F3D and thus support the dual-sided M3D in the F3D. The core's EDP and frequency are improved by up to 3.2% and 2.3% respectively, after BEOL optimizations based on the Triple Flips compared with unoptimized ones., Comment: Accepted by EDTM 2025
- Published
- 2024
37. Divergent Energy-Momentum Fluxes In Nonlocal Gravity Models
- Author
-
Chu, Yi-Zen and Zuroida, Afidah
- Subjects
General Relativity and Quantum Cosmology ,Astrophysics - Cosmology and Nongalactic Astrophysics ,High Energy Physics - Theory - Abstract
We analyze the second order perturbations of the Deser-Woodard II (DWII), Vardanyan-Akrami-Amendola-Silvestri (VAAS) and Amendola-Burzilla-Nersisyan (ABN) nonlocal gravity models in an attempt to extract their associated gravitational wave energy-momentum fluxes. In Minkowski spacetime, the gravitational spatial momentum density is supposed to scale at most as $1/r^{2}$, in the $r \rightarrow \infty$ limit, where $r$ is the observer-source spatial distance. The DWII model has a divergent flux because its momentum density goes as $1/r$; though this can be avoided when we set to zero the first derivative of its distortion function at the origin. Meanwhile, the ABN model also suffers from a divergent flux because its momentum density goes as $r^{2}$. The momentum density from the VAAS model was computed on a cosmological background expressed in a Fermi-Normal-Coordinate system, and was found to scale as $r$. For generic parameters, therefore, none of these three Dark Energy models appear to yield well-defined gravitational wave energies, as a result of their nonlocal gravitational self-interactions., Comment: 23 pages, 1 figure
- Published
- 2024
38. RAGraph: A General Retrieval-Augmented Graph Learning Framework
- Author
-
Jiang, Xinke, Qiu, Rihong, Xu, Yongxin, Zhang, Wentao, Zhu, Yichen, Zhang, Ruizhe, Fang, Yuchen, Chu, Xu, Zhao, Junfeng, and Wang, Yasha
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Social and Information Networks - Abstract
Graph Neural Networks (GNNs) have become essential in interpreting relational data across various domains, yet, they often struggle to generalize to unseen graph data that differs markedly from training instances. In this paper, we introduce a novel framework called General Retrieval-Augmented Graph Learning (RAGraph), which brings external graph data into the general graph foundation model to improve model generalization on unseen scenarios. On the top of our framework is a toy graph vector library that we established, which captures key attributes, such as features and task-specific label information. During inference, the RAGraph adeptly retrieves similar toy graphs based on key similarities in downstream tasks, integrating the retrieved data to enrich the learning context via the message-passing prompting mechanism. Our extensive experimental evaluations demonstrate that RAGraph significantly outperforms state-of-the-art graph learning methods in multiple tasks such as node classification, link prediction, and graph classification across both dynamic and static datasets. Furthermore, extensive testing confirms that RAGraph consistently maintains high performance without the need for task-specific fine-tuning, highlighting its adaptability, robustness, and broad applicability., Comment: NeurIPS 2024
- Published
- 2024
39. Towards Convexity in Anomaly Detection: A New Formulation of SSLM with Unique Optimal Solutions
- Author
-
Liu, Hongying, Wang, Hao, Chu, Haoran, and Wu, Yibo
- Subjects
Computer Science - Machine Learning - Abstract
An unsolved issue in widely used methods such as Support Vector Data Description (SVDD) and Small Sphere and Large Margin SVM (SSLM) for anomaly detection is their nonconvexity, which hampers the analysis of optimal solutions in a manner similar to SVMs and limits their applicability in large-scale scenarios. In this paper, we introduce a novel convex SSLM formulation which has been demonstrated to revert to a convex quadratic programming problem for hyperparameter values of interest. Leveraging the convexity of our method, we derive numerous results that are unattainable with traditional nonconvex approaches. We conduct a thorough analysis of how hyperparameters influence the optimal solution, pointing out scenarios where optimal solutions can be trivially found and identifying instances of ill-posedness. Most notably, we establish connections between our method and traditional approaches, providing a clear determination of when the optimal solution is unique -- a task unachievable with traditional nonconvex methods. We also derive the {\nu}-property to elucidate the interactions between hyperparameters and the fractions of support vectors and margin errors in both positive and negative classes.
- Published
- 2024
40. From Black Strings to Fundamental Strings: Non-uniformity and Phase Transitions
- Author
-
Chu, Jinwei
- Subjects
High Energy Physics - Theory - Abstract
We discuss the transition between black strings and fundamental strings in the presence of a compact dimension, $\mathbb{S}^1_z$. In particular, we study the Horowitz-Polchinski effective field theory in $\mathbb{R}^d\times\mathbb{S}^1_z$, with a reduction on the Euclidean time circle $\mathbb{S}_\tau^1$. The classical solution of this theory describes a bound state of self-gravitating strings, known as a ``string star'', in Lorentzian spacetime. By analyzing non-uniform perturbations to the uniform solution, we identify the critical mass at which the string star becomes unstable towards non-uniformity along the spatial circle (i.e., Gregory-Laflamme instability) and determine the order of the associated phase transition. For $3\le d<4$, we argue that at the critical mass, the uniform string star can transition into a localized black hole. More generally, we describe the sequence of transitions from a large uniform black string as its mass decreases, depending on the number of dimensions $d$. Additionally, using the $SL(2)_k/U(1)$ model in string theory, we show that for sufficiently large $d$, the uniform black string is stable against non-uniformity before transitioning into fundamental strings. We also present a novel solution that exhibits double winding symmetry breaking in the asymptotically $\mathbb{R}^d\times\mathbb{S}^1_\tau\times\mathbb{S}^1_z$ Euclidean spacetime., Comment: 37+8 pages, 18 figures
- Published
- 2024
41. Optimized Flow Control based on Automatic Differentiation in Compressible Turbulent Channel Flows
- Author
-
Wang, Wenkang and Chu, Xu
- Subjects
Physics - Fluid Dynamics - Abstract
This study presents an automatic differentiation (AD)-based optimization framework for flow control in compressible turbulent channel flows. We developed a fully differentiable boundary condition framework that allows for the precise calculation of gradients with respect to boundary control variables. This facilitates the efficient optimization of flow control methods. The framework's adaptability and effectiveness are demonstrated using two boundary conditions: opposition control and tunable permeable walls. Various optimization targets are evaluated, including wall friction and turbulent kinetic energy (TKE), across different time horizons. In each optimization, there were around $4\times10^4$ control variables and $3\times10^{9}$ state variables in a single episode. Results indicate that TKE-targeted opposition control achieves a more stable and significant reduction in drag, with effective suppression of turbulence throughout the channel. In contrast, strategies that focus directly on minimizing wall friction were found to be less effective, exhibiting instability and increased turbulence in the outer region. The tunable permeable walls also show potential to achieve stable drag reduction through a `flux-inducing' mechanism. This study demonstrates the advantages of AD-based optimization in complex flow control scenarios and provides physical insight into the choice of quantity of interest for improved optimization performance.
- Published
- 2024
42. Testing the molecular nature of the $\Omega(2012)$ with the $\psi(3770) \to \bar{\Omega} \bar{K} \Xi$ and $\psi(3770) \to \bar{\Omega} \bar{K} \Xi^*(1530) (\bar \Omega \bar K \pi \Xi)$ reactions
- Author
-
Song, Jing, Liang, Wei-Hong, Xiao, Chu-Wen, Dias, Jorgivan Morais, and Oset, Eulogio
- Subjects
High Energy Physics - Phenomenology - Abstract
We report on the reactions $\psi(3770)\to \bar{\Omega}^+ \bar{K} \Xi $ and $\psi(3770)\to \bar{\Omega}^+ \bar{K}\Xi^*(1530)~(\Xi^*(1530)\to \pi\Xi $), and calculate the mass distributions $\frac{{\rm d}\Gamma}{{\rm d}M_\text{inv}(\bar{K}\Xi)}$ and $\frac{{\rm d}\Gamma}{{\rm d}M_\text{inv}(\bar{K}\Xi^*)}$, respectively. We obtain clear peaks corresponding to the $\Omega(2012)$. From the decay of $\psi(3770)\to \bar{\Omega}^+ \bar{K}\Xi^*$, we also get a second, broader, peak around $2035\,\rm MeV$, which comes from the $\Omega(2012)$ decay to $\bar{K}\Xi^*$. This second peak is closely tied to the molecular picture of the $\Omega(2012)$ with the $\bar{K}\Xi^*(1530)$ and $\eta\Omega$ components. Its observation would provide a boost to the molecular picture of the $\Omega(2012)$., Comment: 6 pages, 5 figures;V2: new reference and footnote added
- Published
- 2024
43. V2X-Assisted Distributed Computing and Control Framework for Connected and Automated Vehicles under Ramp Merging Scenario
- Author
-
Wu, Qiong, Chu, Jiahou, Fan, Pingyi, Wang, Kezhi, Cheng, Nan, Chen, Wen, and Letaief, Khaled B.
- Subjects
Electrical Engineering and Systems Science - Systems and Control ,Computer Science - Machine Learning ,Computer Science - Networking and Internet Architecture - Abstract
This paper investigates distributed computing and cooperative control of connected and automated vehicles (CAVs) in ramp merging scenario under transportation cyber-physical system. Firstly, a centralized cooperative trajectory planning problem is formulated subject to the safely constraints and traffic performance in ramp merging scenario, where the trajectories of all vehicles are jointly optimized. To get rid of the reliance on a central controller and reduce computation time, a distributed solution to this problem implemented among CAVs through Vehicles-to-Everything (V2X) communication is proposed. Unlike existing method, our method can distribute the computational task among CAVs and carry out parallel solving through V2X communication. Then, a multi-vehicles model predictive control (MPC) problem aimed at maximizing system stability and minimizing control input is formulated based on the solution of the first problem subject to strict safety constants and input limits. Due to these complex constraints, this problem becomes high-dimensional, centralized, and non-convex. To solve it in a short time, a decomposition and convex reformulation method, namely distributed cooperative iterative model predictive control (DCIMPC), is proposed. This method leverages the communication capability of CAVs to decompose the problem, making full use of the computational resources on vehicles to achieve fast solutions and distributed control. The two above problems with their corresponding solving methods form the systemic framework of the V2X assisted distributed computing and control. Simulations have been conducted to evaluate the framework's convergence, safety, and solving speed. Additionally, extra experiments are conducted to validate the performance of DCIMPC. The results show that our method can greatly improve computation speed without sacrificing system performance., Comment: This paper has been submitted to IEEE Journal. The source code has been released at: https://github.com/qiongwu86/V2X-Assisted-Distributed-Computing-and-Control-Framework-for-Connected-and-Automated-Vehicles.git
- Published
- 2024
44. Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration
- Author
-
Guan, Yanchu, Wang, Dong, Wang, Yan, Wang, Haiqing, Sun, Renen, Zhuang, Chenyi, Gu, Jinjie, and Chu, Zhixuan
- Subjects
Computer Science - Computation and Language - Abstract
Autonomous mobile app interaction has become increasingly important with growing complexity of mobile applications. Developing intelligent agents that can effectively navigate and interact with mobile apps remains a significant challenge. In this paper, we propose an Explainable Behavior Cloning LLM Agent (EBC-LLMAgent), a novel approach that combines large language models (LLMs) with behavior cloning by learning demonstrations to create intelligent and explainable agents for autonomous mobile app interaction. EBC-LLMAgent consists of three core modules: Demonstration Encoding, Code Generation, and UI Mapping, which work synergistically to capture user demonstrations, generate executable codes, and establish accurate correspondence between code and UI elements. We introduce the Behavior Cloning Chain Fusion technique to enhance the generalization capabilities of the agent. Extensive experiments on five popular mobile applications from diverse domains demonstrate the superior performance of EBC-LLMAgent, achieving high success rates in task completion, efficient generalization to unseen scenarios, and the generation of meaningful explanations., Comment: 20 pages
- Published
- 2024
45. Dataset Awareness is not Enough: Implementing Sample-level Tail Encouragement in Long-tailed Self-supervised Learning
- Author
-
Xiao, Haowen, Liu, Guanghui, Gao, Xinyi, Li, Yang, Lv, Fengmao, and Chu, Jielei
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
Self-supervised learning (SSL) has shown remarkable data representation capabilities across a wide range of datasets. However, when applied to real-world datasets with long-tailed distributions, performance on multiple downstream tasks degrades significantly. Recently, the community has begun to focus more on self-supervised long-tailed learning. Some works attempt to transfer temperature mechanisms to self-supervised learning or use category-space uniformity constraints to balance the representation of different categories in the embedding space to fight against long-tail distributions. However, most of these approaches focus on the joint optimization of all samples in the dataset or on constraining the category distribution, with little attention given to whether each individual sample is optimally guided during training. To address this issue, we propose Temperature Auxiliary Sample-level Encouragement (TASE). We introduce pseudo-labels into self-supervised long-tailed learning, utilizing pseudo-label information to drive a dynamic temperature and re-weighting strategy. Specifically, We assign an optimal temperature parameter to each sample. Additionally, we analyze the lack of quantity awareness in the temperature parameter and use re-weighting to compensate for this deficiency, thereby achieving optimal training patterns at the sample level. Comprehensive experimental results on six benchmarks across three datasets demonstrate that our method achieves outstanding performance in improving long-tail recognition, while also exhibiting high robustness.
- Published
- 2024
46. DOFS: A Real-world 3D Deformable Object Dataset with Full Spatial Information for Dynamics Model Learning
- Author
-
Zhang, Zhen, Chu, Xiangyu, Tang, Yunxi, and Au, K. W. Samuel
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Robotics - Abstract
This work proposes DOFS, a pilot dataset of 3D deformable objects (DOs) (e.g., elasto-plastic objects) with full spatial information (i.e., top, side, and bottom information) using a novel and low-cost data collection platform with a transparent operating plane. The dataset consists of active manipulation action, multi-view RGB-D images, well-registered point clouds, 3D deformed mesh, and 3D occupancy with semantics, using a pinching strategy with a two-parallel-finger gripper. In addition, we trained a neural network with the down-sampled 3D occupancy and action as input to model the dynamics of an elasto-plastic object. Our dataset and all CADs of the data collection system will be released soon on our website., Comment: 5 pages, 6 figures, 2024 CoRL Workshop on Learning Robot Fine and Dexterous Manipulation: Perception and Control
- Published
- 2024
47. Homological $n$-systole in $(n+1)$-manifolds and bi-Ricci curvature
- Author
-
Chu, Jianchun, Lee, Man-Chun, and Zhu, Jintian
- Subjects
Mathematics - Differential Geometry - Abstract
In this paper, we prove an optimal systolic inequality and the corresponding rigidity in the equality case on closed manifolds with positive bi-Ricci curvature, which generalizes the work of Bray-Brendle-Neves. The proof is given in all dimensions based on the method of minimal surfaces under the Generic Regularity Hypothesis, which is known to be true up to dimension ten., Comment: 23 pages, no figure
- Published
- 2024
48. Optimization and Characterization of Thermoelectric Properties in Selenium-Doped Bismuth Telluride Ultra Thin Films
- Author
-
Nguyen, Kien Trung, Dong, Lan Anh, Dinh, Hien Thi, Bui, Thi Huyen Trang, Chu, Son Truong, Nguyen-Tran, Thuat, Hoang, Chi Hieu, and Nguyen, Hung Quoc
- Subjects
Condensed Matter - Materials Science - Abstract
Thermoelectricity in telluride materials is often improved by replacing telluride with selenium in its crystal. Most work, however, focuses on bulk crystal and leaves the 2D thin films intact. In this paper, we optimize the fabrication of selenium-doped bismuth telluride (Bi$_2$Te$_{3-\rm{x}}$Se$_{\rm{x}}$) thin films using a 3-source thermal co-evaporation. Thermoelectric properties, including the Seebeck coefficient and electrical resistivity, are systematically characterized to evaluate the material's performance for thermoelectric applications near room temperature. The thin films were deposited under carefully controlled conditions, with the evaporation rates of bismuth, tellurium, and selenium precisely monitored to achieve the desired stoichiometry and crystalline phase. Finally, thermoelectricity in Bi$_2$Te$_{3-\rm{x}}$Se$_{\rm{x}}$ at the ultra-thin regime is investigated. We consistently obtain films with thickness near 30 nm with a Seebeck coefficient of 400 $\mu$V/K and a power factor of 1 mW/mK$^2$.
- Published
- 2024
49. Practical Bayesian Algorithm Execution via Posterior Sampling
- Author
-
Cheng, Chu Xin, Astudillo, Raul, Desautels, Thomas, and Yue, Yisong
- Subjects
Computer Science - Machine Learning ,Mathematics - Optimization and Control ,Statistics - Machine Learning - Abstract
We consider Bayesian algorithm execution (BAX), a framework for efficiently selecting evaluation points of an expensive function to infer a property of interest encoded as the output of a base algorithm. Since the base algorithm typically requires more evaluations than are feasible, it cannot be directly applied. Instead, BAX methods sequentially select evaluation points using a probabilistic numerical approach. Current BAX methods use expected information gain to guide this selection. However, this approach is computationally intensive. Observing that, in many tasks, the property of interest corresponds to a target set of points defined by the function, we introduce PS-BAX, a simple, effective, and scalable BAX method based on posterior sampling. PS-BAX is applicable to a wide range of problems, including many optimization variants and level set estimation. Experiments across diverse tasks demonstrate that PS-BAX performs competitively with existing baselines while being significantly faster, simpler to implement, and easily parallelizable, setting a strong baseline for future research. Additionally, we establish conditions under which PS-BAX is asymptotically convergent, offering new insights into posterior sampling as an algorithm design paradigm., Comment: Published as a conference paper at the 38th Conference on Neural Information Processing Systems (NeurIPS 2024)
- Published
- 2024
50. FuseFL: One-Shot Federated Learning through the Lens of Causality with Progressive Model Fusion
- Author
-
Tang, Zhenheng, Zhang, Yonggang, Dong, Peijie, Cheung, Yiu-ming, Zhou, Amelie Chi, Han, Bo, and Chu, Xiaowen
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Networking and Internet Architecture - Abstract
One-shot Federated Learning (OFL) significantly reduces communication costs in FL by aggregating trained models only once. However, the performance of advanced OFL methods is far behind the normal FL. In this work, we provide a causal view to find that this performance drop of OFL methods comes from the isolation problem, which means that local isolatedly trained models in OFL may easily fit to spurious correlations due to the data heterogeneity. From the causal perspective, we observe that the spurious fitting can be alleviated by augmenting intermediate features from other clients. Built upon our observation, we propose a novel learning approach to endow OFL with superb performance and low communication and storage costs, termed as FuseFL. Specifically, FuseFL decomposes neural networks into several blocks, and progressively trains and fuses each block following a bottom-up manner for feature augmentation, introducing no additional communication costs. Comprehensive experiments demonstrate that FuseFL outperforms existing OFL and ensemble FL by a significant margin. We conduct comprehensive experiments to show that FuseFL supports high scalability of clients, heterogeneous model training, and low memory costs. Our work is the first attempt using causality to analyze and alleviate data heterogeneity of OFL.
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.