Author: "Zhu, Ming" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Zhu, Ming"' showing total 11,156 results

Start Over Author "Zhu, Ming"

11,156 results on '"Zhu, Ming"'

1. SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs

Author: Kokane, Shirley, Zhu, Ming, Awalgaonkar, Tulika, Zhang, Jianguo, Hoang, Thai, Prabhakar, Akshara, Liu, Zuxin, Lan, Tian, Yang, Liangwei, Tan, Juntao, Murthy, Rithesh, Yao, Weiran, Liu, Zhiwei, Niebles, Juan Carlos, Wang, Huan, Heinecke, Shelby, Xiong, Caiming, and Savarese, Silivo
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence
Abstract: Evaluating the output of Large Language Models (LLMs) is one of the most critical aspects of building a performant compound AI system. Since the output from LLMs propagate to downstream steps, identifying LLM errors is crucial to system performance. A common task for LLMs in AI systems is tool use. While there are several benchmark environments for evaluating LLMs on this task, they typically only give a success rate without any explanation of the failure cases. To solve this problem, we introduce SpecTool, a new benchmark to identify error patterns in LLM output on tool-use tasks. Our benchmark data set comprises of queries from diverse environments that can be used to test for the presence of seven newly characterized error patterns. Using SPECTOOL , we show that even the most prominent LLMs exhibit these error patterns in their outputs. Researchers can use the analysis and insights from SPECTOOL to guide their error mitigation strategies.
Published: 2024

2. The HI Mass Function of the Local Universe: Combining Measurements from HIPASS, ALFALFA and FASHI

Author: Ma, Wenlin, Guo, Hong, Xu, Haojie, Jones, Michael G., Zhang, Chuan-Peng, Zhu, Ming, Wang, Jing, Wang, Jie, and Jiang, Peng
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We present the first HI mass function (HIMF) measurement for the recent FAST All Sky HI (FASHI) survey and the most complete measurements of HIMF in the local universe so far by combining the HI catalogues from HI Parkes All Sky Survey (HIPASS), Arecibo Legacy Fast ALFA (ALFALFA) and FASHI surveys at redshift 0 < z < 0.05, covering 76% of the entire sky. We adopt the same methods to estimate distances, calculate sample completeness, and determine the HIMF for all three surveys. The best-fitting Schechter function for the total HIMF has a low-mass slope parameter alpha = -1.30 and a knee mass log(Ms) = 9.86 and a normalization phi_s = 0.00658. This gives the cosmic HI abundance omega_HI= 0.000454. We find that a double Schechter function with the same slope alpha better describes our HIMF, and the two different knee masses are log(Ms1) = 9.96 and log(Ms2) = 9.65. We verify that the measured HIMF is marginally affected by the choice of distance estimates. The effect of cosmic variance is significantly suppressed by combining the three surveys and it provides a unique opportunity to obtain an unbiased estimate of the HIMF in the local universe., Comment: 10 pages, 7 figures, submitted to A&A
Published: 2024

3. PRACT: Optimizing Principled Reasoning and Acting of LLM Agent

Author: Liu, Zhiwei, Yao, Weiran, Zhang, Jianguo, Murthy, Rithesh, Yang, Liangwei, Liu, Zuxin, Lan, Tian, Zhu, Ming, Tan, Juntao, Kokane, Shirley, Hoang, Thai, Niebles, Juan Carlos, Heinecke, Shelby, Wang, Huan, Savarese, Silvio, and Xiong, Caiming
Subjects: Computer Science - Artificial Intelligence
Abstract: We introduce the Principled Reasoning and Acting (PRAct) framework, a novel method for learning and enforcing action principles from trajectory data. Central to our approach is the use of text gradients from a reflection and optimization engine to derive these action principles. To adapt action principles to specific task requirements, we propose a new optimization framework, Reflective Principle Optimization (RPO). After execution, RPO employs a reflector to critique current action principles and an optimizer to update them accordingly. We develop the RPO framework under two scenarios: Reward-RPO, which uses environmental rewards for reflection, and Self-RPO, which conducts self-reflection without external rewards. Additionally, two RPO methods, RPO-Traj and RPO-Batch, is introduced to adapt to different settings. Experimental results across four environments demonstrate that the PRAct agent, leveraging the RPO framework, effectively learns and applies action principles to enhance performance., Comment: Accepted to SIG CoNLL 2024
Published: 2024

4. Deep HI Mapping of M 106 Group with FAST

Author: Liu, Yao, Zhu, Ming, Yu, Hai-Yang, Zhou, Rui-Lei, Xu, Jin-Long, Ai, Mei, Jiang, Peng, Yuan, Li-Xia, and Zhang, Hai-Yan
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We used FAST to conduct deep HI imaging of the entire M 106 group region, and have discovered a few new HI filaments and clouds. Three HI clouds/filaments are found in a region connecting DDO 120 and NGC 4288, indicating an interaction between these two galaxies. The HI features in this region suggest that DDO 120 is probably the origin of the HI stream extending from the northern end of NGC 4288 to M 106. This structure is similar to the SMC-LMC stream, but much longer, about 190 kpc. Furthermore, based on the distance measurements, we have determined the satellite galaxy members of M 106. With an absolute magnitude cutoff of M_B=-10, we obtained a sample of 11 member satellite galaxies for M 106. Using the observed HI mass with FAST, we studied the properties of satellite galaxies in M 106 and found that satellite galaxies with lower stellar masses exhibit more significant deviations from the star-forming main sequence (SFMS) in their specific star formation rates. Furthermore, the relationship between the HI mass of satellite galaxies and optical diameter generally follows the field galaxies relation. We discuss the possible mechanisms leading to the quenching in the M 106 group based on the new data from FAST, Comment: 18 pages,11 figures and 3 tables.Accepted by mnras
Published: 2024

5. Development of a Platform to Enable Real Time, Non-disruptive Testing and Early Fault Detection of Critical High Voltage Transformers and Switchgears in High Speed-rail

Author: Fan, Jiawei, Zhu, Ming, Jiang, Yingtao, and Teng, Hualiang
Subjects: Electrical Engineering and Systems Science - Systems and Control, Electrical Engineering and Systems Science - Signal Processing
Abstract: Partial discharge (PD) incidents can occur in critical components of high-speed rail electric systems, such as transformers and switchgears, due to localized insulation defects that cannot withstand electric stress, leading to potential flashovers. These incidents can escalate over time, resulting in breakdowns, downtime, and safety risks. Fortunately, PD activities emit radio frequency (RF) signals, allowing for the development of a hardware platform for real-time, non-invasive PD detection and monitoring. The system uses an RF antenna and high-speed data acquisition to scan signals across a configurable frequency range (100MHz to 3GHz), utilizing intermediate frequency modulation and sliding frequency windows for detailed analysis. When signals exceed a threshold, the system records the events, capturing both raw signal data and spectrum snapshots. Real-time data is streamed to a cloud server, offering remote access through a dedicated smartphone application, enabling maintenance teams to monitor and respond promptly. Laboratory testing has confirmed the system's ability to accurately capture RF signals and provide real-time PD monitoring, enhancing the reliability and safety of high-speed rail infrastructure.
Published: 2024

6. New HI observations Toward the NGC 5055 Galaxy Group with FAST

Author: Liu, Xiao-Lan, Zhu, Ming, Xu, Jin-Long, Jiang, Peng, Zhang, Chuan-Peng, Yu, Nai-Ping, Wang, Jun-Jie, and Yang, Yan-Bin
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We report a new high-sensitivity HI mapping observation of the NGC 5055 galaxy group over an area of $1.^\circ5\times0.^\circ75$ with the Five-hundred-meter Aperture Spherical radio Telescope (FAST). Our observation reveals that the warped H\,{\sc i} disk of NGC~5055 is more extended than what previously observed by WSRT, out to $ 23.'9$ (61.7 kpc). The total HI mass of NGC 5055 is determined to be $\rm\sim 1.1\times10^{10}\,M_\odot$. We identified three HI clouds with HI masses of the order of $\rm \sim 10^7\,M_\odot$ at the southeastern edge of the HI disk, as well as a candidate high-velocity cloud with an HI mass of $\rm (1.2\pm0.5) \times10^6\,M_\odot$ to the north of NGC 5055. The HI content of UGCA 337 is robustly detected for the first time by the FAST observations. It has a narrow HI linewidth of $W_{50}=17.4\pm3.8$ km s$^{-1}$ with a total \HI\ mass of ($\rm 3.5\pm0.3)\times10^6\,M_\odot$. Comparing the gas content and g-r color of UGCA 337 with typical low-mass dwarf galaxies, UGCA~337 appears relatively gas-poor despite its blue color. This suggests that UGCA 337 may have undergone gas stripping in the past. We also analyzed the possible origin of the diffuse HI clouds located at the outskirts of NGC 5055, and speculate that they might be the remnant features of a merger event in the past., Comment: 10 pages, 6 figures
Published: 2024

7. Non-Interrupting Rail Track Geometry Measurement System Using UAV and LiDAR

Author: Qiu, Lihao, Zhu, Ming, Park, JeeWoong, Jiang, Yingtao, Hualiang, and Teng
Subjects: Computer Science - Robotics, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: The safety of train operations is largely dependent on the health of rail tracks, necessitating regular and meticulous inspection and maintenance. A significant part of such inspections involves geometric measurements of the tracks to detect any potential problems. Traditional methods for track geometry measurements, while proven to be accurate, require track closures during inspections, and consume a considerable amount of time as the inspection area grows, causing significant disruptions to regular operations. To address this challenge, this paper proposes a track geometry measurement system (TGMS) that utilizes an unmanned aerial vehicle (UAV) platform equipped with a light detection and ranging (LiDAR) sensor. Integrated with a state-of-the-art machine-learning-based computer vision algorithm, and a simultaneous localization and mapping (SLAM) algorithm, this platform can conduct rail geometry inspections seamlessly over a larger area without interrupting rail operations. In particular, this semi- or fully automated measurement is found capable of measuring critical rail geometry irregularities in gauge, curvature, and profile with sub-inch accuracy. Cross-level and warp are not measured due to the absence of gravity data. By eliminating operational interruptions, our system offers a more streamlined, cost-effective, and safer solution for inspecting and maintaining rail infrastructure.
Published: 2024

8. MOSS: Enabling Code-Driven Evolution and Context Management for AI Agents

Author: Zhu, Ming and Zhou, Yi
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Developing AI agents powered by large language models (LLMs) faces significant challenges in achieving true Turing completeness and adaptive, code-driven evolution. Current approaches often generate code independently of its runtime context, relying heavily on the LLM's memory, which results in inefficiencies and limits adaptability. Manual protocol development in sandbox environments further constrains the agent's autonomous adaptability. Crucially, achieving consistency in code and context across multi-turn interactions and ensuring isolation of local variables within each interaction remains an unsolved problem. We introduce MOSS (llM-oriented Operating System Simulation), a novel framework that addresses these challenges by integrating code generation with a dynamic context management system. MOSS ensures consistency and adaptability by using a mechanism that maintains the Python context across interactions, including isolation of local variables and preservation of runtime integrity. At its core, the framework employs an Inversion of Control (IoC) container in conjunction with decorators to enforce the least knowledge principle, allowing agents to focus on abstract interfaces rather than concrete implementations. This facilitates seamless integration of new tools and libraries, enables runtime instance replacement, and reduces prompt complexity, providing a "what you see is what you get" environment for the agent. Through a series of case studies, we show how this framework can enhance the efficiency and capabilities of agent development and highlight its advantages in moving towards Turing-complete agents capable of evolving through code.
Published: 2024

9. xLAM: A Family of Large Action Models to Empower AI Agent Systems

Author: Zhang, Jianguo, Lan, Tian, Zhu, Ming, Liu, Zuxin, Hoang, Thai, Kokane, Shirley, Yao, Weiran, Tan, Juntao, Prabhakar, Akshara, Chen, Haolin, Liu, Zhiwei, Feng, Yihao, Awalgaonkar, Tulika, Murthy, Rithesh, Hu, Eric, Chen, Zeyuan, Xu, Ran, Niebles, Juan Carlos, Heinecke, Shelby, Wang, Huan, Savarese, Silvio, and Xiong, Caiming
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Autonomous agents powered by large language models (LLMs) have attracted significant research interest. However, the open-source community faces many challenges in developing specialized models for agent tasks, driven by the scarcity of high-quality agent datasets and the absence of standard protocols in this area. We introduce and publicly release xLAM, a series of large action models designed for AI agent tasks. The xLAM series includes five models with both dense and mixture-of-expert architectures, ranging from 1B to 8x22B parameters, trained using a scalable, flexible pipeline that unifies, augments, and synthesizes diverse datasets to enhance AI agents' generalizability and performance across varied environments. Our experimental results demonstrate that xLAM consistently delivers exceptional performance across multiple agent ability benchmarks, notably securing the 1st position on the Berkeley Function-Calling Leaderboard, outperforming GPT-4, Claude-3, and many other models in terms of tool use. By releasing the xLAM series, we aim to advance the performance of open-source LLMs for autonomous AI agents, potentially accelerating progress and democratizing access to high-performance models for agent tasks. Models are available at https://huggingface.co/collections/Salesforce/xlam-models-65f00e2a0a63bbcd1c2dade4, Comment: Technical report for the Salesforce xLAM model series
Published: 2024

10. Deep extragalactic HI survey of the COSMOS field with FAST

Author: Pan, Hengxing, Jarvis, Matt J., Zhu, Ming, Ma, Yin-Zhe, Santos, Mario G., Ponomareva, Anastasia A., Heywood, Ian, Jing, Yingjie, Xu, Chen, Liu, Ziming, Chandola, Yogesh, and Jing, Yipeng
Subjects: Astrophysics - Astrophysics of Galaxies, Astrophysics - Cosmology and Nongalactic Astrophysics
Abstract: We present a deep HI survey at L-band conducted with the Five-hundred-meter Aperture Spherical radio Telescope (FAST) over the COSMOS field. This survey is strategically designed to overlap with the MIGHTEE COSMOS field, aiming to combine the sensitivity of the FAST and high-resolution of the MeerKAT. We observed the field with FAST for approximately 11 hours covering $\sim$2 square degrees, and reduced the raw data to HI spectral cubes over the frequency range 1310-1420 MHz. The FAST-HI data reach a median 3$\sigma$ column density of $N_{\rm HI}\sim2\times10^{17}$ cm$^{-2}$ over a 5 km s$^{-1}$ channel width, allowing for studies of the distribution of HI gas in various environments, such as in galaxies, the Circum-Galactic Medium (CGM) and Intergalactic Medium (IGM). We visually searched the spectral cubes for HI sources, and found a total of 80 HI detections, of which 56 have been cross-matched with the MIGHTEE-HI catalogue. With the cross-matched sources, we compare their HI masses and find that the total HI mass fraction in the IGM and CGM surrounding the galaxy pairs is statistically higher than the HI fraction surrounding the isolated galaxies by a difference of 13$\pm$4%, indicating that the CGM and IGM associated with interacting systems are richer in neutral hydrogen compared to those around isolated galaxies in the local Universe. We also describe several FAST-MeerKAT synergy projects, highlighting the full potential of exploiting both single-dish and interferometric observations to study the distribution and evolution of the diffuse HI gas., Comment: 13 pages, 14 figures; Accepted for publication in MNRAS; Minor corrections made at proof stage
Published: 2024

11. Exploring the origin of cold gas and star formation in a rare population of strongly bulge-dominated early-type Galaxies

Author: Li, Fujia, Wang, Enci, Zhu, Ming, Peng, Yingjie, Wang, Jing, Zhang, Chuanpeng, Lin, Zesen, Rong, Yu, Zhang, Hongxin, and Kong, Xu
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We analyze the properties of a rare population, the strongly bulge-dominated early-type galaxies (referred to as sBDEs) with significant HI gas, using the databases from the FAST All Sky HI survey (FASHI) and the Arecibo Legacy Fast ALFA (ALFALFA) survey. We select the sBDEs from the Sloan Digital Sky Survey (SDSS) and cross-match with the FASHI-ALFALFA combined HI sample, resulting in 104 HI-rich sBDEs. These sBDEs tend to have extremely high HI reservoirs, which is rare in previous studies such as ATLAS$^{3D}$. 70% of the selected sBDEs are classified as quiescent galaxies, even though they have a large HI reservoir. We study the properties of these sBDEs from five main aspects: stellar population, gas-phase metallicity, stacked HI spectra, environment, and spatially resolved MaNGA data. The majority of HI-rich sBDEs appear to show lower gas-phase metallicity and are located in significantly lower-density environments, suggesting an external origin for their HI gas. We find that star-forming sBDEs exhibit statistically higher star formation efficiency and slightly older stellar populations compared to normal star-forming galaxies, suggesting a recent star formation on Gyr-timescale. They also show narrower and more concentrated HI profiles compared to control star-forming galaxies, which may explain their higher star formation efficiency., Comment: 18 pages, 14 figures, 1 table. Accepted for publication in ApJ
Published: 2024

12. FAST observations of neutral hydrogen in the interacting galaxies NGC 3395/3396

Author: Yu, Nai-Ping, Zhu, Ming, Xu, Jin-Long, Zhang, Chuan-Peng, Yu, Hai-Yang, Liu, Xiao-Lan, Jiang, Peng, and Ai, Mei
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We report on high-sensitivity neutral hydrogen observations toward the gas-rich interacting galaxies NGC 3395/3396 with the Five-hundred-meter Aperture Spherical radio Telescope (FAST). Compared to previous observations carried out by the Very Large Array (VLA) and the Westerbork Synthesis Radio Telescope (WSRT), a more extended HI envelope around this system has been detected. The total HI gas mass of the NGC 3395/3396 system is estimated to be 7.8 x 109 M. This value is 2.7 times more than that reported based on the VLA interferometric maps. Previous observations found a large HI tail extending to the south-west and a minor tail emerging from the north of this peculiar galaxy pair. Based on the high-sensitivity observations of FAST, an extended HI plume to the north-west and a gas plume to the north-east have been detected for the first time. Neutral hydrogen of the two smaller galaxies IC 2604 and IC 2608 on the south of the system have also been detected. We discuss the origins of these extra gas and possible tidal interactions between these galaxies. NGC 3395/3396's most prominent tidal feature, the south-west tail combined with the new detected north-west plume behaves like a large ring. We suggest the ring might be formed by the previous fly-by interaction between NGC 3395 and NGC 3396 which happened 500 Myr ago. Our study shows that high-sensitivity HI observations are important in revealing low column density gas, which is crucial to a deeper understanding of this interacting system.
Published: 2024

13. FASHI: An untargeted survey of the 21 cm HI absorption galaxies with FAST

Author: Zhang, Chuan-Peng, Zhu, Ming, Jiang, Peng, Cheng, Cheng, Xu, Jin-Long, Yu, Nai-Ping, Liu, Xiao-Lan, and Zhang, Bo
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: The FAST All Sky H I survey (FASHI) will cover the entire observable sky ($\sim$22000 square degrees) with the Five-hundred-meter Aperture Spherical radio Telescope (FAST). With the currently released data, we perform an untargeted survey of 21 cm HI absorption galaxies at redshift $z\lesssim0.09$ over an area of about 10000 square degrees. We have detected 51 HI absorbers, including 21 previously known and 30 new ones. The probability of occurrence for the HI absorbers in all HI galaxies is 1/1078. The radio flux densities of the FASHI absorbers are mainly concentrated in the range of $S_{\rm 1.4GHz}=10\sim100$ mJy, but also as low as $2.6\pm0.4$ mJy. We find that the host galaxies of the associated HI absorbers have relatively high star formation rates, and there is a negative correlation between the HI column density and the stellar mass in the host galaxy. Consequently, FAST has significantly improved the capabilities and performance for HI absorption observations and has provided a true untargeted survey of 21 cm HI absorption galaxies for such studies., Comment: 36 pages, many figures, 3 tables, accepted for publication in ApJS
Published: 2024

14. The FAST HI 21-cm absorption blind survey. II -- statistic exploration for associated and intervening systems

Author: Hu, Wenkai, Wang, Yougang, Li, Yichao, Pen, Ue-Li, Wang, Jie, Jing, Yingjie, Zhu, Ming, Zhang, Xin, Yang, Wenxiu, Xu, Yidong, Chen, Xu, Chen, Jingze, Zheng, Zheng, Li, Di, and Chen, Xuelei
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We present an extragalactic HI 21-cm absorption lines catalog from a blind search at z $\leq$ 0.35, using drift-scan data collected in 1616.9 hours by the ongoing Commensal Radio Astronomy FasT Survey (CRAFTS) and FAST All Sky HI Survey (FASHI), which spans a sky area of 7456.8 deg$^{2}$ and covers 84,533 radio sources with a flux density greater than 12 mJy. 14 previously identified HI absorbers and 20 newly discovered HI absorbers were detected, comprising 14 associated systems, 11 intervening systems, and 9 systems with undetermined classifications. We fit HI profiles with multi-component Gaussian functions and calculate the redshift, width, flux density, optical depth, and HI column densities for each source. Through spectral stacking, the mean peak optical path, mean velocity-integrated optical path $\langle \tau\rangle$, mean FWHM and mean HI column density $\langle$ N$_{HI}\rangle$ are measured to be 0.46 and 0.34; 25.85 km/s and 4.62 km/s; 39.80 km/s and 8.95 km/s; 0.470 and 0.085 T$_{s} \times$ 10$^{20}$cm$^{-2}$K$^{-1}$, for the associated and intervening samples, respectively. Statistical analysis also reveals that associated systems tend to be hosted by red (g$-$r$>$0.7) galaxies at lower redshifts, whereas galaxies hosting intervening HI absorption are typically found at higher redshifts and are of a bluer (g$-$r$\leq$0.7) type. Additionally, it has been demonstrated that associated HI 21-cm absorptions connected to compact radio sources display higher N$_{HI}$ values compared to those linked with extended radio sources., Comment: 28 pages, 39 figures, 5 tables
Published: 2024

15. APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets

Author: Liu, Zuxin, Hoang, Thai, Zhang, Jianguo, Zhu, Ming, Lan, Tian, Kokane, Shirley, Tan, Juntao, Yao, Weiran, Liu, Zhiwei, Feng, Yihao, Murthy, Rithesh, Yang, Liangwei, Savarese, Silvio, Niebles, Juan Carlos, Wang, Huan, Heinecke, Shelby, and Xiong, Caiming
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Software Engineering
Abstract: The advancement of function-calling agent models requires diverse, reliable, and high-quality datasets. This paper presents APIGen, an automated data generation pipeline designed to synthesize verifiable high-quality datasets for function-calling applications. We leverage APIGen and collect 3,673 executable APIs across 21 different categories to generate diverse function-calling datasets in a scalable and structured manner. Each data in our dataset is verified through three hierarchical stages: format checking, actual function executions, and semantic verification, ensuring its reliability and correctness. We demonstrate that models trained with our curated datasets, even with only 7B parameters, can achieve state-of-the-art performance on the Berkeley Function-Calling Benchmark, outperforming multiple GPT-4 models. Moreover, our 1B model achieves exceptional performance, surpassing GPT-3.5-Turbo and Claude-3 Haiku. We release a dataset containing 60,000 high-quality entries, aiming to advance the field of function-calling agent domains. The dataset is available on Huggingface: https://huggingface.co/datasets/Salesforce/xlam-function-calling-60k and the project homepage: https://apigen-pipeline.github.io/
Published: 2024

16. MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases

Author: Murthy, Rithesh, Yang, Liangwei, Tan, Juntao, Awalgaonkar, Tulika Manoj, Zhou, Yilun, Heinecke, Shelby, Desai, Sachin, Wu, Jason, Xu, Ran, Tan, Sarah, Zhang, Jianguo, Liu, Zhiwei, Kokane, Shirley, Liu, Zuxin, Zhu, Ming, Wang, Huan, Xiong, Caiming, and Savarese, Silvio
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: The deployment of Large Language Models (LLMs) and Large Multimodal Models (LMMs) on mobile devices has gained significant attention due to the benefits of enhanced privacy, stability, and personalization. However, the hardware constraints of mobile devices necessitate the use of models with fewer parameters and model compression techniques like quantization. Currently, there is limited understanding of quantization's impact on various task performances, including LLM tasks, LMM tasks, and, critically, trust and safety. There is a lack of adequate tools for systematically testing these models on mobile devices. To address these gaps, we introduce MobileAIBench, a comprehensive benchmarking framework for evaluating mobile-optimized LLMs and LMMs. MobileAIBench assesses models across different sizes, quantization levels, and tasks, measuring latency and resource consumption on real devices. Our two-part open-source framework includes a library for running evaluations on desktops and an iOS app for on-device latency and hardware utilization measurements. Our thorough analysis aims to accelerate mobile AI research and deployment by providing insights into the performance and feasibility of deploying LLMs and LMMs on mobile platforms.
Published: 2024

17. Observation of floating surface state in obstructed atomic insulator candidate NiP$_2$

Author: Liu, Xiang-Rui, Zhu, Ming-Yuan, Feng, Yuanwen, Zeng, Meng, Ma, Xiao-Ming, Hao, Yu-Jie, Dai, Yue, Luo, Rong-Hao, Yamagami, Kohei, Liu, Yi, Cui, Shengtao, Sun, Zhe, Liu, Jia-Yu, Liu, Zhengtai, Ye, Mao, Shen, Dawei, Li, Bing, and Liu, Chang
Subjects: Condensed Matter - Materials Science
Abstract: Obstructed atomic insulator is recently proposed as an unconventional material, in which electric charge centers localized at sites away from the atoms. A half-filling surface state would emerge at specific interfaces cutting through these charge centers and avoid intersecting any atoms. In this article, we utilized angle-resolved photoemission spectroscopy and density functional theory calculations to study one of the obstructed atomic insulator candidates, NiP$_2$. A floating surface state with large effective mass that is isolated from all bulk states is resolved on the (100) cleavage plane, distinct from previously reported surface states in obstructed atomic insulators that are merged into bulk bands. Density functional theory calculation results elucidate that this floating surface state is originated from the obstructed Wannier charge centers, albeit underwent surface reconstruction that splits the half-filled obstructed surface state. Our findings not only shed lights on the spectroscopy study of obstructed atomic insulators and obstructed surface states, but also provide possible route for development of new catalysts., Comment: 21 pages, 5 figures
Published: 2024
Full Text: View/download PDF

18. Why Not Transform Chat Large Language Models to Non-English?

Author: Geng, Xiang, Zhu, Ming, Li, Jiahuan, Lai, Zhejian, Zou, Wei, She, Shuaijie, Guo, Jiaxin, Zhao, Xiaofeng, Li, Yinglu, Li, Yuang, Su, Chang, Zhao, Yanqing, Lyu, Xinglin, Zhang, Min, Chen, Jiajun, Yang, Hao, and Huang, Shujian
Subjects: Computer Science - Computation and Language
Abstract: The scarcity of non-English data limits the development of non-English large language models (LLMs). Transforming English-centric LLMs to non-English has been identified as an effective and resource-efficient method. Previous works start from base LLMs and perform knowledge distillation (KD) with data generated by stronger LLMs, e.g. GPT-4. Compared to base LLMs, chat LLMs are further optimized for advanced abilities, e.g. multi-turn conversation and human preference alignment, and thus more powerful in both helpfulness and safety. However, transforming a chat LLM involves two critical issues: (1) How can we effectively transfer advanced abilities without their supervised data? (2) How can we prevent the original knowledge from catastrophic forgetting during transformation? We target these issues by introducing a simple framework called TransLLM. For the first issue, TransLLM divides the transfer problem into some common sub-tasks with the translation chain-of-thought, which uses the translation as the bridge between English and non-English step-by-step. We further enhance the performance of sub-tasks with publicly available data. For the second issue, we propose a method comprising two synergistic components: low-rank adaptation for training to maintain the original LLM parameters, and recovery KD, which utilizes data generated by the chat LLM itself to recover the original knowledge from the frozen parameters. In the experiments, we transform the LLaMA-2-chat-7B to the Thai language. Our method, using only single-turn data, outperforms strong baselines and ChatGPT on multi-turn benchmark MT-bench. Furthermore, our method, without safety data, rejects more harmful queries of safety benchmark AdvBench than both ChatGPT and GPT-4.
Published: 2024

19. Observation of Spin Splitting in Room-Temperature Metallic Antiferromagnet CrSb

Author: Zeng, Meng, Zhu, Ming-Yuan, Zhu, Yu-Peng, Liu, Xiang-Rui, Ma, Xiao-Ming, Hao, Yu-Jie, Liu, Pengfei, Qu, Gexing, Yang, Yichen, Jiang, Zhicheng, Yamagami, Kohei, Arita, Masashi, Zhang, Xiaoqian, Shao, Tian-Hao, Dai, Yue, Shimada, Kenya, Liu, Zhengtai, Ye, Mao, Huang, Yaobo, Liu, Qihang, and Liu, Chang
Subjects: Condensed Matter - Materials Science
Abstract: Recently, unconventional antiferromagnets that enable the splitting of electronic spins have been theoretically proposed and experimentally realized, where the magnetic sublattices containing moments pointing at different directions are connected by a novel set of symmetries. Such spin splitting (SS) is substantial, $k$-dependent, and independent of the spin-orbit coupling strength, making these magnets promising materials for antiferromagnetic spintronics. Here, combined with angle-resolved photoemission spectroscopy (ARPES) and density functional theory (DFT) calculations, we perform a systematic study on CrSb, a metallic spin-split antiferromagnet candidate with $T_N$ = 703 K. Our data reveals the electronic structure of CrSb along both out-of-plane and in-plane momentum directions, which renders anisotropic $k$-dependent SS and agrees well with the calculational results. The magnitude of such SS reaches up to at least 0.8 eV at non-high-symmetry momentum points, which is significantly higher than the largest known SOC-induced SS. This compound expands the choice of materials in the field of antiferromagnetic spintronics and is likely to stimulate subsequent investigations of high-efficiency spintronic devices that are functional at room temperature., Comment: 14 pages, 4 figures
Published: 2024
Full Text: View/download PDF

20. Almost Optically Dark Galaxies in DECaLS (I): Detection, Optical Properties and Possible Origins

Author: Du, Lin, Du, Wei, Cheng, Cheng, Zhu, Ming, Yu, Haiyang, and Wu, Hong
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We report the discovery of eight optical counterparts of ALFALFA extragalactic objects from DECaLS, five of which are discovered for the first time. These objects were flagged as HI emission sources with no optical counterparts in SDSS before. Multi-band data reveal their unusual physical properties. They are faint and blue ($g-r=-0.35\sim0.55$), with quite low surface brightness ($\mu_{\rm g,peak}=24.88\sim26.41\,{\rm mag}/{\rm arcsec}^2$), irregular morphologies, low stellar masses ($log_{10}(M_{*}/M_\odot)=5.27\sim7.15$), low star formation rates ($SFR=0.21\sim9.24\times10^{-3}\,{M_\odot}\,{\rm yr}^{-1}$), and remarkably high HI-to-stellar mass ratios ($log_{10}(M_{\rm HI}/M_{*}) = 1.72\sim3.22$, except AGC\,215415). They deviate from the scaling relations between HI and optical properties defined by the ALFALFA sample and the baryonic Tully-Fisher relation. They agree well with the main sequence of star-forming galaxies but exhibit low star-forming efficiency. Based on their physical properties and environments, we speculate that six of these objects may have originated from tidal processes, while the remaining two appear to have isolated origins. They may have had a relatively calm evolutionary history and only begun to form stars recently., Comment: 32 pages, 11 figures, accepted by the Astrophysical Journal
Published: 2024

21. Compact bilinear pooling and multi-loss network for social media multimodal classification

Author: Li, Yushi, Zheng, Xin, Zhu, Ming, Mei, Jie, Chen, Ziwen, and Tao, Yunfei
Published: 2024
Full Text: View/download PDF

22. Cooperative enhancement of mechanical and tribological properties through tailoring TiN transition interface in boron nitride nanosheets reinforced copper composites

Author: Li, Zhong-Hua, Liu, Liang, You, Xin, Yi, Jian-Hong, Bao, Rui, Zhu, Ming-Yi, Lu, Song, and Pai, Jun-Jun
Published: 2024
Full Text: View/download PDF

23. A Multicenter, Randomized, Double-Blind, Parallel-Grouped, Positive-Controlled, Non-Inferiority Clinical Study to Evaluate the Efficacy and Safety of Injectable Calcium Hydroxylapatite Microsphere Hydrogel Fillers in the Correction of Nasolabial Fold in Chinese Subjects

Author: Pan, Yuyan, Luo, Zucheng, Chen, Shuwei, Lu, Nanhang, Zhang, Yong, Yang, Yanwen, Chen, Cheng, Liu, Jiaqi, Zhang, Rufan, Ge, Yining, Qi, Fazhi, and Zhu, Ming
Published: 2024
Full Text: View/download PDF

24. Event-Triggered Adaptive Fixed-Time Trajectory Tracking Control for Stratospheric Airship

Author: Sun, Peihao, Zhu, Ming, Zhang, Yifei, Chen, Tian, and Zheng, Zeiwei
Published: 2024
Full Text: View/download PDF

25. Growth process, defects, and dopants of bulk β-Ga2O3 semiconductor single crystals

Author: Wang, Yan-shen, Zhu, Ming-zhi, and Liu, Yuan
Published: 2024
Full Text: View/download PDF

26. Automated Cluster Detection of Health Care–Associated Infection Based on the Multisource Surveillance of Process Data in the Area Network: Retrospective Study of Algorithm Development and Validation

Author: Fan, Yunzhou, Wu, Yanyan, Cao, Xiongjing, Zou, Junning, Zhu, Ming, Dai, Di, Lu, Lin, Yin, Xiaoxv, and Xiong, Lijuan
Subjects: Computer applications to medicine. Medical informatics, R858-859.7
Abstract: BackgroundThe cluster detection of health care–associated infections (HAIs) is crucial for identifying HAI outbreaks in the early stages. ObjectiveWe aimed to verify whether multisource surveillance based on the process data in an area network can be effective in detecting HAI clusters. MethodsWe retrospectively analyzed the incidence of HAIs and 3 indicators of process data relative to infection, namely, antibiotic utilization rate in combination, inspection rate of bacterial specimens, and positive rate of bacterial specimens, from 4 independent high-risk units in a tertiary hospital in China. We utilized the Shewhart warning model to detect the peaks of the time-series data. Subsequently, we designed 5 surveillance strategies based on the process data for the HAI cluster detection: (1) antibiotic utilization rate in combination only, (2) inspection rate of bacterial specimens only, (3) positive rate of bacterial specimens only, (4) antibiotic utilization rate in combination + inspection rate of bacterial specimens + positive rate of bacterial specimens in parallel, and (5) antibiotic utilization rate in combination + inspection rate of bacterial specimens + positive rate of bacterial specimens in series. We used the receiver operating characteristic (ROC) curve and Youden index to evaluate the warning performance of these surveillance strategies for the detection of HAI clusters. ResultsThe ROC curves of the 5 surveillance strategies were located above the standard line, and the area under the curve of the ROC was larger in the parallel strategy than in the series strategy and the single-indicator strategies. The optimal Youden indexes were 0.48 (95% CI 0.29-0.67) at a threshold of 1.5 in the antibiotic utilization rate in combination–only strategy, 0.49 (95% CI 0.45-0.53) at a threshold of 0.5 in the inspection rate of bacterial specimens–only strategy, 0.50 (95% CI 0.28-0.71) at a threshold of 1.1 in the positive rate of bacterial specimens–only strategy, 0.63 (95% CI 0.49-0.77) at a threshold of 2.6 in the parallel strategy, and 0.32 (95% CI 0.00-0.65) at a threshold of 0.0 in the series strategy. The warning performance of the parallel strategy was greater than that of the single-indicator strategies when the threshold exceeded 1.5. ConclusionsThe multisource surveillance of process data in the area network is an effective method for the early detection of HAI clusters. The combination of multisource data and the threshold of the warning model are 2 important factors that influence the performance of the model.
Published: 2020
Full Text: View/download PDF

27. AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

Author: Zhang, Jianguo, Lan, Tian, Murthy, Rithesh, Liu, Zhiwei, Yao, Weiran, Zhu, Ming, Tan, Juntao, Hoang, Thai, Liu, Zuxin, Yang, Liangwei, Feng, Yihao, Kokane, Shirley, Awalgaonkar, Tulika, Niebles, Juan Carlos, Savarese, Silvio, Heinecke, Shelby, Wang, Huan, and Xiong, Caiming
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Autonomous agents powered by large language models (LLMs) have garnered significant research attention. However, fully harnessing the potential of LLMs for agent-based tasks presents inherent challenges due to the heterogeneous nature of diverse data sources featuring multi-turn trajectories. In this paper, we introduce \textbf{AgentOhana} as a comprehensive solution to address these challenges. \textit{AgentOhana} aggregates agent trajectories from distinct environments, spanning a wide array of scenarios. It meticulously standardizes and unifies these trajectories into a consistent format, streamlining the creation of a generic data loader optimized for agent training. Leveraging the data unification, our training pipeline maintains equilibrium across different data sources and preserves independent randomness across devices during dataset partitioning and model training. Additionally, we present \textbf{xLAM-v0.1}, a large action model tailored for AI agents, which demonstrates exceptional performance across various benchmarks. Begin the exploration at \url{https://github.com/SalesforceAIResearch/xLAM}., Comment: Add GitHub repo link at \url{https://github.com/SalesforceAIResearch/xLAM} and HuggingFace model link at \url{https://huggingface.co/Salesforce/xLAM-v0.1-r}
Published: 2024

28. HiFAST: an HI data calibration and imaging pipeline for FAST

Author: Jing, Yingjie, Wang, Jie, Xu, Chen, Liu, Ziming, Chen, Qingze, Liang, Tiantian, Xu, Jinlong, Cao, Yixian, Wang, Jing, Hu, Huijie, Zhang, Chuan-Peng, Guo, Qi, Gao, Liang, Ai, Mei, Gan, Hengqian, Gao, Xuyang, Han, Jinlin, Hou, Ligang, Hou, Zhipeng, Jiang, Peng, Kong, Xu, Li, Fujia, Liu, Zerui, Shao, Li, Pan, Hengxing, Pan, Jun, Qian, Lei, Sun, Jinghai, Tang, Ningyu, Yang, Qingliang, Zhang, Bo, Zhang, Zhiyu, and Zhu, Ming
Subjects: Astrophysics - Astrophysics of Galaxies, Astrophysics - Cosmology and Nongalactic Astrophysics, Astrophysics - Instrumentation and Methods for Astrophysics
Abstract: The Five-hundred-meter Aperture Spherical radio Telescope (FAST) has the largest aperture and a 19-beam L-band receiver, making it powerful for investigating the neutral hydrogen atomic gas (HI) in the universe. We present HiFAST (https://hifast.readthedocs.io), a dedicated, modular, and self-contained calibration and imaging pipeline for processing the HI data of FAST. The pipeline consists of frequency-dependent noise diode calibration, baseline fitting, standing wave removal using an FFT-based method, flux density calibration, stray radiation correction, and gridding to produce data cubes. These modules can be combined as needed to process the data from most FAST observation modes: tracking, drift scanning, On-The-Fly mapping, and most of their variants. With HiFAST, the RMS noises of the calibrated spectra from all 19 beams were only slightly (~ 5%) higher than the theoretical expectation. The results for the extended source M33 and the point sources are consistent with the results from Arecibo. The moment maps (0,1 and 2) of M33 agree well with the results from the Arecibo Galaxy Environment Survey (AGES) with a fractional difference of less than 10%. For a common sample of 221 sources with signal-to-noise ratio S/N >10 from the Arecibo Legacy Fast ALFA (ALFALFA) survey, the mean value of fractional difference in the integrated flux density, $S_{\mathrm{int}}$, between the two datasets is approximately 0.005 %, with a dispersion of 15.4%. Further checks on the integrated flux density of 23 sources with seven observations indicate that the variance in the flux density of the source with luminous objects ($S_\mathrm{int}$ $ > 2.5$ Jy km s$^{-1}$) is less than 5%. Our tests suggest that the FAST telescope, with the efficient, precise, and user-friendly pipeline HiFAST, will yield numerous significant scientific findings in the investigation of the HI in the universe., Comment: Accepted by SCPMA. 21 pages, 14 figures. The pipeline is accessible at https://hifast.readthedocs.io
Published: 2024
Full Text: View/download PDF

29. FASHI: A search for extragalactic OH megamasers with FAST

Author: Zhang, Chuan-Peng, Cheng, Cheng, Zhu, Ming, Xu, Jin-Long, and Jiang, Peng
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: The FAST All Sky HI survey (FASHI) is broader in frequency band and sky volume, and deeper in detection sensitivity than the Arecibo Legacy Fast ALFA survey (ALFALFA). To efficiently expand the sample of OH megamasers (OHMs), whose strongest line has a rest frequency of 1667.35903 MHz, we directly matched the IRAS Point Source Catalog Redshift (PSCz) catalog with the corresponding FASHI data cube. From 145 PSCz sources already covered by FASHI, we obtained 27 OHMs with a detection rate of 18.6%, including 9 previously known and 18 new ones, within a redshift range of $0.14314\lesssim z_{\rm OH} \lesssim0.27656$. We also measured the hyperfine ratio of nine OHMs between the 1667 and 1665 MHz lines. The ratio ranges from 1.32 to 15.22, with an average of $R_{1667:1665}=4.74$. In a fit to the $L_{\rm OH}$ vs. $L_{\rm FIR}$ relation, we have ${\rm log}L_{\rm OH}= (1.57\pm0.10){\rm log}L_{\rm FIR}-(15.80\pm1.19)$, which is almost the same as derived from previous observations. As expected, since the OHM sample was selected by cross-correlation with the IRAS-selected PSCz, our detected OHMs are [ultra]luminous infrared galaxies ([U]LIRGs). However, not all [U]LIRGs have detectable OH emission, suggesting that the OH emission may be triggered within a specific stage of the merger or can only be seen in specific orientations. In general, FAST, with its 19-beam array and UWB receiver, will be a powerful tool for observing more OHMs and unraveling their mystery in the future., Comment: 21 pages, 6 figures. Comments are welcome
Published: 2024
Full Text: View/download PDF

30. InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks

Author: Hu, Xueyu, Zhao, Ziyu, Wei, Shuang, Chai, Ziwei, Ma, Qianli, Wang, Guoyin, Wang, Xuwu, Su, Jing, Xu, Jingjing, Zhu, Ming, Cheng, Yao, Yuan, Jianbo, Li, Jiwei, Kuang, Kun, Yang, Yang, Yang, Hongxia, and Wu, Fei
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: In this paper, we introduce InfiAgent-DABench, the first benchmark specifically designed to evaluate LLM-based agents on data analysis tasks. These tasks require agents to end-to-end solving complex tasks by interacting with an execution environment. This benchmark contains DAEval, a dataset consisting of 257 data analysis questions derived from 52 CSV files, and an agent framework which incorporates LLMs to serve as data analysis agents for both serving and evaluation. Since data analysis questions are often open-ended and hard to evaluate without human supervision, we adopt a format-prompting technique to convert each question into a closed-form format so that they can be automatically evaluated. Our extensive benchmarking of 34 LLMs uncovers the current challenges encountered in data analysis tasks. In addition, building on top of our agent framework, we develop a specialized agent, DAAgent, which surpasses GPT-3.5 by 3.9% on DABench. Evaluation datasets and toolkits for InfiAgent-DABench are released at https://github.com/InfiAgent/InfiAgent ., Comment: 27 pages, 7 figures, work in progress
Published: 2024

31. Thermal conductivity and mechanical properties of fluorite-type porous (Ce0.2Zr0.2Ti0.2Sn0.2Ca0.2)O2-δ high-entropy ceramics

Author: Tang, Yingying, Xia, Yongfeng, Yao, Dongxu, Zhu, Ming, Zhao, Jun, and Zeng, Yu-Ping
Published: 2024
Full Text: View/download PDF

32. Service selection based on blockchain smart contracts in cloud-edge environment

Author: Ning, Yingying, Li, Jing, Zhu, Ming, and Liu, Chuanxi
Published: 2024
Full Text: View/download PDF

33. The roles of PD-L1 in the various stages of tumor metastasis

Author: He, Yinjun, Zhu, Ming, Lai, Xuan, Zhang, Honghe, and Jiang, Weiqin
Published: 2024
Full Text: View/download PDF

34. CoachLM: Automatic Instruction Revisions Improve the Data Quality in LLM Instruction Tuning

Author: Liu, Yilun, Tao, Shimin, Zhao, Xiaofeng, Zhu, Ming, Ma, Wenbing, Zhu, Junhao, Su, Chang, Hou, Yutai, Zhang, Miao, Zhang, Min, Ma, Hongxia, Zhang, Li, Yang, Hao, and Jiang, Yanfei
Subjects: Computer Science - Computation and Language
Abstract: Instruction tuning is crucial for enabling Language Learning Models (LLMs) in responding to human instructions. The quality of instruction pairs used for tuning greatly affects the performance of LLMs. However, the manual creation of high-quality instruction datasets is costly, leading to the adoption of automatic generation of instruction pairs by LLMs as a popular alternative. To ensure the high quality of LLM-generated instruction datasets, several approaches have been proposed. Nevertheless, existing methods either compromise dataset integrity by filtering a large proportion of samples, or are unsuitable for industrial applications. In this paper, instead of discarding low-quality samples, we propose CoachLM, a novel approach to enhance the quality of instruction datasets through automatic revisions on samples in the dataset. CoachLM is trained from the samples revised by human experts and significantly increases the proportion of high-quality samples in the dataset from 17.7% to 78.9%. The effectiveness of CoachLM is further assessed on various real-world instruction test sets. The results show that CoachLM improves the instruction-following capabilities of the instruction-tuned LLM by an average of 29.9%, which even surpasses larger LLMs with nearly twice the number of parameters. Furthermore, CoachLM is successfully deployed in a data management system for LLMs at Huawei, resulting in an efficiency improvement of up to 20% in the cleaning of 40k real-world instruction pairs. We release various assets of CoachLM, including the training data, code and test set (https://github.com/lunyiliu/CoachLM)., Comment: Accepted by ICDE 2024
Published: 2023

35. Formation of a massive lenticular galaxy under the tidal interaction with a group of dwarf galaxies

Author: Xu, Jin-Long, Zhu, Ming, Hess, Kelley M., Yu, Naiping, Zhang, Chuan-Peng, Liu, Xiao-Lan, Ai, Mei, Jiang, Peng, and Wang, Jie
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: Based on the atomic-hydrogen (HI) observations using the Five-hundred-meter Aperture Spherical radio Telescope (FAST), we present a detailed study of the gas-rich massive S0 galaxy NGC 1023 in a nearby galaxy group. The presence of an HI extended warped disk in NGC 1023 indicates that this S0 galaxy originated from a spiral galaxy. The data also suggest that NGC 1023 is interacting with four dwarf galaxies. In particular, one of the largest dwarf galaxies has fallen into the gas disk of NGC 1023, forming a rare bright-dark galaxy pair with a large gas clump. This clump shows the signature of a galaxy but has no optical counterpart, implying that it is a newly formed starless galaxy. Our results firstly suggest that a massive S0 galaxy in a galaxy group can form via the morphological transformation from a spiral under the joint action of multiple tidal interactions., Comment: 13 pages, 8 figures, Accepted for publication in the ApJ Letters
Published: 2023

36. Mental Health Diagnosis in the Digital Age: Harnessing Sentiment Analysis on Social Media Platforms upon Ultra-Sparse Feature Content

Author: Shao, Haijian, Zhu, Ming, and Zhai, Shengjie
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Amid growing global mental health concerns, particularly among vulnerable groups, natural language processing offers a tremendous potential for early detection and intervention of people's mental disorders via analyzing their postings and discussions on social media platforms. However, ultra-sparse training data, often due to vast vocabularies and low-frequency words, hinders the analysis accuracy. Multi-labeling and Co-occurrences of symptoms may also blur the boundaries in distinguishing similar/co-related disorders. To address these issues, we propose a novel semantic feature preprocessing technique with a three-folded structure: 1) mitigating the feature sparsity with a weak classifier, 2) adaptive feature dimension with modulus loops, and 3) deep-mining and extending features among the contexts. With enhanced semantic features, we train a machine learning model to predict and classify mental disorders. We utilize the Reddit Mental Health Dataset 2022 to examine conditions such as Anxiety, Borderline Personality Disorder (BPD), and Bipolar-Disorder (BD) and present solutions to the data sparsity challenge, highlighted by 99.81% non-zero elements. After applying our preprocessing technique, the feature sparsity decreases to 85.4%. Overall, our methods, when compared to seven benchmark models, demonstrate significant performance improvements: 8.0% in accuracy, 0.069 in precision, 0.093 in recall, 0.102 in F1 score, and 0.059 in AUC. This research provides foundational insights for mental health prediction and monitoring, providing innovative solutions to navigate challenges associated with ultra-sparse data feature and intricate multi-label classification in the domain of mental health analysis.
Published: 2023

37. FAST discovery of a fast neutral hydrogen outflow

Author: Su, Renzhi, Gu, Minfeng, Curran, S. J., Mahony, Elizabeth K., Tang, Ningyu, Allison, James R., Li, Di, Zhu, Ming, Aditya, J. N. H. S., Yoon, Hyein, Zheng, Zheng, and Wu, Zhongzu
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: In this letter, we report the discovery of a fast neutral hydrogen outflow in SDSS J145239.38+062738.0, a merging radio galaxy containing an optical type I active galactic nuclei (AGN). This discovery was made through observations conducted by the Five-hundred-meter Aperture Spherical radio Telescope (FAST) using redshifted 21-cm absorption. The outflow exhibits a blueshifted velocity likely up to $\sim-1000\,\rm km\,s^{-1}$ with respect to the systemic velocity of the host galaxy with an absorption strength of $\sim -0.6\,\rm mJy\,beam^{-1}$ corresponding to an optical depth of 0.002 at $v=-500\,\rm km\,s^{-1}$. The mass outflow rate ranges between $2.8\times10^{-2}$ and $3.6\, \rm M_\odot \, yr^{-1}$, implying an energy outflow rate ranging between $4.2\times10^{39}$ and $9.7\times10^{40}\rm\,erg\,s^{-1}$, assuming 100 K $
Published: 2023

38. Novel thick-target inverse kinematics method for the astrophysical 12C+12C fusion reaction

Author: Nan, Wei-Ke, Wang, You-Bao, Sheng, Yao-De, Su, Jun, Zhang, Yu-Qiang, Song, Lu-Yang, Shen, Yang-Ping, Cao, Fu-Qiang, Chen, Chen, Dong, Chao, Li, Yun-Ju, Li, Zhi-Hong, Lian, Gang, Nan, Wei, Wang, Qiang, Song, Na, Yan, Sheng-Quan, Zeng, Seng, Fan, Qi-Wen, Zhang, Hao, Zhu, Ming-Hao, Guo, Bing, and Liu, Wei-Ping
Published: 2024
Full Text: View/download PDF

39. Toward a direct measurement of the cosmic acceleration: The pilot observation of H I 21cm absorption line at FAST

Author: Kang, Jiangang, Lu, Chang-Zhi, Zhang, TongJie, and Zhu, Ming
Subjects: Astrophysics - Cosmology and Nongalactic Astrophysics
Abstract: This study presents results on detecting neutral atomic hydrogen (HI) 21cm absorption in the spectrum of PKS1413+135 at redshift $z=0.24670041$. The observation was conducted by FAST, with a spectral resolution of 10 Hz, using 10 minutes of observing time. The global spectral profile is examined by modeling the absorption line using a single Gaussian function with a resolution of 10 kHz within a 2 MHz bandwidth. The goal is to determine the rate of the latest cosmic acceleration by directly measuring redshift evolution of H I 21 cm absorption line with Hubble flow towards a same background Quasar over a decade or longer time span. This will serve as a detectable signal generated by the accelerated expansion of the Universe at redshift $z < 1$, referred to as redshift drift $\dot{z}$ or the SL effect. The measured HI gas column density in this DLA system is approximately equivalent to the initial observation value, considering uncertainties of the spin temperature of a spiral host galaxy. The high signal-to-noise ratio of 57, obtained at a 10 kHz resolution, strongly supports the feasibility of using the H I 21 cm absorption line in DLA systems to accurately measure the redshift drift rate at a precision level of around $10^{-10}$ per decade., Comment: 16 pages,6 figures, 2 tables, Accepted for publication by RAA
Published: 2023

40. FAST reveals new evidence for M94 as a merger

Author: Zhou, Ruilei, Zhu, Ming, Yang, Yanbin, Yu, Haiyang, Yuan, Lixia, Jiang, Peng, and Xi, Wenzhe
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We report the first high-sensitivity HI observation toward the spiral galaxy M94 with the Five-hundred-meter Aperture Spherical radio Telescope (FAST). From these observations, we discovered that M94 has a very extended HI disk, twice larger than that observed by THINGS, which is accompanied by an HI filament and seven HVCs (high velocity clouds) at different distances. The projected distances of these clouds and filament are less than 50 kpc from the galactic center. We measured a total integrated flux (including all clouds/filament) of 127.3 ($\pm$1) Jy km s$^{-1}$, corresponding to a H I mass of (6.51$\pm$0.06)$\times$10$^{8}$M$_{\odot}$, which is 63.0% more than that observed by THINGS. By comparing numerical simulations with the HI maps and the optical morphology of M94, we suggest that M94 is likely a remnant of a major merger of two galaxies, and the HVCs and HI filament could be the tidal features originated from the first collision of the merger happened about 5 Gyr ago. Furthermore, we found a seemingly isolated HI cloud at a projection distance of 109 kpc without any optical counterpart detected. We discussed the possibilities of the origin of this cloud, such as dark dwarf galaxy and RELHIC (REionization-Limited HI Cloud). Our results demonstrate that high-sensitivity and wide-field HI imaging is important in revealing the diffuse cold gas structures and tidal debris which is crucial to understanding the dynamical evolution of galaxies., Comment: 14 pages, 8 figures
Published: 2023
Full Text: View/download PDF

41. FAST polarization mapping of the SNR VRO 42.05.01

Author: Xiao, Li, Zhu, Ming, Sun, Xiao-Hui, Jiang, Peng, and Sun, Chun
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We have obtained the polarization data cube of the VRO 42.05.01 supernova remnant at 1240 MHz using the Five-hundred-meter Aperture Spherical radio Telescope (FAST). Three-dimensional Faraday Synthesis is applied to the FAST data to derive the Faraday depth spectrum. The peak Faraday depth map shows a large area of enhanced foreground RM of ~60 rad m-2 extending along the remnant's "wing" section, which coincides with a large-scale HI shell at -20 km/s. The two depolarization patches within the "wing" region with RM of 97 rad m-2 and 55 rad m-2 coincide with two HI structures in the HI shell. Faraday screen model fitting on the Canadian Galactic Plane Survey (CGPS) 1420 MHz full-scale polarization data reveals a distance of 0.7-0.8d_{SNR} in front of the SNR with enhanced regular magnetic field there. The highly piled-up magnetic field indicates that the HI shell at -20 km/s could originate from an old evolved SNR., Comment: 9 pages, 8 figures, accepted by ApJ
Published: 2023

42. Prediction of microvascular invasion in hepatocellular carcinoma with conventional ultrasound, Sonazoid-enhanced ultrasound, and biochemical indicator: a multicenter study

Author: Lu, Dan, Wang, Li-Fan, Han, Hong, Li, Lin-Lin, Kong, Wen-Tao, Zhou, Qian, Zhou, Bo-Yang, Sun, Yi-Kang, Yin, Hao-Hao, Zhu, Ming-Rui, Hu, Xin-Yuan, Lu, Qing, Xia, Han-Sheng, Wang, Xi, Zhao, Chong-Ke, Zhou, Jian-Hua, and Xu, Hui-Xiong
Published: 2024
Full Text: View/download PDF

43. Evolution of carbides and Charpy toughness in a low alloy bainitic steel during step-up aging process

Author: Jin, Long, Zhang, Kun, Zhu, Ming-Liang, and Xuan, Fu-Zhen
Published: 2024
Full Text: View/download PDF

44. Ground settlement prediction for highway subgrades with sparse data using regression Kriging

Author: Huang, Lei, Qin, Wei, Dai, Guo-liang, Zhu, Ming-xing, Liu, Lei-Lei, Huang, Ling-Jun, Yang, Shan-Pian, and Ge, Miao-Miao
Published: 2024
Full Text: View/download PDF

45. Subtype prediction of intrahepatic cholangiocarcinoma using dynamic contrast-enhanced ultrasound

Author: Zhu, Ming-Rui, Zhao, Chong-Ke, Sun, Yi-Kang, Li, Xiao-Long, Yin, Hao-Hao, Lu, Dan, Ye, Xin, Hu, Xin-Yuan, Wang, Xi, Xia, Han-Sheng, Han, Hong, Zhou, Bo-Yang, Xu, Hui-Xiong, and Wang, Li-Fan
Published: 2024
Full Text: View/download PDF

46. Gene variants and clinical characteristics of children with sitosterolemia

Author: Gu, Rui, Wang, Hui, Wang, Chun-Lin, Lu, Mei, Miao, Miao, Huang, Meng-Na, Chen, Yi, Dai, Yang-Li, Zhu, Ming-Qiang, Zhou, Qiong, and Zou, Chao-Chun
Published: 2024
Full Text: View/download PDF

47. The chromatin factors SET-26 and HCF-1 oppose the histone deacetylase HDA-1 in longevity and gene regulation in C. elegans

Author: Emerson, Felicity J., Chiu, Caitlin, Lin, Laura Y., Riedel, Christian G., Zhu, Ming, and Lee, Siu Sylvia
Published: 2024
Full Text: View/download PDF

48. Tunnelling of electrons via the neighboring atom

Author: Zhu, Ming, Tong, Jihong, Liu, Xiwang, Yang, Weifeng, Gong, Xiaochun, Jiang, Wenyu, Lu, Peifen, Li, Hui, Song, Xiaohong, and Wu, Jian
Published: 2024
Full Text: View/download PDF

49. Comparisons of mpMRI, 68Ga-PSMA PET/CT and mpMRI combined with 68Ga-PSMA PET/CT in diagnosing prostate cancer based on tumor detection, localization and staging

Author: Mai, Zhipeng, Zhu, Ming, Feng, Tianrui, Zhou, Zhien, Zhou, Yi, Wang, Dong, Yuan, Runqiang, Xiao, Yu, Wang, Jiarou, Sun, Hao, and Yan, Weigang
Published: 2024
Full Text: View/download PDF

50. Contribution of systemic factors on macular vessel density: a sex-specific population-based study

Author: Chan, Wilson Chung Fai, Zhu, Ming Ming, Choy, Bonnie Nga Kwan, Chan, Jonathan Cheuk Hung, Ng, Alex Lap Ki, Shih, Kendrick Co, Cheung, Janice Jing Chee, Wong, Jasper Ka Wai, Shum, Jennifer Wei Huen, Ni, Michael Yuxuan, Lai, Jimmy Shiu Ming, Leung, Gabriel Matthew, and Wong, Ian Yat Hin
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Category

Publication Type

Journal

Region

Database

Publisher

11,156 results on '"Zhu, Ming"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources