Author: "LI, Ning" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"LI, Ning"' showing total 42,546 results

Start Over Author "LI, Ning"

42,546 results on '"LI, Ning"'

1. Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following

Author: He, Yun, Jin, Di, Wang, Chaoqi, Bi, Chloe, Mandyam, Karishma, Zhang, Hejia, Zhu, Chen, Li, Ning, Xu, Tengyu, Lv, Hongjiang, Bhosale, Shruti, Zhu, Chenguang, Sankararaman, Karthik Abinav, Helenowski, Eryk, Kambadur, Melanie, Tayade, Aditya, Ma, Hao, Fang, Han, and Wang, Sinong
Subjects: Computer Science - Computation and Language
Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities in various tasks, including instruction following, which is crucial for aligning model outputs with user expectations. However, evaluating LLMs' ability to follow instructions remains challenging due to the complexity and subjectivity of human language. Current benchmarks primarily focus on single-turn, monolingual instructions, which do not adequately reflect the complexities of real-world applications that require handling multi-turn and multilingual interactions. To address this gap, we introduce Multi-IF, a new benchmark designed to assess LLMs' proficiency in following multi-turn and multilingual instructions. Multi-IF, which utilizes a hybrid framework combining LLM and human annotators, expands upon the IFEval by incorporating multi-turn sequences and translating the English prompts into another 7 languages, resulting in a dataset of 4,501 multilingual conversations, where each has three turns. Our evaluation of 14 state-of-the-art LLMs on Multi-IF reveals that it presents a significantly more challenging task than existing benchmarks. All the models tested showed a higher rate of failure in executing instructions correctly with each additional turn. For example, o1-preview drops from 0.877 at the first turn to 0.707 at the third turn in terms of average accuracy over all languages. Moreover, languages with non-Latin scripts (Hindi, Russian, and Chinese) generally exhibit higher error rates, suggesting potential limitations in the models' multilingual capabilities. We release Multi-IF prompts and the evaluation code base to encourage further research in this critical area.
Published: 2024

2. ELF-Gym: Evaluating Large Language Models Generated Features for Tabular Prediction

Author: Zhang, Yanlin, Li, Ning, Gan, Quan, Zhang, Weinan, Wipf, David, and Wang, Minjie
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Crafting effective features is a crucial yet labor-intensive and domain-specific task within machine learning pipelines. Fortunately, recent advancements in Large Language Models (LLMs) have shown promise in automating various data science tasks, including feature engineering. But despite this potential, evaluations thus far are primarily based on the end performance of a complete ML pipeline, providing limited insight into precisely how LLMs behave relative to human experts in feature engineering. To address this gap, we propose ELF-Gym, a framework for Evaluating LLM-generated Features. We curated a new dataset from historical Kaggle competitions, including 251 "golden" features used by top-performing teams. ELF-Gym then quantitatively evaluates LLM-generated features by measuring their impact on downstream model performance as well as their alignment with expert-crafted features through semantic and functional similarity assessments. This approach provides a more comprehensive evaluation of disparities between LLMs and human experts, while offering valuable insights into specific areas where LLMs may have room for improvement. For example, using ELF-Gym we empirically demonstrate that, in the best-case scenario, LLMs can semantically capture approximately 56% of the golden features, but at the more demanding implementation level this overlap drops to 13%. Moreover, in other cases LLMs may fail completely, particularly on datasets that require complex features, indicating broad potential pathways for improvement.
Published: 2024

3. Agentic Information Retrieval

Author: Zhang, Weinan, Liao, Junwei, Li, Ning, and Du, Kounianhua
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence
Abstract: What will information entry look like in the next generation of digital products? Since the 1970s, user access to relevant information has relied on domain-specific architectures of information retrieval (IR). Over the past two decades, the advent of modern IR systems, including web search engines and personalized recommender systems, has greatly improved the efficiency of retrieving relevant information from vast data corpora. However, the core paradigm of these IR systems remains largely unchanged, relying on filtering a predefined set of candidate items. Since 2022, breakthroughs in large language models (LLMs) have begun transforming how information is accessed, establishing a new technical paradigm. In this position paper, we introduce Agentic Information Retrieval (Agentic IR), a novel IR paradigm shaped by the capabilities of LLM agents. Agentic IR expands the scope of accessible tasks and leverages a suite of new techniques to redefine information retrieval. We discuss three types of cutting-edge applications of agentic IR and the challenges faced. We propose that agentic IR holds promise for generating innovative applications, potentially becoming a central information entry point in future digital ecosystems., Comment: 11 pages, position paper
Published: 2024

4. A QoE-Aware Split Inference Accelerating Algorithm for NOMA-based Edge Intelligence

Author: Yuan, Xin, Li, Ning, Chen, Quan, Xu, Wenchao, Zhang, Zhaoxin, and Guo, Song
Subjects: Computer Science - Machine Learning
Abstract: Even the AI has been widely used and significantly changed our life, deploying the large AI models on resource limited edge devices directly is not appropriate. Thus, the model split inference is proposed to improve the performance of edge intelligence, in which the AI model is divided into different sub models and the resource-intensive sub model is offloaded to edge server wirelessly for reducing resource requirements and inference latency. However, the previous works mainly concentrate on improving and optimizing the system QoS, ignore the effect of QoE which is another critical item for the users except for QoS. Even the QoE has been widely learned in EC, considering the differences between task offloading in EC and split inference in EI, and the specific issues in QoE which are still not addressed in EC and EI, these algorithms cannot work effectively in edge split inference scenarios. Thus, an effective resource allocation algorithm is proposed in this paper, for accelerating split inference in EI and achieving the tradeoff between inference delay, QoE, and resource consumption, abbreviated as ERA. Specifically, the ERA takes the resource consumption, QoE, and inference latency into account to find the optimal model split strategy and resource allocation strategy. Since the minimum inference delay and resource consumption, and maximum QoE cannot be satisfied simultaneously, the gradient descent based algorithm is adopted to find the optimal tradeoff between them. Moreover, the loop iteration GD approach is developed to reduce the complexity of the GD algorithm caused by parameter discretization. Additionally, the properties of the proposed algorithms are investigated, including convergence, complexity, and approximation error. The experimental results demonstrate that the performance of ERA is much better than that of the previous studies., Comment: 16pages, 19figures. arXiv admin note: substantial text overlap with arXiv:2312.15850
Published: 2024

5. Orthogonal Mode Decomposition for Finite Discrete Signals

Author: Li, Ning and Li, Lezhi
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: In this paper, an orthogonal mode decomposition method is proposed to decompose ffnite length real signals on both the real and imaginary axes of the complex plane. The interpolation function space of ffnite length discrete signal is constructed, and the relationship between the dimensionality of the interpolation function space and its subspaces and the band width of the interpolation function is analyzed. It is proved that the intrinsic mode is actually the narrow band signal whose intrinsic instantaneous frequency is always positive (or always negative). Thus, the eigenmode decomposition problem is transformed into the orthogonal projection problem of interpolation function space to its low frequency subspace or narrow band subspace. Different from the existing mode decomposition methods, the orthogonal modal decomposition is a local time-frequency domain algorithm. Each operation extracts a speciffc mode. The global decomposition results obtained under the precise deffnition of eigenmodes have uniqueness and orthogonality. The computational complexity of the orthogonal mode decomposition method is also much smaller than that of the existing mode decomposition methods.
Published: 2024

6. Can AI Replace Human Subjects? A Large-Scale Replication of Psychological Experiments with LLMs

Author: Cui, Ziyan, Li, Ning, and Zhou, Huaikang
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Economics - General Economics
Abstract: Artificial Intelligence (AI) is increasingly being integrated into scientific research, particularly in the social sciences, where understanding human behavior is critical. Large Language Models (LLMs) like GPT-4 have shown promise in replicating human-like responses in various psychological experiments. However, the extent to which LLMs can effectively replace human subjects across diverse experimental contexts remains unclear. Here, we conduct a large-scale study replicating 154 psychological experiments from top social science journals with 618 main effects and 138 interaction effects using GPT-4 as a simulated participant. We find that GPT-4 successfully replicates 76.0 percent of main effects and 47.0 percent of interaction effects observed in the original studies, closely mirroring human responses in both direction and significance. However, only 19.44 percent of GPT-4's replicated confidence intervals contain the original effect sizes, with the majority of replicated effect sizes exceeding the 95 percent confidence interval of the original studies. Additionally, there is a 71.6 percent rate of unexpected significant results where the original studies reported null findings, suggesting potential overestimation or false positives. Our results demonstrate the potential of LLMs as powerful tools in psychological research but also emphasize the need for caution in interpreting AI-driven findings. While LLMs can complement human studies, they cannot yet fully replace the nuanced insights provided by human subjects., Comment: 5 figures, 2 tables
Published: 2024

7. Multi-Modal Dialogue State Tracking for Playing GuessWhich Game

Author: Pang, Wei, Duan, Ruixue, Yang, Jinfu, and Li, Ning
Subjects: Computer Science - Artificial Intelligence
Abstract: GuessWhich is an engaging visual dialogue game that involves interaction between a Questioner Bot (QBot) and an Answer Bot (ABot) in the context of image-guessing. In this game, QBot's objective is to locate a concealed image solely through a series of visually related questions posed to ABot. However, effectively modeling visually related reasoning in QBot's decision-making process poses a significant challenge. Current approaches either lack visual information or rely on a single real image sampled at each round as decoding context, both of which are inadequate for visual reasoning. To address this limitation, we propose a novel approach that focuses on visually related reasoning through the use of a mental model of the undisclosed image. Within this framework, QBot learns to represent mental imagery, enabling robust visual reasoning by tracking the dialogue state. The dialogue state comprises a collection of representations of mental imagery, as well as representations of the entities involved in the conversation. At each round, QBot engages in visually related reasoning using the dialogue state to construct an internal representation, generate relevant questions, and update both the dialogue state and internal representation upon receiving an answer. Our experimental results on the VisDial datasets (v0.5, 0.9, and 1.0) demonstrate the effectiveness of our proposed model, as it achieves new state-of-the-art performance across all metrics and datasets, surpassing previous state-of-the-art models. Codes and datasets from our experiments are freely available at \href{https://github.com/xubuvd/GuessWhich}., Comment: Published at CICAI 2023 (CAAI-A), codes at https://github.com/xubuvd/GuessWhich
Published: 2024
Full Text: View/download PDF

8. Enhancing Visual Dialog State Tracking through Iterative Object-Entity Alignment in Multi-Round Conversations

Author: Pang, Wei, Duan, Ruixue, Yang, Jinfu, and Li, Ning
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition
Abstract: Visual Dialog (VD) is a task where an agent answers a series of image-related questions based on a multi-round dialog history. However, previous VD methods often treat the entire dialog history as a simple text input, disregarding the inherent conversational information flows at the round level. In this paper, we introduce Multi-round Dialogue State Tracking model (MDST), a framework that addresses this limitation by leveraging the dialogue state learned from dialog history to answer questions. MDST captures each round of dialog history, constructing internal dialogue state representations defined as 2-tuples of vision-language representations. These representations effectively ground the current question, enabling the generation of accurate answers. Experimental results on the VisDial v1.0 dataset demonstrate that MDST achieves a new state-of-the-art performance in generative setting. Furthermore, through a series of human studies, we validate the effectiveness of MDST in generating long, consistent, and human-like answers while consistently answering a series of questions correctly., Comment: This article has been accepted in CAAI Transactions on Intelligence Technology! Article ID: CIT2_12370, Article DOI: 10.1049/cit2.12370
Published: 2024
Full Text: View/download PDF

9. From Text to Insight: Leveraging Large Language Models for Performance Evaluation in Management

Author: Li, Ning, Zhou, Huaikang, and Xu, Mingze
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Emerging Technologies, Computer Science - Human-Computer Interaction, Economics - General Economics
Abstract: This study explores the potential of Large Language Models (LLMs), specifically GPT-4, to enhance objectivity in organizational task performance evaluations. Through comparative analyses across two studies, including various task performance outputs, we demonstrate that LLMs can serve as a reliable and even superior alternative to human raters in evaluating knowledge-based performance outputs, which are a key contribution of knowledge workers. Our results suggest that GPT ratings are comparable to human ratings but exhibit higher consistency and reliability. Additionally, combined multiple GPT ratings on the same performance output show strong correlations with aggregated human performance ratings, akin to the consensus principle observed in performance evaluation literature. However, we also find that LLMs are prone to contextual biases, such as the halo effect, mirroring human evaluative biases. Our research suggests that while LLMs are capable of extracting meaningful constructs from text-based data, their scope is currently limited to specific forms of performance evaluation. By highlighting both the potential and limitations of LLMs, our study contributes to the discourse on AI role in management studies and sets a foundation for future research to refine AI theoretical and practical applications in management., Comment: 39 pages, 8 figures, 5 tables
Published: 2024

10. PromptSAM+: Malware Detection based on Prompt Segment Anything Model

Author: Wei, Xingyuan, Liu, Yichen, Li, Ce, Li, Ning, Sun, Degang, and Wang, Yan
Subjects: Computer Science - Cryptography and Security, F.2.2, I.2.7, F.2.2, I.2.7
Abstract: Machine learning and deep learning (ML/DL) have been extensively applied in malware detection, and some existing methods demonstrate robust performance. However, several issues persist in the field of malware detection: (1) Existing work often overemphasizes accuracy at the expense of practicality, rarely considering false positive and false negative rates as important metrics. (2) Considering the evolution of malware, the performance of classifiers significantly declines over time, greatly reducing the practicality of malware detectors. (3) Prior ML/DL-based efforts heavily rely on ample labeled data for model training, largely dependent on feature engineering or domain knowledge to build feature databases, making them vulnerable if correct labels are scarce. With the development of computer vision, vision-based malware detection technology has also rapidly evolved. In this paper, we propose a visual malware general enhancement classification framework, `PromptSAM+', based on a large visual network segmentation model, the Prompt Segment Anything Model(named PromptSAM+). Our experimental results indicate that 'PromptSAM+' is effective and efficient in malware detection and classification, achieving high accuracy and low rates of false positives and negatives. The proposed method outperforms the most advanced image-based malware detection technologies on several datasets. 'PromptSAM+' can mitigate aging in existing image-based malware classifiers, reducing the considerable manpower needed for labeling new malware samples through active learning. We conducted experiments on datasets for both Windows and Android platforms, achieving favorable outcomes. Additionally, our ablation experiments on several datasets demonstrate that our model identifies effective modules within the large visual network., Comment: 13pages, 10figures
Published: 2024

11. Mitigating the Impact of Malware Evolution on API Sequence-based Windows Malware Detector

Author: Wei, Xingyuan, Li, Ce, Lv, Qiujian, Li, Ning, Sun, Degang, and Wang, Yan
Subjects: Computer Science - Cryptography and Security, F.2.2, I.2.7
Abstract: In dynamic Windows malware detection, deep learning models are extensively deployed to analyze API sequences. Methods based on API sequences play a crucial role in malware prevention. However, due to the continuous updates of APIs and the changes in API sequence calls leading to the constant evolution of malware variants, the detection capability of API sequence-based malware detection models significantly diminishes over time. We observe that the API sequences of malware samples before and after evolution usually have similar malicious semantics. Specifically, compared to the original samples, evolved malware samples often use the API sequences of the pre-evolution samples to achieve similar malicious behaviors. For instance, they access similar sensitive system resources and extend new malicious functions based on the original functionalities. In this paper, we propose a frame(MME), a framework that can enhance existing API sequence-based malware detectors and mitigate the adverse effects of malware evolution. To help detection models capture the similar semantics of these post-evolution API sequences, our framework represents API sequences using API knowledge graphs and system resource encodings and applies contrastive learning to enhance the model's encoder. Results indicate that, compared to Regular Text-CNN, our framework can significantly reduce the false positive rate by 13.10% and improve the F1-Score by 8.47% on five years of data, achieving the best experimental results. Additionally, evaluations show that our framework can save on the human costs required for model maintenance. We only need 1% of the budget per month to reduce the false positive rate by 11.16% and improve the F1-Score by 6.44%., Comment: 13pages, 11 figures
Published: 2024

12. Reform of China’s Science and Technology System in the Xi Jinping Era

Author: Cao, Cong, Li, Ning, Li, Xia, and Liu, Li
Published: 2018
Full Text: View/download PDF

13. Generative AI Enhances Team Performance and Reduces Need for Traditional Teams

Author: Li, Ning, Zhou, Huaikang, and Mikel-Hong, Kris
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Artificial Intelligence, Economics - General Economics
Abstract: Recent advancements in generative artificial intelligence (AI) have transformed collaborative work processes, yet the impact on team performance remains underexplored. Here we examine the role of generative AI in enhancing or replacing traditional team dynamics using a randomized controlled experiment with 435 participants across 122 teams. We show that teams augmented with generative AI significantly outperformed those relying solely on human collaboration across various performance measures. Interestingly, teams with multiple AIs did not exhibit further gains, indicating diminishing returns with increased AI integration. Our analysis suggests that centralized AI usage by a few team members is more effective than distributed engagement. Additionally, individual-AI pairs matched the performance of conventional teams, suggesting a reduced need for traditional team structures in some contexts. However, despite this capability, individual-AI pairs still fell short of the performance levels achieved by AI-assisted teams. These findings underscore that while generative AI can replace some traditional team functions, more comprehensively integrating AI within team structures provides superior benefits, enhancing overall effectiveness beyond individual efforts., Comment: 55 pages, 8 figures
Published: 2024

14. Look into the Future: Deep Contextualized Sequential Recommendation

Author: Zheng, Lei, Li, Ning, Huang, Yanhuan, Xu, Ruiwen, Zhang, Weinan, and Yu, Yong
Subjects: Computer Science - Information Retrieval
Abstract: Sequential recommendation aims to estimate how a user's interests evolve over time via uncovering valuable patterns from user behavior history. Many previous sequential models have solely relied on users' historical information to model the evolution of their interests, neglecting the crucial role that future information plays in accurately capturing these dynamics. However, effectively incorporating future information in sequential modeling is non-trivial since it is impossible to make the current-step prediction for any target user by leveraging his future data. In this paper, we propose a novel framework of sequential recommendation called Look into the Future (LIFT), which builds and leverages the contexts of sequential recommendation. In LIFT, the context of a target user's interaction is represented based on i) his own past behaviors and ii) the past and future behaviors of the retrieved similar interactions from other users. As such, the learned context will be more informative and effective in predicting the target user's behaviors in sequential recommendation without temporal data leakage. Furthermore, in order to exploit the intrinsic information embedded within the context itself, we introduce an innovative pretraining methodology incorporating behavior masking. In our extensive experiments on five real-world datasets, LIFT achieves significant performance improvement on click-through rate prediction and rating prediction tasks in sequential recommendation over strong baselines, demonstrating that retrieving and leveraging relevant contexts from the global user pool greatly benefits sequential recommendation. The experiment code is provided at https://anonymous.4open.science/r/LIFT-277C/Readme.md., Comment: arXiv admin note: text overlap with arXiv:2404.18304 by other authors
Published: 2024

15. Improving classifier-based effort-aware software defect prediction by reducing ranking errors

Author: Guo, Yuchen, Shepperd, Martin, and Li, Ning
Subjects: Computer Science - Software Engineering, D.2
Abstract: Context: Software defect prediction utilizes historical data to direct software quality assurance resources to potentially problematic components. Effort-aware (EA) defect prediction prioritizes more bug-like components by taking cost-effectiveness into account. In other words, it is a ranking problem, however, existing ranking strategies based on classification, give limited consideration to ranking errors. Objective: Improve the performance of classifier-based EA ranking methods by focusing on ranking errors. Method: We propose a ranking score calculation strategy called EA-Z which sets a lower bound to avoid near-zero ranking errors. We investigate four primary EA ranking strategies with 16 classification learners, and conduct the experiments for EA-Z and the other four existing strategies. Results: Experimental results from 72 data sets show EA-Z is the best ranking score calculation strategy in terms of Recall@20% and Popt when considering all 16 learners. For particular learners, imbalanced ensemble learner UBag-svm and UBst-rf achieve top performance with EA-Z. Conclusion: Our study indicates the effectiveness of reducing ranking errors for classifier-based effort-aware defect prediction. We recommend using EA-Z with imbalanced ensemble learning., Comment: 10 pages with 12 figures. Accepted by International Conference on Evaluation and Assessment in Software Engineering (EASE) 2024
Published: 2024

16. Isospin violation effect and three-body decays of the $T_{cc}^{+}$ state

Author: Sun, Zhi-Feng, Li, Ning, and Liu, Xiang
Subjects: High Energy Physics - Phenomenology
Abstract: In this work, we make a study of $T_{cc}^+$ state observed by the LHCb collaboration in 2021. In obtaining the effective potentials using the One-Boson-Exchange Potential Model we use an exponential form factor, and find that in the short and medium range, the contributions of the $\pi$, $\rho$ and $\omega$ exchanges are comparable while in the long range the pion-exchange contribution is dominant. Based on the assumption that $T_{cc}^+$ is a loosely bound state of $D^*D$, we focus on its three-body decay using the meson-exchange method. Considering that the difference between the thresholds of $D^{*+}D^0$ and $D^{*0}D^+$ is even larger than the binding energy of $T_{cc}^+$, the isospin-breaking effect is amplified by the small binding energy of $T_{cc}^+$. Explicitly including such an isospin-breaking effect we obtain, by solving the Schr\"{o}dinger equation, that the probability of the isoscalar component is about $91\%$ while that of the isovector component is around $9\%$ for $T_{cc}^+$. Using the experimental value of the mass of $T_{cc}^+$ as an input, we obtain the wave function of $T_{cc}^+$ and further obtain its width via the three-body hadronic as well as the radiative decays. The total width we obtain is in agreement with the experimental value of the LHCb measurement with a unitarised Breit-Wigner profile. Conversely, the current results support the conclusion that $T_{cc}^+$ is a hadronic molecule of $D^*D$., Comment: 10 pages, 2 figures
Published: 2024

17. 4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on Relational DBs

Author: Wang, Minjie, Gan, Quan, Wipf, David, Cai, Zhenkun, Li, Ning, Tang, Jianheng, Zhang, Yanlin, Zhang, Zizhao, Mao, Zunyao, Song, Yakun, Wang, Yanbo, Li, Jiahang, Zhang, Han, Yang, Guang, Qin, Xiao, Lei, Chuan, Zhang, Muhan, Zhang, Weinan, Faloutsos, Christos, and Zhang, Zheng
Subjects: Computer Science - Machine Learning, Computer Science - Databases
Abstract: Although RDBs store vast amounts of rich, informative data spread across interconnected tables, the progress of predictive machine learning models as applied to such tasks arguably falls well behind advances in other domains such as computer vision or natural language processing. This deficit stems, at least in part, from the lack of established/public RDB benchmarks as needed for training and evaluation purposes. As a result, related model development thus far often defaults to tabular approaches trained on ubiquitous single-table benchmarks, or on the relational side, graph-based alternatives such as GNNs applied to a completely different set of graph datasets devoid of tabular characteristics. To more precisely target RDBs lying at the nexus of these two complementary regimes, we explore a broad class of baseline models predicated on: (i) converting multi-table datasets into graphs using various strategies equipped with efficient subsampling, while preserving tabular characteristics; and (ii) trainable models with well-matched inductive biases that output predictions based on these input subgraphs. Then, to address the dearth of suitable public benchmarks and reduce siloed comparisons, we assemble a diverse collection of (i) large-scale RDB datasets and (ii) coincident predictive tasks. From a delivery standpoint, we operationalize the above four dimensions (4D) of exploration within a unified, scalable open-source toolbox called 4DBInfer. We conclude by presenting evaluations using 4DBInfer, the results of which highlight the importance of considering each such dimension in the design of RDB predictive models, as well as the limitations of more naive approaches such as simply joining adjacent tables. Our source code is released at https://github.com/awslabs/multi-table-benchmark ., Comment: Under review
Published: 2024

18. Retrieval and Distill: A Temporal Data Shift-Free Paradigm for Online Recommendation System

Author: Zheng, Lei, Li, Ning, Zhang, Weinan, and Yu, Yong
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence
Abstract: Current recommendation systems are significantly affected by a serious issue of temporal data shift, which is the inconsistency between the distribution of historical data and that of online data. Most existing models focus on utilizing updated data, overlooking the transferable, temporal data shift-free information that can be learned from shifting data. We propose the Temporal Invariance of Association theorem, which suggests that given a fixed search space, the relationship between the data and the data in the search space keeps invariant over time. Leveraging this principle, we designed a retrieval-based recommendation system framework that can train a data shift-free relevance network using shifting data, significantly enhancing the predictive performance of the original model in the recommendation system. However, retrieval-based recommendation models face substantial inference time costs when deployed online. To address this, we further designed a distill framework that can distill information from the relevance network into a parameterized module using shifting data. The distilled model can be deployed online alongside the original model, with only a minimal increase in inference time. Extensive experiments on multiple real datasets demonstrate that our framework significantly improves the performance of the original model by utilizing shifting data.
Published: 2024

19. Quantum-activated neural reservoirs on-chip open up large hardware security models for resilient authentication

Author: He, Zhao, Elizarov, Maxim S., Li, Ning, Xiang, Fei, and Fratalocchi, Andrea
Subjects: Condensed Matter - Disordered Systems and Neural Networks, Computer Science - Artificial Intelligence, Computer Science - Cryptography and Security
Abstract: Quantum artificial intelligence is a frontier of artificial intelligence research, pioneering quantum AI-powered circuits to address problems beyond the reach of deep learning with classical architectures. This work implements a large-scale quantum-activated recurrent neural network possessing more than 3 trillion hardware nodes/cm$^2$, originating from repeatable atomic-scale nucleation dynamics in an amorphous material integrated on-chip, controlled with 0.07 nW electric power per readout channel. Compared to the best-performing reservoirs currently reported, this implementation increases the scale of the network by two orders of magnitude and reduces the power consumption by six, reaching power efficiencies in the range of the human brain, dissipating 0.2 nW/neuron. When interrogated by a classical input, the chip implements a large-scale hardware security model, enabling dictionary-free authentication secure against statistical inference attacks, including AI's present and future development, even for an adversary with a copy of all the classical components available. Experimental tests report 99.6% reliability, 100% user authentication accuracy, and an ideal 50% key uniqueness. Due to its quantum nature, the chip supports a bit density per feature size area three times higher than the best technology available, with the capacity to store more than $2^{1104}$ keys in a footprint of 1 cm$^2$. Such a quantum-powered platform could help counteract the emerging form of warfare led by the cybercrime industry in breaching authentication to target small to large-scale facilities, from private users to intelligent energy grids.
Published: 2024

20. AACP: Aesthetics assessment of children's paintings based on self-supervised learning

Author: Jiang, Shiqi, Li, Ning, Shi, Chen, Guo, Liping, Wang, Changbo, and Li, Chenhui
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The Aesthetics Assessment of Children's Paintings (AACP) is an important branch of the image aesthetics assessment (IAA), playing a significant role in children's education. This task presents unique challenges, such as limited available data and the requirement for evaluation metrics from multiple perspectives. However, previous approaches have relied on training large datasets and subsequently providing an aesthetics score to the image, which is not applicable to AACP. To solve this problem, we construct an aesthetics assessment dataset of children's paintings and a model based on self-supervised learning. 1) We build a novel dataset composed of two parts: the first part contains more than 20k unlabeled images of children's paintings; the second part contains 1.2k images of children's paintings, and each image contains eight attributes labeled by multiple design experts. 2) We design a pipeline that includes a feature extraction module, perception modules and a disentangled evaluation module. 3) We conduct both qualitative and quantitative experiments to compare our model's performance with five other methods using the AACP dataset. Our experiments reveal that our method can accurately capture aesthetic features and achieve state-of-the-art performance., Comment: AAAI 2024
Published: 2024

21. Fruit recognition, task plan, and control for apple harvesting robots/Reconhecimento de frutas, plano de tarefas e controle para robos de colheita de macas

Author: Yang, Huawei, Wu, Jie, Liang, Aifeng, Wang, Shaowei, Yan, Yinfa, Zhang, Hongjian, Li, Ning, Liu, Yinzeng, Wang, Jinxing, and Qiu, Jianfeng
Published: 2024
Full Text: View/download PDF

22. A fire risk pre-warning framework for high-rise buildings based on unascertained method

Author: Zhang, Li-Ning, Wang, Xiang-Ming, An, Jing, Li, Hong Xian, Guo, Jiao-Qian, Han, Guo-bo, and Gou, Peng-Fei
Published: 2024
Full Text: View/download PDF

23. Identification of JAZF1, KNOP1, and PLEKHA1 as causally associated genes and drug targets for Alzheimer’s disease: a summary data-based Mendelian randomization study

Author: Zhai, Yuhan, Li, Ning, Zhang, Yujie, Li, Haibin, Wu, Lijuan, Wei, Cuibai, Ji, Jianguang, and Zheng, Deqiang
Published: 2024
Full Text: View/download PDF

24. Valorization of quinoa by-products and their potential applications

Author: Zhang, Guangjie, Ren, Chenxi, Li, Ning, Chu, Ganghui, Yusuf, Abdulla, Zhao, Hongmei, and Guan, Tianzhu
Published: 2024
Full Text: View/download PDF

25. Post-transplant cyclophosphamide with post-engraftment anti-thymocyte globulin reduce moderate to severe chronic graft-versus-host disease in peripheral stem cell transplantation from HLA-matched unrelated and haploidentical donors

Author: Wang, Ying, Gao, Wen-Hui, Wang, Li-ning, Wang, Ling, Jiang, Jie-ling, Wan, Ming, Liang, Ai-Bin, Blaise, Didier, and Hu, Jiong
Published: 2024
Full Text: View/download PDF

26. Systemic therapy in the management of metastatic or locally recurrent adenoid cystic carcinoma of the salivary glands: a systematic review of the last decade

Author: Zhou, Jiawei, Zhao, Guo, Wang, Shuhang, and Li, Ning
Published: 2024
Full Text: View/download PDF

27. Bracket-type Vertical Load Transfer System for Heavy-load Column Strengthening: Design and Application

Author: Guo, Xiuhua, Song, Zhiwei, Ke, Guoju, and Li, Ning
Published: 2024
Full Text: View/download PDF

28. An in-depth analysis of postoperative insomnia in elderly patients and its implications on rehabilitation

Author: Wang, Yuanqing, Wang, Tianlong, Feng, Shuai, Li, Ning, Zhang, Yimeng, Cheng, Yueyang, Wu, Hao, and Zhan, Shuqin
Published: 2024
Full Text: View/download PDF

29. Multimodal remote sensing image registration based on adaptive multi-scale PIIFD

Author: Li, Ning, Li, Yuxuan, and Jiao, Jichao
Published: 2024
Full Text: View/download PDF

30. Contributions of cognitive flexibility to reading comprehension in chinese beginning readers

Author: Xu, Zhengye, Wang, Li-Chih, Chung, Kevin Kien Hoa, Zhang, Xinyong, Li, Ning, and Liu, Duo
Published: 2024
Full Text: View/download PDF

31. PDSE-YOLOv8: a lightweight detection method for internal defects in asphalt roads

Author: Li, Ning, Zhang, Wenliang, Liu, Zhaoxu, Liu, Kaifeng, Wang, Junjie, and Zhang, Fan
Published: 2024
Full Text: View/download PDF

32. Does emotion regulation mediate the relationship between interoception/alexithymia and co-morbid depression/anxiety? Evidence from two independent samples

Author: Li, Ning, Chuning, Anne E., Durham, Michelle R., Killgore, William D. S., and Smith, Ryan
Published: 2024
Full Text: View/download PDF

33. Dissecting the vascular-cognitive nexus: energetic vs. conventional hemodynamic parameters

Author: Cheng, Hao-Min, Wang, Jiun-Jr, Chuang, Shao-Yuan, Lin, Chen-Hua, Mitchell, Gary F., Huang, Chi-Jung, Wang, Pei-Ning, Chung, Chih-Ping, Chen, Liang-Kung, Pan, Wen-Harn, Peng, Li-Ning, and Chen, Chen-Huan
Published: 2024
Full Text: View/download PDF

34. Generic Enrichment Method for Liquid Chromatography-Multiple Reaction Monitoring-Mass Spectrometry Assay for Quantitative Measurement of Biological Therapeutics in Serum

Author: Zhang, Sisi, Xiao, Hui, and Li, Ning
Published: 2024
Full Text: View/download PDF

35. A soft gripper driven by conical dielectric elastomer actuator to achieve displacement amplification and compliant grips

Author: Li, Ning, Xue, Yanwen, Li, Yajiao, Liu, Changhao, Du, Qingyuan, Huang, Yao, Jiang, Yingjie, and Sun, Jingyao
Published: 2024
Full Text: View/download PDF

36. A Numerical Model for the Bearing Capacity of Anchoring Interface in Composite Rock Mass by A Modified Nonlinear Bond-Slip Model

Author: Li, Ning, Gong, Peng, Du, Weisheng, Huang, Qiang, Wei, Yanqing, Gao, Fuqiang, Xie, Bing, Du, Zhigang, and Huang, Zhiquan
Published: 2024
Full Text: View/download PDF

37. TROP2 promotes the proliferation of triple-negative breast cancer cells via calcium ion-dependent ER stress signaling pathway

Author: Li, Ning, Xu, Jianzhong, Yan, Xi, Liu, Qing, and Zhang, Mingqi
Published: 2024
Full Text: View/download PDF

38. Daphnane diterpenoid orthoesters with an odd-numbered aliphatic side chain from Daphne pedunculata

Author: Tan, Lingjian, Otsuki, Kouharu, Kikuchi, Takashi, Zhou, Di, Li, Ning, Huang, Li, Chen, Chin-Ho, and Li, Wei
Published: 2024
Full Text: View/download PDF

39. Enhanced Methods for Evaluating Water-inrush Risk from Underlying Aquifers: Incorporating Dynamic Weight Theory and Uncertainty Analysis Model

Author: Li, Ning and Du, Wenfeng
Published: 2024
Full Text: View/download PDF

40. Active Surveillance of Adverse Events Following Human Papillomavirus Vaccination: Feasibility Pilot Study Based on the Regional Health Care Information Platform in the City of Ningbo, China

Author: Liu, Zhike, Zhang, Liang, Yang, Yu, Meng, Ruogu, Fang, Ting, Dong, Ying, Li, Ning, Xu, Guozhang, and Zhan, Siyan
Subjects: Computer applications to medicine. Medical informatics, R858-859.7, Public aspects of medicine, RA1-1270
Abstract: BackgroundComprehensive safety data for vaccines from post-licensure surveillance, especially active surveillance, could guide administrations and individuals to make reasonable decisions on vaccination. Therefore, we designed a pilot study to assess the capability of a regional health care information platform to actively monitor the safety of a newly licensed vaccine. ObjectiveThis study aimed to conduct active surveillance of human papillomavirus (HPV) vaccine safety based on this information platform. MethodsIn 2017, one of China’s most mature information platforms with superior data linkage was selected. A structured questionnaire and open-ended interview guidelines were developed to investigate the feasibility of active surveillance following HPV vaccination using the regional health care information platform in Ningbo. The questionnaire was sent to participants via email, and a face-to-face interview was conducted to confirm details or resolve discrepancies. ResultsFive databases that could be considered essential to active surveillance of vaccine safety were integrated into the platform starting in 2015. Except for residents' health records, which had a coverage rate of 87%, the data sources covered more than 95% of the records that were documented in Ningbo. All the data could be inherently linked using the national identity card. There were 19,328 women who received the HPV vaccine, and 37,988 doses were administered in 2017 and 2018. Women aged 30-40 years accounted for the largest proportion. Quadrivalent vaccination accounted for 73.1% of total vaccination, a much higher proportion than that of bivalent vaccination. Of the first doses, 60 (60/19,328, 0.31%) occurred outside Ningbo. There were no missing data for vaccination-relevant variables, such as identity card, vaccine name, vaccination doses, vaccination date, and manufacturer. ICD-10 coding could be used to identify 9,180 cases using a predefined list of the outcomes of interest, and 1.88% of these cases were missing the identity card. During the 90 days following HPV vaccination, 4 incident cases were found through the linked vaccination history and electronic medical records. The combined incident rate of rheumatoid arthritis, optic neuritis, and Henoch-Schonlein purpura was 8.84/100,000 doses of bivalent HPV, and the incidence rate of rheumatoid arthritis was 3.75/100,000 doses of quadrivalent HPV. ConclusionsThis study presents an available approach to initiate an active surveillance system for adverse events following HPV vaccination, based on a regional health care information platform in China. An extended observation period or the inclusion of additional functional sites is warranted to conduct future hypothesis-generating and hypothesis-confirming studies for vaccine safety concerns.
Published: 2020
Full Text: View/download PDF

41. Development and Assessment of a Gastroscopy Electronic Learning System for Primary Learners: Randomized Controlled Trial

Author: Li, Shuang, Li, Guoqing, Liu, Ying, Xu, Wanying, Yang, Ningning, Chen, Haoyuan, Li, Ning, Luo, Kunpeng, and Jin, Shizhu
Subjects: Computer applications to medicine. Medical informatics, R858-859.7, Public aspects of medicine, RA1-1270
Abstract: BackgroundEndoscopic examination is a popular and routine procedure for the diagnosis and treatment of gastrointestinal (GI) diseases. Skilled endoscopists are in great demand in clinical practice, but the training process for beginners to become endoscopy specialists is fairly long. Convenience and a self-paced, learner-centered approach make electronic learning (e-learning) an excellent instructional prospect. ObjectiveThis study aimed to develop and apply an e-learning system in gastroscopy teaching and learning and to evaluate its effectiveness and user satisfaction. MethodsThe e-learning software Gastroscope Roaming System was developed for primary learners. The system simulates the real structure of the upper gastrointestinal (UGI) tract to teach the main characteristics of gastroscopy under both normal conditions and conditions of common UGI tract diseases. A randomized controlled trial was conducted. Participants were randomly allocated to an e-learning group (EG)or a non–e-learning control group after a pretest. On completing the training, participants undertook a posttest and gastroscopy examination. In addition, the EG completed a satisfaction questionnaire. ResultsOf the 44 volunteers, 41 (93%) completed the gastroscopy learning and testing components. No significant pretest differences were found between the intervention and control groups (mean 50.86, SD 6.12 vs mean 50.76, SD 6.88; P=.96). After 1 month of learning, the EG’s posttest scores were higher (mean 83.70, SD 5.99 vs mean 78.76, SD 7.58; P=.03) and improved more (P=.01) than those of the control group, with better performance in the gastroscopy examination (mean 91.05, SD 4.58 vs mean 84.38, SD 5.19; P
Published: 2020
Full Text: View/download PDF

42. Security-Sensitive Task Offloading in Integrated Satellite-Terrestrial Networks

Author: Lan, Wenjun, Chen, Kongyang, Cao, Jiannong, Li, Yikai, Li, Ning, Chen, Qi, and Sahni, Yuvraj
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Cryptography and Security, Computer Science - Networking and Internet Architecture
Abstract: With the rapid development of sixth-generation (6G) communication technology, global communication networks are moving towards the goal of comprehensive and seamless coverage. In particular, low earth orbit (LEO) satellites have become a critical component of satellite communication networks. The emergence of LEO satellites has brought about new computational resources known as the \textit{LEO satellite edge}, enabling ground users (GU) to offload computing tasks to the resource-rich LEO satellite edge. However, existing LEO satellite computational offloading solutions primarily focus on optimizing system performance, neglecting the potential issue of malicious satellite attacks during task offloading. In this paper, we propose the deployment of LEO satellite edge in an integrated satellite-terrestrial networks (ISTN) structure to support \textit{security-sensitive computing task offloading}. We model the task allocation and offloading order problem as a joint optimization problem to minimize task offloading delay, energy consumption, and the number of attacks while satisfying reliability constraints. To achieve this objective, we model the task offloading process as a Markov decision process (MDP) and propose a security-sensitive task offloading strategy optimization algorithm based on proximal policy optimization (PPO). Experimental results demonstrate that our algorithm significantly outperforms other benchmark methods in terms of performance.
Published: 2024

43. Fault-tolerant Quantum Chemical Calculations with Improved Machine-Learning Models

Author: Yuan, Kai, Zhou, Shuai, Li, Ning, Li, Tianyan, Ding, Bowen, Guo, Danhuai, and Ma, Yingjin
Subjects: Physics - Chemical Physics
Abstract: Easy and effective usage of computational resources is crucial for scientific calculations. Following our recent work of machine-learning (ML) assisted scheduling optimization [Ref: J. Comput. Chem. 2023, 44, 1174], we further propose 1) the improve ML models for the better predictions of computational loads, and as such, more elaborate load-balancing calculations can be expected; 2) the idea of coded computation, i.e. the integration of gradient coding, in order to introduce fault tolerance during the distributed calculations; and 3) their applications together with re-normalized exciton model with time-dependent density functional theory (REM-TDDFT) for calculating the excited states. Illustrated benchmark calculations include P38 protein, and solvent model with one or several excitable centers. The results show that the improved ML-assisted coded calculations can further improve the load-balancing and cluster utilization, and owing primarily profit in fault tolerance that aiming at the automated quantum chemical calculations for both ground and excited states., Comment: 39 pages
Published: 2024
Full Text: View/download PDF

44. OkayPlan: Obstacle Kinematics Augmented Dynamic Real-time Path Planning via Particle Swarm Optimization

Author: Xin, Jinghao, Kim, Jinwoo, Chu, Shengjia, and Li, Ning
Subjects: Computer Science - Robotics
Abstract: Existing Global Path Planning (GPP) algorithms predominantly presume planning in static environments. This assumption immensely limits their applications to Unmanned Surface Vehicles (USVs) that typically navigate in dynamic environments. To address this limitation, we present OkayPlan, a GPP algorithm capable of generating safe and short paths in dynamic scenarios at a real-time executing speed (125 Hz on a desktop-class computer). Specifically, we approach the challenge of dynamic obstacle avoidance by formulating the path planning problem as an Obstacle Kinematics Augmented Optimization Problem (OKAOP), which can be efficiently resolved through a PSO-based optimizer at a real-time speed. Meanwhile, a Dynamic Prioritized Initialization (DPI) mechanism that adaptively initializes potential solutions for the optimization problem is established to further ameliorate the solution quality. Additionally, a relaxation strategy that facilitates the autonomous tuning of OkayPlan's hyperparameters in dynamic environments is devised. Comprehensive experiments, including comparative evaluations, ablation studies, and \textcolor{black}{applications to 3D physical simulation platforms}, have been conducted to substantiate the efficacy of our approach. Results indicate that OkayPlan outstrips existing methods in terms of path safety, length optimality, and computational efficiency, establishing it as a potent GPP technique for dynamic environments. The video and code associated with this paper are accessible at https://github.com/XinJingHao/OkayPlan., Comment: 19 pages, 17 figures, 9 tables
Published: 2024
Full Text: View/download PDF

45. Content-Conditioned Generation of Stylized Free hand Sketches

Author: Liu, Jiajun, Wang, Siyuan, Zhu, Guangming, Zhang, Liang, Li, Ning, and Gao, Eryang
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: In recent years, the recognition of free-hand sketches has remained a popular task. However, in some special fields such as the military field, free-hand sketches are difficult to sample on a large scale. Common data augmentation and image generation techniques are difficult to produce images with various free-hand sketching styles. Therefore, the recognition and segmentation tasks in related fields are limited. In this paper, we propose a novel adversarial generative network that can accurately generate realistic free-hand sketches with various styles. We explore the performance of the model, including using styles randomly sampled from a prior normal distribution to generate images with various free-hand sketching styles, disentangling the painters' styles from known free-hand sketches to generate images with specific styles, and generating images of unknown classes that are not in the training set. We further demonstrate with qualitative and quantitative evaluations our advantages in visual quality, content accuracy, and style imitation on SketchIME., Comment: 6 pages, 7 figures, ICSMD
Published: 2024

46. A multimodal gesture recognition dataset for desktop human-computer interaction

Author: Wang, Qi, Zhu, Fengchao, Zhu, Guangming, Zhang, Liang, Li, Ning, and Gao, Eryang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Gesture recognition is an indispensable component of natural and efficient human-computer interaction technology, particularly in desktop-level applications, where it can significantly enhance people's productivity. However, the current gesture recognition community lacks a suitable desktop-level (top-view perspective) dataset for lightweight gesture capture devices. In this study, we have established a dataset named GR4DHCI. What distinguishes this dataset is its inherent naturalness, intuitive characteristics, and diversity. Its primary purpose is to serve as a valuable resource for the development of desktop-level portable applications. GR4DHCI comprises over 7,000 gesture samples and a total of 382,447 frames for both Stereo IR and skeletal modalities. We also address the variances in hand positioning during desktop interactions by incorporating 27 different hand positions into the dataset. Building upon the GR4DHCI dataset, we conducted a series of experimental studies, the results of which demonstrate that the fine-grained classification blocks proposed in this paper can enhance the model's recognition accuracy. Our dataset and experimental findings presented in this paper are anticipated to propel advancements in desktop-level gesture recognition research.
Published: 2024

47. Explicit Visual Prompts for Visual Object Tracking

Author: Shi, Liangtao, Zhong, Bineng, Liang, Qihua, Li, Ning, Zhang, Shengping, and Li, Xianxian
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: How to effectively exploit spatio-temporal information is crucial to capture target appearance changes in visual tracking. However, most deep learning-based trackers mainly focus on designing a complicated appearance model or template updating strategy, while lacking the exploitation of context between consecutive frames and thus entailing the \textit{when-and-how-to-update} dilemma. To address these issues, we propose a novel explicit visual prompts framework for visual tracking, dubbed \textbf{EVPTrack}. Specifically, we utilize spatio-temporal tokens to propagate information between consecutive frames without focusing on updating templates. As a result, we cannot only alleviate the challenge of \textit{when-to-update}, but also avoid the hyper-parameters associated with updating strategies. Then, we utilize the spatio-temporal tokens to generate explicit visual prompts that facilitate inference in the current frame. The prompts are fed into a transformer encoder together with the image tokens without additional processing. Consequently, the efficiency of our model is improved by avoiding \textit{how-to-update}. In addition, we consider multi-scale information as explicit visual prompts, providing multiscale template features to enhance the EVPTrack's ability to handle target scale changes. Extensive experimental results on six benchmarks (i.e., LaSOT, LaSOT\rm $_{ext}$, GOT-10k, UAV123, TrackingNet, and TNL2K.) validate that our EVPTrack can achieve competitive performance at a real-time speed by effectively exploiting both spatio-temporal and multi-scale information. Code and models are available at https://github.com/GXNU-ZhongLab/EVPTrack.
Published: 2024

48. Mobility and Cost Aware Inference Accelerating Algorithm for Edge Intelligence

Author: Yuan, Xin, Li, Ning, Wei, kang, Xu, Wenchao, Chen, Quan, Chen, Hao, and Guo, Song
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Artificial Intelligence
Abstract: The edge intelligence (EI) has been widely applied recently. Spliting the model between device, edge server, and cloud can improve the performance of EI greatly. The model segmentation without user mobility has been investigated deeply by previous works. However, in most use cases of EI, the end devices are mobile. Only a few works have been carried out on this aspect. These works still have many issues, such as ignoring the energy consumption of mobile device, inappropriate network assumption, and low effectiveness on adaptiving user mobility, etc. Therefore, for addressing the disadvantages of model segmentation and resource allocation in previous works, we propose mobility and cost aware model segmentation and resource allocation algorithm for accelerating the inference at edge (MCSA). Specfically, in the scenario without user mobility, the loop interation gradient descent (Li-GD) algorithm is provided. When the mobile user has a large model inference task needs to be calculated, it will take the energy consumption of mobile user, the communication and computing resource renting cost, and the inference delay into account to find the optimal model segmentation and resource allocation strategy. In the scenario with user mobility, the mobiity aware Li-GD (MLi-GD) algorithm is proposed to calculate the optimal strategy. Then, the properties of the proposed algorithms are investigated, including convergence, complexity, and approximation ratio. The experimental results demonstrate the effectiveness of the proposed algorithms., Comment: 17 pages, 16 figures. arXiv admin note: substantial text overlap with arXiv:2312.15850
Published: 2023

49. High Efficiency Inference Accelerating Algorithm for NOMA-based Mobile Edge Computing

Author: Yuan, Xin, Li, Ning, Zhang, Tuo, Li, Muqing, Chen, Yuwen, Ortega, Jose Fernan Martinez, and Guo, Song
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Artificial Intelligence
Abstract: Splitting the inference model between device, edge server, and cloud can improve the performance of EI greatly. Additionally, the non-orthogonal multiple access (NOMA), which is the key supporting technologies of B5G/6G, can achieve massive connections and high spectrum efficiency. Motivated by the benefits of NOMA, integrating NOMA with model split in MEC to reduce the inference latency further becomes attractive. However, the NOMA based communication during split inference has not been properly considered in previous works. Therefore, in this paper, we integrate the NOMA into split inference in MEC, and propose the effective communication and computing resource allocation algorithm to accelerate the model inference at edge. Specifically, when the mobile user has a large model inference task needed to be calculated in the NOMA-based MEC, it will take the energy consumption of both device and edge server and the inference latency into account to find the optimal model split strategy, subchannel allocation strategy (uplink and downlink), and transmission power allocation strategy (uplink and downlink). Since the minimum inference delay and energy consumption cannot be satisfied simultaneously, and the variables of subchannel allocation and model split are discrete, the gradient descent (GD) algorithm is adopted to find the optimal tradeoff between them. Moreover, the loop iteration GD approach (Li-GD) is proposed to reduce the complexity of GD algorithm that caused by the parameter discrete. Additionally, the properties of the proposed algorithm are also investigated, which demonstrate the effectiveness of the proposed algorithms., Comment: 13 pages, 11 figures
Published: 2023

50. FODT: Fast, Online, Distributed and Temporary Failure Recovery Approach for MEC

Author: Yuan, Xin, Li, Ning, Zhang, Zhaoxin, Chen, Quan, and Martinez, Jose Fernan
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Networking and Internet Architecture
Abstract: Mobile edge computing (MEC) can reduce the latency of cloud computing successfully. However, the edge server may fail due to the hardware of software issues. When the edge server failure happens, the users who offload tasks to this server will be affected. How to recover the services for these affected users quickly and effectively is challenging. Moreover, considering that the server failure is continuous and temporary, and the failed server can be repaired, the previous works cannot handle this problem effectively. Therefore, in this paper, we propose the fast, online, distributed, and temporary failure recovery algorithm (FODT) for MEC. In FODT, when edge sever failure happens, only the affected APs recalculate their user-server allocation strategies and the other APs do not change their strategies. For the affected access points (Aps), the strategies before server failure are reused to reduce complexity and latency. When the failed server is repaired, the influenced APs reuse the strategies before server failure to offload task to this server. Based on this approach, the FODT can achieve better performance than previous works. To the best of knowledge, the FODT is the first failure recovery algorithm, and when compared with previous research, it has higher failure recovery efficiency and lower complexity with acceptable approximate ratio., Comment: 12 pages, 7 figures
Published: 2023

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

42,546 results on '"LI, Ning"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources