Author: "Chen, Xianzhang" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Chen, Xianzhang"' showing total 216 results

1. Finding Visual Saliency in Continuous Spike Stream

Author: Zhu, Lin, Chen, Xianzhang, Wang, Xiao, and Huang, Hua
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: As a bio-inspired vision sensor, the spike camera emulates the operational principles of the fovea, a compact retinal region, by employing spike discharges to encode the accumulation of per-pixel luminance intensity. Leveraging its high temporal resolution and bio-inspired neuromorphic design, the spike camera holds significant promise for advancing computer vision applications. Saliency detection mimics the behavior of human beings and captures the most salient region from the scenes. In this paper, we investigate the visual saliency in the continuous spike stream for the first time. To effectively process the binary spike stream, we propose a Recurrent Spiking Transformer (RST) framework, which is based on a full spiking neural network. Our framework enables the extraction of spatio-temporal features from the continuous spatio-temporal spike stream while maintaining low power consumption. To facilitate the training and validation of our proposed model, we build a comprehensive real-world spike-based visual saliency dataset, enriched with numerous light conditions. Extensive experiments demonstrate the superior performance of our Recurrent Spiking Transformer framework in comparison to other spike neural network-based methods. Our framework exhibits a substantial margin of improvement in capturing and highlighting visual saliency in the spike stream, which not only provides a new perspective for spike-based saliency segmentation but also shows a new paradigm for full SNN-based transformer models. The code and dataset are available at https://github.com/BIT-Vision/SVS.
Comment: Accepted by AAAI 2024
Published: 2024

2. An Efficient Scheduling Algorithm for Multi-mode Tasks on Near-Data Processing SSDs

Author: Li, Guo, Chen, Xianzhang, Liu, Duo, Li, Jiali, Tan, Yujuan, and Ren, Ao
Editor: Tari, Zahir, Li, Keqiu, and Wu, Hongyi
Editorial Board: Goos, Gerhard (Founding Editor), Hartmanis, Juris (Founding Editor), Bertino, Elisa, Gao, Wen, Steffen, Bernhard, and Yung, Moti
Published: 2024
Full Text: View/download PDF

3. Scanning gate microscopy in graphene nanostructures

Author: Chen, Xianzhang, Weick, Guillaume, Weinmann, Dietmar, and Jalabert, Rodolfo A.
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: The conductance of graphene nanoribbons and nanoconstrictions under the effect of a scanning gate microscopy tip is systematically studied. Using a scattering approach for noninvasive probes, the first- and second-order conductance corrections caused by the tip potential disturbance are expressed explicitly in terms of the scattering states of the unperturbed structure. Numerical calculations confirm the perturbative results, showing that the second-order term prevails in the conductance plateaus, exhibiting a universal scaling law for armchair graphene strips. For stronger tips, at specific probe potential widths and strengths beyond the perturbative regime, the conductance corrections reveal the appearance of resonances originating from states trapped below the tip. The zero-transverse-energy mode of an armchair metallic strip is shown to be insensitive to the long-range electrostatic potential of the probe. For nanoconstrictions defined on a strip, scanning gate microscopy provides insight into the breakdown of conductance quantization. The first-order correction generically dominates at low tip strength, while for Fermi energies associated with faint conductance plateaus, the second-order correction becomes dominant for relatively small potential tip strengths. In accordance with the spatial dependence of the partial local density of states, the largest tip effect occurs in the central part of the constriction, close to the edges. Nanoribbons and nanoconstrictions with zigzag edges exhibit a response similar to that of armchair nanostructures, except when the intervalley coupling induced by the tip potential destroys the chiral edge states.
Comment: 21 pages, 16 figures
Published: 2022
Full Text: View/download PDF

4. An Efficient Scheduling Algorithm for Multi-mode Tasks on Near-Data Processing SSDs

Author: Li, Guo, primary, Chen, Xianzhang, additional, Liu, Duo, additional, Li, Jiali, additional, Tan, Yujuan, additional, and Ren, Ao, additional
Published: 2024
Full Text: View/download PDF

5. BGS: Accelerate GNN training on multiple GPUs

Author: Tan, Yujuan, Bai, Zhuoxin, Liu, Duo, Zeng, Zhaoyang, Gan, Yan, Ren, Ao, Chen, Xianzhang, and Zhong, Kan
Published: 2024
Full Text: View/download PDF

6. CEIU: Consistent and Efficient Incremental Update mechanism for mobile systems on flash storage

Author: Lei, Ruiqing, Chen, Xianzhang, Liu, Duo, Song, Chunlin, Tan, Yujuan, and Ren, Ao
Published: 2024
Full Text: View/download PDF

7. Flexible Clustered Federated Learning for Client-Level Data Distribution Shift

Author: Duan, Moming, Liu, Duo, Ji, Xinyuan, Wu, Yu, Liang, Liang, Chen, Xianzhang, and Tan, Yujuan
Subjects: Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: Federated Learning (FL) enables multiple participating devices to collaboratively contribute to a global neural network model while keeping the training data local. Unlike the centralized training setting, the non-IID, imbalanced (statistical heterogeneity) and distribution-shifted training data of FL is distributed in the federated network, which will increase the divergences between the local models and the global model, further degrading performance. In this paper, we propose a flexible clustered federated learning (CFL) framework named FlexCFL, in which we 1) group the training of clients based on the similarities between the clients' optimization directions for lower training divergence; 2) implement an efficient newcomer device cold start mechanism for framework scalability and practicality; 3) flexibly migrate clients to meet the challenge of client-level data distribution shift. FlexCFL can achieve improvements by dividing joint optimization into groups of sub-optimization and can strike a balance between accuracy and communication efficiency in the distribution shift environment. The convergence and complexity are analyzed to demonstrate the efficiency of FlexCFL. We also evaluate FlexCFL on several open datasets and make comparisons with related CFL frameworks. The results show that FlexCFL can significantly improve absolute test accuracy by +10.6% on FEMNIST compared to FedAvg, +3.5% on FashionMNIST compared to FedProx, and +8.4% on MNIST compared to FeSEM. The experiment results show that FlexCFL is also communication efficient in the distribution shift environment.
Comment: Manuscript under review. arXiv admin note: substantial text overlap with arXiv:2010.06870
Published: 2021
Full Text: View/download PDF

8. CSAFL: A Clustered Semi-Asynchronous Federated Learning Framework

Author: Zhang, Yu, Duan, Moming, Liu, Duo, Li, Li, Ren, Ao, Chen, Xianzhang, Tan, Yujuan, and Wang, Chengliang
Subjects: Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: Federated learning (FL) is an emerging distributed machine learning paradigm that protects privacy and tackles the problem of isolated data islands. At present, there are two main communication strategies of FL: synchronous FL and asynchronous FL. The advantages of synchronous FL are that the model has high precision and fast convergence speed. However, this synchronous communication strategy has the risk that the central server waits too long for the devices, namely, the straggler effect, which has a negative impact on some time-critical applications. Asynchronous FL has a natural advantage in mitigating the straggler effect, but there are threats of model quality degradation and server crash. Therefore, we combine the advantages of these two strategies to propose a clustered semi-asynchronous federated learning (CSAFL) framework. We evaluate CSAFL based on four imbalanced federated datasets in a non-IID setting and compare CSAFL to the baseline methods. The experimental results show that CSAFL significantly improves test accuracy by more than +5% on the four datasets compared to TA-FedAvg. In particular, CSAFL improves absolute test accuracy by +34.4% on non-IID FEMNIST compared to TA-FedAvg.
Comment: This paper will be presented at IJCNN 2021
Published: 2021
Full Text: View/download PDF

9. FedSAE: A Novel Self-Adaptive Federated Learning Framework in Heterogeneous Systems

Author: Li, Li, Duan, Moming, Liu, Duo, Zhang, Yu, Ren, Ao, Chen, Xianzhang, Tan, Yujuan, and Wang, Chengliang
Subjects: Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: Federated Learning (FL) is a novel distributed machine learning paradigm which allows thousands of edge devices to train a model locally without uploading data concentrically to the server. But since real federated settings are resource-constrained, FL encounters systems heterogeneity, which directly causes many stragglers and then indirectly leads to a significant accuracy reduction. To solve the problems caused by systems heterogeneity, we introduce a novel self-adaptive federated framework, FedSAE, which adjusts the training task of devices automatically and selects participants actively to alleviate the performance degradation. In this work, we 1) propose FedSAE, which leverages the complete information of devices' historical training tasks to predict the affordable training workloads for each device. In this way, FedSAE can estimate the reliability of each device and self-adaptively adjust the amount of training load per client in each round. 2) combine our framework with Active Learning to self-adaptively select participants. Then the framework accelerates the convergence of the global model. In our framework, the server evaluates devices' value of training based on their training loss. Then the server selects those clients with bigger value for the global model to reduce communication overhead. The experimental result indicates that in a highly heterogeneous system, FedSAE converges faster than FedAvg, the vanilla FL framework. Furthermore, FedSAE outperforms FedAvg on several federated datasets: FedSAE improves test accuracy by 26.7% and reduces stragglers by 90.3% on average.
Comment: This paper will be presented at IJCNN 2021
Published: 2021
Full Text: View/download PDF

10. FedGroup: Efficient Clustered Federated Learning via Decomposed Data-Driven Measure

Author: Duan, Moming, Liu, Duo, Ji, Xinyuan, Liu, Renping, Liang, Liang, Chen, Xianzhang, and Tan, Yujuan
Subjects: Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: Federated Learning (FL) enables multiple participating devices to collaboratively contribute to a global neural network model while keeping the training data local. Unlike the centralized training setting, the non-IID and imbalanced (statistical heterogeneity) training data of FL is distributed in the federated network, which will increase the divergences between the local models and the global model, further degrading performance. In this paper, we propose a novel clustered federated learning (CFL) framework, FedGroup, in which we 1) group the training of clients based on the similarities between the clients' optimization directions for high training performance; 2) construct a new data-driven distance measure to improve the efficiency of the client clustering procedure; 3) implement a newcomer device cold start mechanism based on the auxiliary global model for framework scalability and practicality. FedGroup can achieve improvements by dividing joint optimization into groups of sub-optimization and can be combined with the FL optimizer FedProx. The convergence and complexity are analyzed to demonstrate the efficiency of our proposed framework. We also evaluate FedGroup and FedGrouProx (combined with FedProx) on several open datasets and make comparisons with related CFL frameworks. The results show that FedGroup can significantly improve absolute test accuracy by +14.1% on FEMNIST compared to FedAvg, +3.4% on Sentiment140 compared to FedProx, and +6.9% on MNIST compared to FeSEM.
Comment: This work will be presented at IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA) 2021. NOTE: This revision contains a crucial correction of the client cold start mechanism, please discard all previous manuscripts
Published: 2020
Full Text: View/download PDF

11. FSR: A host-storage collaborative mechanism for data path optimization of NDP operations

Author: Sun, Qiao, Chen, Xianzhang, Li, Lin, Wang, Jiapin, and Liu, Shukan
Published: 2023
Full Text: View/download PDF

12. Astraea: Self-balancing Federated Learning for Improving Classification Accuracy of Mobile Deep Learning Applications

Author: Duan, Moming, Liu, Duo, Chen, Xianzhang, Tan, Yujuan, Ren, Jinting, Qiao, Lei, and Liang, Liang
Subjects: Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing, Statistics - Machine Learning
Abstract: Federated learning (FL) is a distributed deep learning method which enables multiple participants, such as mobile phones and IoT devices, to contribute to a neural network model while their private training data remains on local devices. This distributed approach is promising in edge computing systems, which have a large corpus of decentralized data and require high privacy. However, unlike the common training dataset, the data distribution of the edge computing system is imbalanced, which will introduce biases in the model training and cause a decrease in accuracy of federated learning applications. In this paper, we demonstrate that the imbalanced distributed training data will cause accuracy degradation in FL. To counter this problem, we build a self-balancing federated learning framework called Astraea, which alleviates the imbalances by 1) global data distribution based data augmentation, and 2) mediator based multi-client rescheduling. The proposed framework relieves global imbalance by runtime data augmentation, and for averaging the local imbalance, it creates the mediator to reschedule the training of clients based on Kullback-Leibler divergence (KLD) of their data distribution. Compared with FedAvg, the state-of-the-art FL algorithm, Astraea shows +5.59% and +5.89% improvement of top-1 accuracy on the imbalanced EMNIST and imbalanced CINIC-10 datasets, respectively. Meanwhile, the communication traffic of Astraea can be 82% lower than that of FedAvg.
Comment: Published as a conference paper at IEEE 37th International Conference on Computer Design (ICCD) 2019
Published: 2019
Full Text: View/download PDF

13. Transport signatures of relativistic quantum scars in a graphene cavity

Author: Zhang, G. Q., Chen, Xianzhang, Lin, Li, Peng, Hailin, Liu, Zhongfan, Huang, Liang, Kang, N., and Xu, H. Q.
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: We study, by quantum transport measurements and theoretical calculations, a relativistic quantum cavity system realized by etching a graphene sheet. The conductance of the graphene cavity has been measured as a function of the back gate voltage (or the Fermi energy) and the magnetic field applied perpendicular to the graphene sheet, and characteristic conductance contour patterns are observed in the measurements. In particular, two types of high conductance contour lines, i.e., straight and parabolic-like high conductance contour lines, are found in the measurements. The theoretical calculations are performed within the framework of the tight-binding approach and Green's function formalism. Characteristic high conductance contour features similar to those in the experiments are found in the calculations. The wave functions calculated at points selected along a straight conductance contour line are found to be dominated by a chain of scars of high probability distributions arranged as a necklace following the shape of the cavity, and the current density distributions calculated at these points are dominated by an overall vortex in the cavity. These characteristics are found to be insensitive to increasing magnetic field. However, the wave function probability distributions and the current density distributions calculated at points selected along a parabolic-like contour line show a clear dependence on increasing magnetic field, and the current density distributions at these points are characterized by the complex formation of several localized vortices in the cavity. Our work brings a new insight into quantum chaos in relativistic particle systems and would greatly stimulate experimental and theoretical efforts towards this still emerging field.
Comment: 20 pages, 6 figures
Published: 2018
Full Text: View/download PDF

14. Federated learning with workload-aware client scheduling in heterogeneous systems

Author: Li, Li, Liu, Duo, Duan, Moming, Zhang, Yu, Ren, Ao, Chen, Xianzhang, Tan, Yujuan, and Wang, Chengliang
Published: 2022
Full Text: View/download PDF

15. Efficient persistent memory file systems using virtual superpages with multi-level allocator

Author: Yang, Chaoshu, Yu, Zhiwang, Zhang, Runyu, Nie, Shun, Li, Hui, Chen, Xianzhang, Long, Linbo, and Liu, Duo
Published: 2022
Full Text: View/download PDF

16. CoDiscard: A revenue model based cross-layer cooperative discarding mechanism for flash memory devices

Author: Feng, Xiaoliu, Chen, Xianzhang, Li, Ruolan, Li, Jiali, Song, Chunlin, Liu, Duo, Tan, Yujuan, and Qiao, Lei
Published: 2022
Full Text: View/download PDF

17. Towards highly-concurrent leaderless state machine replication for distributed systems

Author: Wang, Weilue, Tan, Yujuan, Wu, Changze, Liu, Duo, Wu, Yu, Luo, Longpan, and Chen, Xianzhang
Published: 2022
Full Text: View/download PDF

18. AIR Cache: A Variable-Size Block Cache Based on Fine-Grained Management Method

Author: Li, Yuxiong, Tan, Yujuan, Xu, Congcong, Liu, Duo, Chen, Xianzhang, Wang, Chengliang, Zhou, Mingliang, and U, Leong Hou
Editor: U, Leong Hou, Spaniol, Marc, Sakurai, Yasushi, and Chen, Junying
Editorial Board: Goos, Gerhard (Founding Editor), Hartmanis, Juris (Founding Editor), Bertino, Elisa, Gao, Wen, Steffen, Bernhard, Woeginger, Gerhard, and Yung, Moti
Published: 2021
Full Text: View/download PDF

19. A machine learning assisted data placement mechanism for hybrid storage systems

Author: Ren, Jinting, Chen, Xianzhang, Liu, Duo, Tan, Yujuan, Duan, Moming, Li, Ruolan, and Liang, Liang
Published: 2021
Full Text: View/download PDF

20. MobileRE: A replicas prioritized hybrid fault tolerance strategy for mobile distributed system

Author: Wu, Yu, Liu, Duo, Chen, Xianzhang, Ren, Jinting, Liu, Renping, Tan, Yujuan, and Zhang, Ziling
Published: 2021
Full Text: View/download PDF

21. FreePrune: An Automatic Pruning Framework Across Various Granularities Based on Training-Free Evaluation

Author: Tang, Miao, Liu, Ning, Yang, Tao, Fang, Haining, Lin, Qiu, Tan, Yujuan, Chen, Xianzhang, Liu, Duo, Zhong, Kan, and Ren, Ao
Abstract: Network pruning is an effective technique that reduces the computational costs of networks while maintaining accuracy. However, pruning requires expert knowledge and hyperparameter tuning, such as determining the pruning rate for each layer. Automatic pruning methods address this challenge by proposing an effective training-free metric to quickly evaluate the pruned network without fine-tuning. However, most existing automatic pruning methods only investigate a certain pruning granularity, and it remains unclear whether metrics benefit automatic pruning at different granularities. Neural architecture search also studies training-free metrics to accelerate network generation. Nevertheless, whether they apply to pruning needs further investigation. In this study, we first systematically analyze various advanced training-free metrics for various granularities in pruning, and then we investigate the correlation between the training-free metric score and the after-fine-tuned model accuracy. Based on the analysis, we propose the FreePrune score, a more general metric compatible with all pruning granularities. Aiming at generating high-quality pruned networks and unleashing the power of the FreePrune score, we further propose FreePrune, an automatic framework that can rapidly generate and evaluate the candidate networks, leading to a final pruned network with both high accuracy and pruning rate. Experiments show that our method achieves high correlation on various pruning granularities and comprehensively improves the accuracy.
Published: 2024
Full Text: View/download PDF

22. LightFS: A Lightweight Host-CSD Coordinated File System Optimizing for Heavy Small File Accesses

Author: Li, Jiali, Shen, Zhaoyan, Liu, Duo, Chen, Xianzhang, Zhong, Kan, Zeng, Zhaoyang, and Tan, Yujuan
Abstract: A computational storage drive (CSD) improves data processing efficiency by processing the data within the storage. However, existing CSDs rely on host-centric file systems to manage the data, where the layouts of files are retrieved by the host and sent to the CSD, resulting in additional I/O overhead and reduced processing efficiency, especially in heavy small file accesses. Moreover, the lack of consistency mechanisms poses potential consistency issues. To address these challenges, we propose LightFS, a lightweight host-CSD coordinated file system for CSD file management. To reduce task offloading overhead, LightFS builds an index file, .ndpmeta, which summarizes the files' metadata and is shared between the host and the CSD to enable the CSD to retrieve the file layout in storage directly. To ensure consistency, LightFS employs a metadata locker and an update synchronizer. The metadata locker leverages the out-of-place update feature of the flash to capture a snapshot of the file to be written without any data copy, while the update synchronizer triggers metadata updates by monitoring the addresses of written blocks to ensure that the modified file is successfully written to the CSD. We implement and evaluate LightFS on a real testbed, and the results demonstrate that LightFS achieves a 3.66× performance improvement on average in real-world operations.
Published: 2024
Full Text: View/download PDF

23. HydraFS: an efficient NUMA-aware in-memory file system

Author: Wu, Ting, Chen, Xianzhang, Liu, Kai, Xiao, Chunhua, Liu, Zhixiang, Zhuge, Qingfeng, and Sha, Edwin H.-M.
Published: 2020
Full Text: View/download PDF

24. AIR Cache: A Variable-Size Block Cache Based on Fine-Grained Management Method

Author: Li, Yuxiong, primary, Tan, Yujuan, additional, Xu, Congcong, additional, Liu, Duo, additional, Chen, Xianzhang, additional, Wang, Chengliang, additional, Zhou, Mingliang, additional, and U, Leong Hou, additional
Published: 2021
Full Text: View/download PDF

25. Data-Quality-Driven Federated Learning for Optimizing Communication Costs

Author: Fan, Xuehong, primary, Wu, Nanzhong, additional, Liu, Shukan, additional, Chen, Xianzhang, additional, Liu, Duo, additional, Tan, Yujuan, additional, and Ren, Ao, additional
Published: 2023
Full Text: View/download PDF

26. Re-compact: Structured Pruning and SpMM Kernel Co-design for Accelerating DNNs on GPUs

Author: Zhang, Yuling, primary, Ren, Ao, additional, Chen, Xianzhang, additional, Lin, Qiu, additional, Tan, Yujuan, additional, and Liu, Duo, additional
Published: 2023
Full Text: View/download PDF

27. UMFS: An efficient user-space file system for non-volatile memory

Author: Chen, Xianzhang, Sha, Edwin H.-M., Zhuge, Qingfeng, Wu, Ting, Jiang, Weiwen, Zeng, Xiaoping, and Wu, Lin
Published: 2018
Full Text: View/download PDF

28. RadarSSD: A Computational Storage for Radar Signal Processing

Author: Li, Jiali, primary, Chen, Xianzhang, additional, Liu, Duo, additional, Ren, Ao, additional, Zeng, Zhaoyang, additional, and Tan, Yujuan, additional
Published: 2023
Full Text: View/download PDF

29. HBP: Hierarchically Balanced Pruning and Accelerator Co-Design for Efficient DNN Inference

Author: Ren, Ao, primary, Wang, Yuhao, additional, Zhang, Tao, additional, Shi, Jiaxing, additional, Liu, Duo, additional, Chen, Xianzhang, additional, Tan, Yujuan, additional, and Xie, Yuan, additional
Published: 2023
Full Text: View/download PDF

30. Optimizing the Performance of NDP Operations by Retrieving File Semantics in Storage

Author: Li, Lin, primary, Chen, Xianzhang, additional, Li, Jiali, additional, Wang, Jiapin, additional, Liu, Duo, additional, Tan, Yujuan, additional, and Ren, Ao, additional
Published: 2023
Full Text: View/download PDF

31. Prevent Deadlock and Remove Blocking for Self-Timed Systems

Author: Sha, Edwin H.-M., Jiang, Weiwen, Zhuge, Qingfeng, Chen, Xianzhang, and Yang, Lei
Editor: Wang, Guojun, Zomaya, Albert, Martinez, Gregorio, and Li, Kenli
Series Editor: Hutchison, David, Kanade, Takeo, Kittler, Josef, Kleinberg, Jon M., Mattern, Friedemann, Mitchell, John C., Naor, Moni, Pandu Rangan, C., Steffen, Bernhard, Terzopoulos, Demetri, Tygar, Doug, and Weikum, Gerhard
Published: 2015
Full Text: View/download PDF

32. V-WAFA: An Endurance Variation Aware Fine-Grained Allocator for Persistent Memory

Author: Feng, Xiaoliu, primary, Chen, Xianzhang, additional, Zhuge, Qingfeng, additional, Liu, Duo, additional, Sha, Edwin H.-M., additional, and Xue, Chun Jason, additional
Published: 2023
Full Text: View/download PDF

33. A unified framework for designing high performance in-memory and hybrid memory file systems

Author: Chen, Xianzhang, Sha, Edwin H.-M., Zhuge, Qingfeng, Jiang, Weiwen, Chen, Junxi, Chen, Jun, and Xu, Jun
Published: 2016
Full Text: View/download PDF

34. Optimizing the Performance of Consistency-Aware Deduplication Using Persistent Memory

Author: Song, Chunlin, Chen, Xianzhang, Liu, Duo, Li, Jiali, Tan, Yujuan, and Ren, Ao
Abstract: Block-level data deduplication is a widely used technology for saving storage space by filtering the data blocks with the same hash value. However, existing block-level data deduplication approaches either ignore the data consistency of deduplication or suffer severe performance degradation for providing consistency guarantees. In this article, we propose Consistency-Aware Deduplication (CADedup+) to achieve high-performance block-level data deduplication with data consistency. The main idea of CADedup+ is to achieve an efficient journaling mechanism for deduplication by taking advantage of the features of persistent memory (PM), such as byte-addressability and near-DRAM access latency. To balance the tradeoffs between performance and consistency requirements in data deduplication, we carefully design three modes of journaling mechanism, i.e., writeback mode, ordered mode, and journal mode, for CADedup+. We properly place the deduplication metadata of CADedup+ onto the DRAM-PM hybrid memory architecture to minimize PM costs according to the features of metadata updates. The deduplication metadata on PM is managed by a set of metadata transactions and updated with the help of the efficient hardware atomic operations provided by the CPU. We implement CADedup+ in the generic block layer in Linux kernel 4.9.0. We conduct extensive experiments on Intel Optane PMEM to evaluate CADedup+ with typical benchmarks. Experimental results show that CADedup+ can reduce write volume by 63%–70% and I/O latency by 50%–60% compared with Dmdedup, a widely used open-source block-level data deduplication system, while ensuring deduplication consistency.
Published: 2024
Full Text: View/download PDF

35. Scanning gate microscopy in graphene nanostructures

Author: Chen, Xianzhang, primary, Weick, Guillaume, additional, Weinmann, Dietmar, additional, and Jalabert, Rodolfo A., additional
Published: 2023
Full Text: View/download PDF

36. Many-body spectral statistics of relativistic quantum billiard systems

Author: Chen, Xianzhang, primary, Chen, Zhen-Qi, additional, Huang, Liang, additional, Grebogi, Celso, additional, and Lai, Ying-Cheng, additional
Published: 2023
Full Text: View/download PDF

37. LFPR: A Lazy Fast Predictive Repair Strategy for Mobile Distributed Erasure Coded Cluster

Author: Wu, Yu, primary, Liu, Duo, additional, Tan, Yujuan, additional, Duan, Moming, additional, Luo, Longpan, additional, Wang, Weilve, additional, and Chen, Xianzhang, additional
Published: 2023
Full Text: View/download PDF

38. Horae: A Hybrid I/O Request Scheduling Technique for Near-Data Processing-Based SSD

Author: Li, Jiali, primary, Chen, Xianzhang, additional, Liu, Duo, additional, Li, Lin, additional, Wang, Jiapin, additional, Zeng, Zhaoyang, additional, Tan, Yujuan, additional, and Qiao, Lei, additional
Published: 2022
Full Text: View/download PDF

39. FRL: Fast and Reconfigurable Accelerator for Distributed Sound Source Localization

Author: Ding, Xiaofeng, primary, Wang, Chengliang, additional, Liu, Heping, additional, Zhang, Zhihai, additional, Chen, Xianzhang, additional, Tan, Yujuan, additional, Liu, Duo, additional, and Ren, Ao, additional
Published: 2022
Full Text: View/download PDF

40. eRDAC: Efficient and Reliable Remote Direct Access and Control for Embedded Systems

Author: Feng, Junjie, primary, Chen, Xianzhang, additional, Liu, Duo, additional, Zhang, Weigong, additional, Wang, Jiapin, additional, Zheng, Rongwei, additional, and Tan, Yujuan, additional
Published: 2022
Full Text: View/download PDF

41. SENTunnel: Fast Path for Sensor Data Access on Automotive Embedded Systems

Author: Zheng, Rongwei, primary, Chen, Xianzhang, additional, Liu, Duo, additional, Feng, Junjie, additional, Wang, Jiapin, additional, Ren, Ao, additional, Wang, Chengliang, additional, and Tan, Yujuan, additional
Published: 2022
Full Text: View/download PDF

42. VEA: An FPGA-Based Voxel Encoding Accelerator for 3D Object Detection with LiDAR

Author: Li, Xin, primary, Ren, Ao, additional, Tan, Yujuan, additional, Li, Xusheng, additional, Huang, Zhetong, additional, Wang, Chengliang, additional, Chen, Xianzhang, additional, and Liu, Duo, additional
Published: 2022
Full Text: View/download PDF

43. CADedup: High-performance Consistency-aware Deduplication Based on Persistent Memory

Author: Song, Chunlin, primary, Chen, Xianzhang, additional, Liu, Duo, additional, Feng, Xiaoliu, additional, Yu, Xi, additional, Li, Jiali, additional, Tan, Yujuan, additional, and Ren, Ao, additional
Published: 2022
Full Text: View/download PDF

44. Properties of Self-Timed Ring Architectures for Deadlock-Free and Consistent Configuration Reaching Maximum Throughput

Author: Jiang, Weiwen, Zhuge, Qingfeng, Chen, Xianzhang, Yang, Lei, Yi, Juan, and Sha, Edwin H.-M.
Published: 2016
Full Text: View/download PDF

45. SAPredictor

Author: Tan, Yujuan, primary, Chen, Wei, additional, Ma, Zhulin, additional, Xiao, Dan, additional, Yan, Zhichao, additional, Liu, Duo, additional, and Chen, Xianzhang, additional
Published: 2022
Full Text: View/download PDF

46. Lazy repair with temporary redundancy (LRTR)

Author: Luo, Longpan, primary, Tan, Yujuan, additional, Liu, Duo, additional, Duan, Moming, additional, Wang, Weilue, additional, Wu, Yu, additional, and Chen, Xianzhang, additional
Published: 2022
Full Text: View/download PDF

47. GATLB: A Granularity-Aware TLB to Support Multi-Granularity Pages in Hybrid Memory System

Author: Tan, Yujuan, primary, Xie, Yujie, additional, Ma, Zhulin, additional, Yan, Zhichao, additional, Zhang, Zhichao, additional, Liu, Duo, additional, and Chen, Xianzhang, additional
Published: 2022
Full Text: View/download PDF

48. Optimizing CoW-based File Systems on Open-Channel SSDs with Persistent Memory

Author: Zhang, Runyu, primary, Liu, Duo, additional, Yang, Chaoshu, additional, Chen, Xianzhang, additional, Qiao, Lei, additional, and Tan, Yujuan, additional
Published: 2022
Full Text: View/download PDF

49. Self-Adapting Channel Allocation for Multiple Tenants Sharing SSD Devices

Author: Liu, Renping, primary, Liu, Duo, additional, Chen, Xianzhang, additional, Tan, Yujuan, additional, Zhang, Runyu, additional, and Liang, Liang, additional
Published: 2022
Full Text: View/download PDF

50. ChordMap: Automated Mapping of Streaming Applications Onto CGRA

Author: Li, Zhaoying, primary, Wijerathne, Dhananjaya, additional, Chen, Xianzhang, additional, Pathania, Anuj, additional, and Mitra, Tulika, additional
Published: 2022
Full Text: View/download PDF
