235 results on '"NVM"'
Search Results
2. Integration
- Author
-
Chi, Min-Hwa, Wang, Yangyuan, editor, Chi, Min-Hwa, editor, Lou, Jesse Jen-Chung, editor, and Chen, Chun-Zhang, editor
- Published
- 2024
- Full Text
- View/download PDF
3. Comparative evaluation of memristor-based compact 4T2M SRAM with different memristor models.
- Author
-
Ashrafi, Md. Shakib Ibne, Maruf, Md Hasan, Shihavuddin, ASM, and Ali, Syed Iftekhar
- Subjects
- *
STATIC random access memory , *OPTICAL disks - Abstract
Static Random Access Memory (SRAM) is volatile and uses latching flip-flops to store each bit. To make SRAM work as non-volatile memory (NVM), memristor-based SRAM is a feasible choice mainly due to its high-speed operation and low power consumption. In this paper, the operational characteristics of 4T2M SRAM have been studied based on three different memristor models developed by Biolek et al. (2009), Joglekar and Wolf 2009, and Prodromakis et al. (2011), and comparative performance analysis has been made to assess its adaption to NVM. These three different models are compared in terms of delay, power consumption, and static noise margin. From the simulation, it has been observed that Biolek 4T2M SRAM produces better performance in write delay calculation scoring 0.873 ns when '0' is written and 0.166 ns when '1' is written. This model also provided a low power consumption value compared to other models. However, ternary plot analysis finds that Prodromakis is performing in average better in all positive traits. All the simulations are done in LTSpice and the transistor uses TSMC 180 nm CMOS technology. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. High-density via RRAM cell with multi-level setting by current compliance circuits
- Author
-
Yu-Cheng Hsieh, Yu-Cheng Lin, Yao-Hung Huang, Yu-Der Chih, Jonathan Chang, Chrong-Jung Lin, and Ya-Chin King
- Subjects
RRAM ,MLC ,NVM ,Materials of engineering and construction. Mechanics of materials ,TA401-492 - Abstract
Abstract In this work, multi-level storage in the via RRAM has been first time reported and demonstrated with the standard FinFET CMOS logic process. Multi-level states in via RRAM are achieved by controlling the current compliance during set operations. The new current compliance setting circuits are proposed to ensure stable resistance control when one considers cells under the process variation effect. The improved stability and tightened distributions on its multi-level states on via RRAM have been successfully demonstrated.
- Published
- 2024
- Full Text
- View/download PDF
5. Consistent Performance ZnO TFT Based Single Transistor Nonvolatile Memory with Minimal Charge Loss
- Author
-
Kumar, Binay Binod and Singh, Kunal
- Published
- 2024
- Full Text
- View/download PDF
6. High-density via RRAM cell with multi-level setting by current compliance circuits
- Author
-
Hsieh, Yu-Cheng, Lin, Yu-Cheng, Huang, Yao-Hung, Chih, Yu-Der, Chang, Jonathan, Lin, Chrong-Jung, and King, Ya-Chin
- Published
- 2024
- Full Text
- View/download PDF
7. Memory Challenges
- Author
-
Cagli, Carlo, Perniola, Luca, Merkle, Dieter, Managing Editor, Rudan, Massimo, editor, Brunetti, Rossella, editor, and Reggiani, Susanna, editor
- Published
- 2023
- Full Text
- View/download PDF
8. Programming Techniques of Resistive Random-Access Memory Devices for Neuromorphic Computing.
- Author
-
Machado, Pau, Manich, Salvador, Gómez-Pau, Álvaro, Rodríguez-Montañés, Rosa, González, Mireia Bargalló, Campabadal, Francesca, and Arumí, Daniel
- Subjects
COMPUTER storage devices ,SYMMETRY ,RANDOM access memory - Abstract
Neuromorphic computing offers a promising solution to overcome the von Neumann bottleneck, where the separation between the memory and the processor poses increasing limitations of latency and power consumption. For this purpose, a device with analog switching for weight update is necessary to implement neuromorphic applications. In the diversity of emerging devices postulated as synaptic elements in neural networks, RRAM emerges as a standout candidate for its ability to tune its resistance. The learning accuracy of a neural network is directly related to the linearity and symmetry of the weight update behavior of the synaptic element. However, it is challenging to obtain such a linear and symmetrical behavior with RRAM devices. Thus, extensive research is currently devoted at different levels, from material to device engineering, to improve the linearity and symmetry of the conductance update of RRAM devices. In this work, the experimental results based on different programming pulse conditions of RRAM devices are presented, considering both voltage and current pulses. Their suitability for application as analog RRAM-based synaptic devices for neuromorphic computing is analyzed by computing an asymmetric nonlinearity factor. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
9. ANV-PUF: Machine-Learning-Resilient NVM-Based Arbiter PUF.
- Author
-
NASSAR, HASSAN, BAUER, LARS, and HENKEL, JÖRG
- Subjects
MACHINE learning ,PHYSICAL mobility ,WORK design - Abstract
Physical Unclonable Functions (PUFs) have been widely considered an attractive security primitive. They use the deviations in the fabrication process to have unique responses from each device. Due to their nature, they serve as a DNA-like identity of the device. But PUFs have also been targeted for attacks. It has been proven that machine learning (ML) can be used to effectively model a PUF design and predict its behavior, leading to leakage of the internal secrets. To combat such attacks, several designs have been proposed to make it harder to model PUFs. One design direction is to use Non-Volatile Memory (NVM) as the building block of the PUF. NVM typically are multilevel cells, i.e, they have several internal states, which makes it harder to model them. However, the current state of the art of NVM-based PUFs is limited to 'weak PUFs', i.e., the number of outputs grows only linearly with the number of inputs, which limits the number of possible secret values that can be stored using the PUF. To overcome this limitation, in this work we design the Arbiter Non-Volatile PUF (ANV-PUF) that is exponential in the number of inputs and that is resilient against ML-based modeling. The concept is based on the famous delay-based Arbiter PUF (which is not resilient against modeling attacks) while using NVM as a building block instead of switches. Hence, we replace the switch delays (which are easy to model via ML) with the multi-level property of NVM (which is hard to model via ML). Consequently, our design has the exponential output characteristics of the Arbiter PUF and the resilience against attacks from the NVM-based PUFs. Our results show that the resilience to ML modeling, uniqueness, and uniformity are all in the ideal range of 50%. Thus, in contrast to the state-of-the-art, ANV-PUF is able to be resilient to attacks, while having an exponential number of outputs. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
10. Challenges in Design, Data Placement, Migration and Power-Performance Trade-offs in DRAM-NVM-based Hybrid Memory Systems.
- Author
-
Rai, Sadhana and Talawar, Basavaraj
- Subjects
- *
DYNAMIC random access memory , *AMALGAMATION - Abstract
DRAM-NVM-based hybrid memory opens up a varied range of power-performance-area operational configurations through page migration between the high-performance DRAM and the reliable NVM. The amalgamation of two technologies requires various modifications for the existing monolithic DRAM-based systems. This paper summarizes the current research work in the areas of data placement and page migration in hybrid memories. The challenges and design solutions from a range of NVMs-PCM, STT-RAM, ReRAM is presented. This paper also identifies several research challenges in these areas. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
11. SEC-Learn: Sensor Edge Cloud for Federated Learning : Invited Paper
- Author
-
Aichroth, Patrick, Antes, Christoph, Gembatzka, Pierre, Graf, Holger, Johnson, David S., Jung, Matthias, Kämpfe, Thomas, Kleinberger, Thomas, Köllmer, Thomas, Kuhn, Thomas, Kutter, Christoph, Krüger, Jens, Loroch, Dominik M., Lukashevich, Hanna, Laleni, Nellie, Zhang, Lei, Leugering, Johannes, Martín Fernández, Rodrigo, Mateu, Loreto, Mojumder, Shaown, Prautsch, Benjamin, Pscheidl, Ferdinand, Roscher, Karsten, Schneickert, Sören, Vanselow, Frank, Wallbott, Paul, Walter, Oliver, Weber, Nico, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Orailoglu, Alex, editor, Jung, Matthias, editor, and Reichenbach, Marc, editor
- Published
- 2022
- Full Text
- View/download PDF
12. A Two Tier Hybrid Metadata Management Mechanism for NVM Storage System
- Author
-
Cai, Tao, Gao, Pengfei, Chen, Fuli, Niu, Dejiao, Wang, Fei, Ma, Yueming, Li, Lei, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Cérin, Christophe, editor, Qian, Depei, editor, Gaudiot, Jean-Luc, editor, Tan, Guangming, editor, and Zuckerman, Stéphane, editor
- Published
- 2022
- Full Text
- View/download PDF
13. Drug Repurposing Using Link Prediction on Knowledge Graphs with Applications to Non-volatile Memory
- Author
-
Cohen, Sarel, Hershcovitch, Moshik, Taraz, Martin, Kißig, Otto, Wood, Andrew, Waddington, Daniel, Chin, Peter, Friedrich, Tobias, Kacprzyk, Janusz, Series Editor, Benito, Rosa Maria, editor, Cherifi, Chantal, editor, Cherifi, Hocine, editor, Moro, Esteban, editor, Rocha, Luis M., editor, and Sales-Pardo, Marta, editor
- Published
- 2022
- Full Text
- View/download PDF
14. Recent Research for HZO-Based Ferroelectric Memory towards In-Memory Computing Applications.
- Author
-
Yoo, Jaewook, Song, Hyeonjun, Lee, Hongseung, Lim, Seongbin, Kim, Soyeon, Heo, Keun, and Bae, Hagyoul
- Subjects
FLASH memory ,DYNAMIC random access memory ,FERROELECTRIC materials ,ELECTRIC field effects ,FERROELECTRIC capacitors ,NONVOLATILE memory ,SCHOTTKY barrier - Abstract
The AI and IoT era requires software and hardware capable of efficiently processing massive amounts data quickly and at a low cost. However, there are bottlenecks in existing Von Neumann structures, including the difference in the operating speed of current-generation DRAM and Flash memory systems, the large voltage required to erase the charge of nonvolatile memory cells, and the limitations of scaled-down systems. Ferroelectric materials are one exciting means of breaking away from this structure, as Hf-based ferroelectric materials have a low operating voltage, excellent data retention qualities, and show fast switching speed, and can be used as non-volatile memory (NVM) if polarization characteristics are utilized. Moreover, adjusting their conductance enables diverse computing architectures, such as neuromorphic computing with analog characteristics or 'logic-in-memory' computing with digital characteristics, through high integration. Several types of ferroelectric memories, including two-terminal-based FTJs, three-terminal-based FeFETs using electric field effect, and FeRAMs using ferroelectric materials as capacitors, are currently being studied. In this review paper, we include these devices, as well as a Fe-diode with high on/off ratio properties, which has a similar structure to the FTJs but operate with the Schottky barrier modulation. After reviewing the operating principles and features of each structure, we conclude with a summary of recent applications that have incorporated them. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
15. NVM Device-Based Deep Inference Architecture Using Self-gated Activation Functions (Swish)
- Author
-
Fatima, Afroz, Pethe, Abhijit, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Hirche, Sandra, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Möller, Sebastian, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Zhang, Junjie James, Series Editor, Bajpai, Manish Kumar, editor, Kumar Singh, Koushlendra, editor, and Giakos, George, editor
- Published
- 2021
- Full Text
- View/download PDF
16. fogcached: DRAM-NVM Hybrid Memory-Based KVS Server for Edge Computing
- Author
-
Ozawa, Kouki, Hirofuchi, Takahiro, Takano, Ryousei, Sugaya, Midori, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Katangur, Ajay, editor, Lin, Shih-Chun, editor, Wei, Jinpeng, editor, Yang, Shuhui, editor, and Zhang, Liang-Jie, editor
- Published
- 2020
- Full Text
- View/download PDF
17. Differentiated Services Oriented Auction Mechanism Design for NVM Based Edge Caching
- Author
-
Zhang, Zhenyuan, Liu, Fang, Cai, Zhenhua, Su, Yihong, Li, Weijun, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Sun, Xingming, editor, and Wang, Jinwei, editor
- Published
- 2020
- Full Text
- View/download PDF
18. Quantization and sparsity-aware processing for energy-efficient NVM-based convolutional neural networks.
- Author
-
Han Bao, Yifan Qin, Jia Chen, Ling Yang, Jiancong Li, Houji Zhou, Yi Li, and Xiangshui Miao
- Subjects
CONVOLUTIONAL neural networks ,NONVOLATILE memory ,ANALOG-to-digital converters ,DATABASES ,NETWORK performance ,ENERGY consumption - Abstract
Nonvolatile memory (NVM)-based convolutional neural networks (NvCNNs) have received widespread attention as a promising solution for hardware edge intelligence. However, there still exist many challenges in the resourceconstrained conditions, such as the limitations of the hardware precision and cost and, especially, the large overhead of the analog-to-digital converters (ADCs). In this study, we systematically analyze the performance of NvCNNs and the hardware restrictions with quantization in both weight and activation and propose the corresponding requirements of NVM devices and peripheral circuits for multiply-accumulate (MAC) units. In addition, we put forward an in situ sparsity-aware processing method that exploits the sparsity of the network and the device array characteristics to further improve the energy efficiency of quantized NvCNNs. Our results suggest that the 4-bit-weight and 3-bit-activation (W4A3) design demonstrates the optimal compromise between the network performance and hardware overhead, achieving 98.82% accuracy for the Modified National Institute of Standards and Technology database (MNIST) classification task. Moreover, higher-precision designs will claim more restrictive requirements for hardware nonidealities including the variations of NVM devices and the nonlinearities of the converters. Moreover, the sparsity-aware processing method can obtain 79%/53% ADC energy reduction and 2.98×/1.15× energy efficiency improvement based on the W8A8/W4A3 quantization design with an array size of 128 × 128. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
19. Library support for historical and persistent data structures in non-volatile memories
- Author
-
Chatzistergiou, Andreas, Viglas, Stratis, Nagarajan, Vijayanand, and Cintra, Marcelo
- Subjects
621.39 ,non-volatile memory ,NVM ,recoverability ,ARIES ,storage-class memories - Abstract
In the context of emerging non-volatile memory (NVM) where data structures can persist in-memory and are accessed through CPU loads and stores, we study how to efficiently manage data evolution. This is an extensively applied problem in both the scientific and business domains and is rapidly becoming an important component for a wider range of applications. We argue that the best way to achieve a smoother transition to the new programming model is to design a solution that is non-intrusive and generic i.e. not bound to a specific data model. We propose a novel library-level approach where the user can manage historical data directly from programming language code. This is achieved with a combination of two software layers: REWIND and VARIANT. At the bottom, lies REWIND (REcovery Write-Ahead System for In- Memory Non-Volatile Data Structures) which handles the low level specifics of NVM by dealing with write-ordering problems that arise in such context and allows recoverability of arbitrary data structures. Then, VARIANT (Versioning ARbItrary dAta structures in Non-volatile memory for Time-travel) focuses on versioning and time travel (moving between versions). We adopt a logging approach and we tightly integrate both systems for best performance by utilizing a common physical log of memory operations. With REWIND, we propose a novel recoverable log structure that permits atomic and durable appends and removals of log records. This is the keystone for building recoverable systems on top of NVM. Because latencies in recent NVM technologies such as Phase-change memory (PCM) are asymmetric, we propose novel techniques for reducing the write pressure of the recoverable log as well as mitigating the effect of synchronization control primitives such as memory fences (enhanced for NVM), i.e. barriers that enforce ordering and persistence to preceding instructions. We also propose different implementations for trading logging performance for rollback performance when this is appropriate. Finally, we revisit state-of-the-art recovery algorithms for the new context given the different latencies and synchronization control. Our results clearly indicate that current approaches for recoverability are ill-fitted for persisting data structures in the new context and it is possible to achieve low-overhead logging with customized mechanisms. Next, we focus on data evolution. We expose a simple API that allows versioning and time travel with minimal intrusiveness. We propose mechanisms for efficient and transparent cloning of Versionable data structures. This allows high concurrency since past images are returned as copies of the original data structure which remains intact. Then, we propose novel indexing techniques that significantly improve time travel performance as well as cloning with lazy schemes. We achieve a low overhead architecture by employing a mix of volatile and non-volatile data structures as well as hybrid structures that reside in both volatile and non-volatile memories. We perform an extensive evaluation of the proposed techniques and conclude that, in our context, by carefully mitigating the drawbacks of physical logging it is possible to create efficient systems for managing data evolution that are both data structure agnostic and non-intrusive.
- Published
- 2016
20. Exploring the Behavior of Coherent Accelerator Processor Interface (CAPI) on IBM Power8+ Architecture and FlashSystem 900
- Author
-
Velusamy, Kaushik, Prathapan, Smriti, Halem, Milton, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Weiland, Michèle, editor, Juckeland, Guido, editor, Alam, Sadaf, editor, and Jagode, Heike, editor
- Published
- 2019
- Full Text
- View/download PDF
21. N-Docker: A NVM-HDD Hybrid Docker Storage Framework to Improve Docker Performance
- Author
-
Gu, Lin, Tang, Qizhi, Wu, Song, Jin, Hai, Zhang, Yingxi, Shi, Guoqiang, Lin, Tingyu, Rao, Jia, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Tang, Xiaoxin, editor, Chen, Quan, editor, Bose, Pradip, editor, Zheng, Weiming, editor, and Gaudiot, Jean-Luc, editor
- Published
- 2019
- Full Text
- View/download PDF
22. RRAMSpec: A Design Space Exploration Framework for High Density Resistive RAM
- Author
-
Mathew, Deepak M., Chinazzo, André Lucas, Weis, Christian, Jung, Matthias, Giraud, Bastien, Vivet, Pascal, Levisse, Alexandre, Wehn, Norbert, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Pnevmatikatos, Dionisios N., editor, Pelcat, Maxime, editor, and Jung, Matthias, editor
- Published
- 2019
- Full Text
- View/download PDF
23. Disaggregating Non-Volatile Memory for Throughput-Oriented Genomics Workloads
- Author
-
Call, Aaron, Polo, Jordà, Carrera, David, Guim, Francesc, Sen, Sujoy, Hutchison, David, Series Editor, Kanade, Takeo, Series Editor, Kittler, Josef, Series Editor, Kleinberg, Jon M., Series Editor, Mattern, Friedemann, Series Editor, Mitchell, John C., Series Editor, Naor, Moni, Series Editor, Pandu Rangan, C., Series Editor, Steffen, Bernhard, Series Editor, Terzopoulos, Demetri, Series Editor, Tygar, Doug, Series Editor, Mencagli, Gabriele, editor, B. Heras, Dora, editor, Cardellini, Valeria, editor, Casalicchio, Emiliano, editor, Jeannot, Emmanuel, editor, Wolf, Felix, editor, Salis, Antonio, editor, Schifanella, Claudio, editor, Manumachu, Ravi Reddy, editor, Ricci, Laura, editor, Beccuti, Marco, editor, Antonelli, Laura, editor, Garcia Sanchez, José Daniel, editor, and Scott, Stephen L., editor
- Published
- 2019
- Full Text
- View/download PDF
24. Challenges and Trends of Nonvolatile In-Memory-Computation Circuits for AI Edge Devices
- Author
-
Je-Min Hung, Chuan-Jia Jhang, Ping-Chun Wu, Yen-Cheng Chiu, and Meng-Fan Chang
- Subjects
Artificial intelligence ,nonvolatile-memory ,NVM ,computation-in-memory ,CIM ,nvCIM ,Electric apparatus and materials. Electric circuits. Electric networks ,TK452-454.4 - Abstract
Nonvolatile memory (NVM)-based computing-in-memory (nvCIM) is a promising candidate for artificial intelligence (AI) edge devices to overcome the latency and energy consumption imposed by the movement of data between memory and processors under the von Neumann architecture. This paper explores the background and basic approaches to nvCIM implementation, including input methodologies, weight formation and placement, and readout and quantization methods. This paper outlines the major challenges in the further development of nvCIM macros and reviews trends in recent silicon-verified devices.
- Published
- 2021
- Full Text
- View/download PDF
25. Characterization of Android Memory References and Implication to Hybrid Memory Management
- Author
-
Soyoon Lee and Hyokyung Bahn
- Subjects
Android ,smartphone ,application ,memory reference ,NVM ,write operation ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
In this article, we analyze Android applications’ memory reference behaviors, and observe that smartphone memory accesses are different from traditional computer systems with respect to the following five aspects: 1) A limited number of hot pages account for a majority of memory writes, and these hot pages have similar logical addresses regardless of application types; 2) The identities of these hot pages are shared library, linker, and stack regions; 3) The memory access behaviors of hot pages do not change significantly as time progresses even after applications finish their launching; 4) The skewness of memory write accesses in Android is extremely stronger than that of desktop systems; 5) In predicting re-reference likelihood of hot pages, temporal locality is better than reference frequency. Based on these observations, we present a new smartphone memory management scheme for DRAM-NVM hybrid memory. Adopting NVM is effective in power-saving of smartphones, but NVM has weaknesses in write operations. Thus, we aim to identify write-intensive pages and place them on DRAM. Unlike previous studies, we prevent migration of pages between DRAM and NVM, which eliminates unnecessary NVM write traffic that accounts for 32-42% of total write traffic. By judiciously managing the admission of hot pages in DRAM, our scheme reduces the write traffic to NVM by 42% on average without performance degradations.
- Published
- 2021
- Full Text
- View/download PDF
26. Modeling and Analysis of the Page Sizing Problem for NVM Storage in Virtualized Systems
- Author
-
Yunjoo Park and Hyokyung Bahn
- Subjects
Page size ,NVM ,virtualization ,memory performance ,address translation ,page fault ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Recently, NVM (non-volatile memory) has advanced as a fast storage medium, and traditional memory management systems designed for HDD storage should be reconsidered. In this article, we revisit the page sizing problem in NVM storage, specially focusing on virtualized systems. The page sizing problem has not caught attention in traditional systems because of the two reasons. First, the memory performance is not sensitive to the page size when HDD is adopted as storage. We show that this is not the case in NVM storage by analyzing the TLB miss rate and the page fault rate, which have trade-off relations with respect to the page size. Second, changing the page size in traditional systems is not easy as it accompanies significant overhead. However, due to the widespread adoption of virtualized systems, the page sizing problem becomes feasible for virtual machines, which are generated for executing specific workloads with fixed hardware resources. In this article, we design a page size model that accurately estimates the TLB miss rate and the page fault rate for NVM storage. We then present a method that has the ability of estimating the memory access time as the page size is varied, which can guide a suitable page size for given environments. By considering workload characteristics with given memory and storage resources, we show that the memory performance of virtualized systems can be improved by 38.4% when our model is adopted.
- Published
- 2021
- Full Text
- View/download PDF
27. Metrics for Quantification of By-Process Segregation in Ge-Rich GST
- Author
-
Elisa Petroni, Andrea Serafini, Davide Codegoni, Paolo Targa, Luca Mariani, Mario Scuderi, Giuseppe Nicotra, and Andrea Redaelli
- Subjects
pcm ,Ge-rich GST ,GST ,NVM ,phase segregation ,EELS analysis ,Physics ,QC1-999 - Abstract
Ge-rich GST alloys are the most promising materials for phase-change memory (PCM) to fulfill the soldering compliance and the tough data retention requirements of automotive applications. Significant efforts have been made to engineer those materials and optimize their integration inside the fabrication process of PCM. In this perspective, the physical characterization of the device and the material is instrumental in understanding the underlying physics, improving the process, and optimizing the interactions between the device, the process, and the material itself. Especially, microscopic investigations have gathered increasing interest, giving detailed descriptions of local material modulations that have a crucial role in cell programming and reliability performances. In this work, a deep analysis of Ge-rich GST microscopic alloy evolution during the integration process has been performed, exploiting analysis by EELS with TEM supported by a novel statistical data post-processing method. The new proposed statistical-based methodology also introduces new simple metrics for elemental compositional evaluations that have been exploited for process engineering.
- Published
- 2022
- Full Text
- View/download PDF
28. Analog In-Memory Multiply-and-Accumulate Engine Fabricated in 22nm FDSOI Technology
- Author
-
Moran, Steven
- Subjects
Electrical engineering ,Charge-Trap ,CTT ,inference ,Multiply-and-Accumulate ,nonvolatile ,NVM - Abstract
This dissertation presents the first on-chip demonstration of a Multiply-and-Accumulate (MAC) function in 22nm CMOS on SOI with the Charge-Trap Transistor (CTT).Recent developments in machine learning and AI focus on digital-based von Neumann architectures to accelerate computation using massively parallel processing platforms including Graphics Processing Units (GPUs), Tensor Processing Units (TPUs), and Application- Specific Integrated Circuits (ASICs), to name a few. While these platforms have dramatically improved system performance, they are inherently limited by the von Neumann memory bottleneck. A resurgence of digital and analog in-memory & near-memory computing (iMC) techniques have been proposed to perform computation directly where the memory is stored, eliminating unnecessary memory accesses and minimizing memory access energy.A hybrid approach to designing high-performance AI computation platforms is composed of learning on the cloud and energy-efficient inference at the edge. In this dissertation, we explore the latter through the use of the Charge-Trap Transistor (CTT)—commercial high-k logic nFET device on SOI—as an ideal candidate nonvolatile memory device for analog-based in-memory computing. Past results show that the CTT can be accurately programmed with excellent resolution, device programming variance, and retention characteristics. We propose a NeuroCTT inference architecture and present experimental results based on two test chips taped-out utilizing GlobalFoundries 22FDX technology. A first-time demonstration of an on-chip analog MAC Engine using the CTT in a commercial CMOS technology is provided. Accurate on-chip weight programming with sufficient retention are also demonstrated in hardware. In addition, we introduce a CTT-Hardware-based Inference Realistic Circuit Universal Simulator (CIRCUS) Platform for studying the effects of circuit-induced errors and device non-idealities on system performance and accuracy.We conclude by evaluating the resiliency of general-purpose neural network applications by evaluating the effect of weight programming variance on analog-based in-memory computing and bit errors on digital-based architectures. As a baseline for digital-based & energy-efficient ASICs, an IBM TrueNorth Neurosynaptic System is exposed to 4MeV protons corrupting the on-chip model file for a trained 12-layer Convolutional Neural Network (CNN). The IBM TrueNorth continues to perform classification with negligible degradation to accuracy. For larger-scale networks and memory-intensive applications, reliability studies were also performed on 3D-stacked (3DS) DRAM to study the effect of radiation on more advanced 3D-stacked architectures.
- Published
- 2022
29. NVM Storage in IoT Devices: Opportunities and Challenges.
- Author
-
Yang Liu, Shan Zhao, Wenhan Chen, Xuran Ge, Fang Liu, Shuo Li, and Nong Xiao
- Subjects
NONVOLATILE memory ,INTERNET of things ,ENERGY consumption ,5G networks ,BANDWIDTHS - Abstract
Edge storage stores the data directly at the data collection point, and does not need to transmit the collected data to the storage central server through the network. It is a critical technology that supports applications such as edge computing and 5G network applications, with lower network communication overhead, lower interaction delay and lower bandwidth cost. However, with the explosion of data and higher real-time requirements, the traditional Internet of Things (IoT) storage architecture cannot meet the requirements of low latency and large capacity. Non-volatile memory (NVM) presents new possibilities regarding this aspect. This paper classifies the different storage architectures based on NVM and compares the system goals, architectures, features, and limitations to explore new research opportunities. Moreover, the existing solutions to reduce the write latency and energy consumption and increase the lifetime of NVM IoT storage devices are analyzed. Furthermore, we discuss the security and privacy issues of IoT devices and compare the mainstream solutions. Finally, we present the opportunities and challenges of building IoT storage systems based on NVM. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
30. A unified hybrid memory system for scalable deep learning and big data applications.
- Author
-
Rang, Wei, Liang, Huanghuang, Wang, Ye, Zhou, Xiaobo, and Cheng, Dazhao
- Subjects
- *
DYNAMIC random access memory , *DEEP learning , *BIG data - Abstract
Emerging non-volatile memory (NVM) technologies are of dynamic random access memory (DRAM)-like, high capacity, and low cost, at the expense of slower bandwidth and higher read/write latency compared to DRAM. Typically, NVM finds its primary application in serving as an extension of conventional DRAM to create hybrid memory systems tailored to non-uniform memory access (NUMA) architectures. This strategic integration offers the potential for high performance, enhanced capacity efficiency, and a favorable balance of cost considerations. Traditional NUMA memory management policies distribute data uniformly across both DRAM and NVM, overlooking the inherent performance gap between these heterogeneous memory systems. This challenge becomes particularly pronounced when provisioning resources for deep learning and big data applications in hybrid memory systems. To tackle the performance issues in the hybrid memory systems, we propose and develop a unified memory system, UniRedl, which automatically optimizes data migration between DRAM and NVM based on data access patterns and computation graphs of applications. To improve application performance, we provide a new memory allocation strategy named HiLowAlloc. We further design two data migration strategies in UniRedl, Idle Migration and Dynamic Migration, for management of hybrid memory systems. Specifically, Idle Migration aims to manage data placed in DRAM, while Dynamic Migration manages data saved in NVM. The experimental results demonstrate that on average UniRedl improves application performance by 33.2%, 20.6%, 19.0%, and 17.5% compared to the traditional NUMA, NUMA with anb, BMPM, and OIM, respectively. It also achieves 52.0%, 34.3%, 30.6%, 22.1% on average improvement in data locality against the state-of-the-art solutions. • UniRedl abstracts DRAM and NVM across the whole cluster into a unified memory space. • UniRedl features a new memory allocation strategy named HiLowAlloc. • UniRedl offers two data migration strategies, Idle Migration and Dynamic Migration. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
31. Research on the consensus of big data systems based on RDMA and NVM
- Author
-
Hao WU, Kang CHEN, Yongwei WU, and Weimin ZHENG
- Subjects
big data ,distributed system ,consensus protocol ,RDMA ,NVM ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
Distributed storage systems and computing systems are the foundation for constructing big data processing systems.High availability of the system is the cornerstone of any distributed system.High-availability technologies generally rely on consensus protocols.The classic non-Byzantine distributed consensus protocol was discussed,as well as the RDMA communication protocol and NVM storage media under the development of new technologies to achieve higher performance high availability systems by combining them.The consensus protocol to make the better use of the features of RDMA and NVM was modified.The implemented system effectively improves the performance of the protocol while ensuring the consistency and availability of the system data.Experiments show that the system implemented in this paper can achieve 40% performance improvement compared to existing systems.
- Published
- 2019
- Full Text
- View/download PDF
32. Active Data Replica Recovery for Quality-Assurance Big Data Analysis in IC-IoT
- Author
-
Songyun Wang, Jiabin Yuan, Xin Li, Zhuzhong Qian, Fabio Arena, and Ilsun You
- Subjects
Big data analysis ,data recovery ,IC-IoT ,NVM ,QoS improvement ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
QoS-aware big data analysis is critical in Information-Centric Internet of Things (IC-IoT) system to support various applications like smart city, smart grid, smart health, intelligent transportation systems, and so on. The employment of non-volatile memory (NVM) in cloud or edge system provides good opportunity to improve quality of data analysis tasks. However, we have to face the data recovery problem led by NVM failure due to the limited write endurance. In this paper, we investigate the data recovery problem for QoS guarantee and system robustness, followed by proposing a rarity-aware data recovery algorithm. The core idea is to establish the rarity indicator to evaluate the replica distribution and service requirement comprehensively. With this idea, we give the lost replicas with distinguishing priority and eliminate the unnecessary replicas. Then, the data replicas are recovered stage by stage to guarantee QoS and provide system robustness. From our extensive experiments and simulations, it is shown that the proposed algorithm has significant performance improvement on QoS and robustness than the traditional direct data recovery method. Besides, the algorithm gives an acceptable data recovery time.
- Published
- 2019
- Full Text
- View/download PDF
33. An Energy Efficient and Secure Data Aggregation Method for WSNs Based on Dynamic Set
- Author
-
Zhu, Jinsheng, Jia, Zhiping, Diniz Junqueira Barbosa, Simone, Series editor, Chen, Phoebe, Series editor, Du, Xiaoyong, Series editor, Filipe, Joaquim, Series editor, Kara, Orhun, Series editor, Kotenko, Igor, Series editor, Liu, Ting, Series editor, Sivalingam, Krishna M., Series editor, Washio, Takashi, Series editor, Yuan, Hanning, editor, Geng, Jing, editor, and Bian, Fuling, editor
- Published
- 2017
- Full Text
- View/download PDF
34. Pyramid: Revisiting Memory Extension with Remote Accessible Non-Volatile Main Memory
- Author
-
Yu, Songping, Deng, Mingzhu, Xing, Yuxuan, Xiao, Nong, Liu, Fang, Chen, Wei, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Wang, Guojun, editor, Atiquzzaman, Mohammed, editor, Yan, Zheng, editor, and Choo, Kim-Kwang Raymond, editor
- Published
- 2017
- Full Text
- View/download PDF
35. Logic Nonvolatile Memory
- Author
-
Ma, Yanjun, Kan, Edwin, Ma, Yanjun, and Kan, Edwin
- Published
- 2017
- Full Text
- View/download PDF
36. Non-Data-Storage Applications
- Author
-
Ma, Yanjun, Kan, Edwin, Ma, Yanjun, and Kan, Edwin
- Published
- 2017
- Full Text
- View/download PDF
37. Multiple-Times Programmable Logic Nonvolatile Memory
- Author
-
Ma, Yanjun, Kan, Edwin, Ma, Yanjun, and Kan, Edwin
- Published
- 2017
- Full Text
- View/download PDF
38. Twizzler: A Data-centric OS for Non-volatile Memory.
- Author
-
BITTMAN, DANIEL, ALVARO, PETER, MEHRA, PANKAJ, LONG, DARRELL D. E., and MILLER, ETHAN L.
- Subjects
NONVOLATILE memory ,COMPUTER operating systems ,INFORMATION sharing - Abstract
Byte-addressable, non-volatile memory (NVM) presents an opportunity to rethink the entire system stack. We present Twizzler, an operating system redesign for this near-future. Twizzler removes the kernel from the I/O path, provides programs with memory-style access to persistent data using small (64 bit), object-relative cross-object pointers, and enables simple and efficient long-term sharing of data both between applications and between runs of an application. Twizzler provides a clean-slate programming model for persistent data, realizing the vision of Unix in a world of persistent RAM. We show that Twizzler is simpler, more extensible, and more secure than existing I/O models and implementations by building software for Twizzler and evaluating it on NVM DIMMs. Most persistent pointer operations in Twizzler impose less than 0.5 ns added latency. Twizzler operations are up to 13× faster than Unix, and SQLite queries are up to 4.2× faster than on PMDK. YCSB workloads ran 1.1-2.9× faster on Twizzler than on native and NVM-optimized SQLite backends. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
39. Impact of the Nonlinear Dielectric Hysteresis Properties of a Charge Trap Layer in a Novel Hybrid High-Speed and Low-Power Ferroelectric or Antiferroelectric HSO/HZO Boosted Charge Trap Memory.
- Author
-
Ali, Tarek, Mertens, Konstantin, Olivo, Ricardo, Rudolph, Matthias, Oehler, Sebastian, Kuhnel, Kati, Lehninger, David, Muller, Franz, Hoffmann, Raik, Schramm, Philipp, Biedermann, Kati, Metzger, Joachim, Binder, Robert, Czernohorsky, Malte, Kampfe, Thomas, Muller, Johannes, Seidel, Konrad, Van Houdt, Jan, and Eng, Lukas M.
- Subjects
- *
DIELECTRIC properties , *LONG-term synaptic depression , *HAFNIUM oxide , *MEMORY , *LINEAR orderings , *NON-coding RNA , *CURRENT transformers (Instrument transformer) - Abstract
A novel hybrid antiferroelectric (AFE)-based charge trap (CT) memory is reported focusing on an amplified tunnel oxide field (ETO) via the dynamic of an AFE hysteresis dipole switching. The role of dynamic permittivity increase and the saturated polarization at the instant of the write operation are explored for enhanced ETO. The hybrid CT concept is studied by benchmarking the controlled properties of the HfO2 CT layer via the Si elemental doping in order to stabilize the linear dielectric (DE), ferroelectric (FE), or AFE response. The Si-doped HfO2 (HSO) with AFE stabilized phase shows the largest memory window (4.5 V) compared to the DE- or FE-based CT layers. The dynamic AFE dipole switching enables a maximized ETO at the instant of switching such that a high-speed and low-power CT memory is realized. The role of the tailored hysteresis shape on the ETO magnitude is studied for different Si contents and benchmarked to the Zr-doped hafnium oxide (HZO). The AFE CT multilevel coding as 1–3 bit/cell, the role of the pass voltage disturb, and a mini-NAND array operation are demonstrated. The global variability and area scalability of the HSO CT devices are studied to indicate the effect of Si content distribution and the area dependence of the AFE film variability. AFE CT devices are characterized for a switching speed (< 1 μs), ten-year data retention, and 105 endurance. Moreover, the improved CT characteristics by the AFE dipole switching are explored for enhanced long-term potentiation and depression of a synaptic device. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
40. RCAN family member 3 deficiency contributes to noncompaction of the ventricular myocardium.
- Author
-
Hu T, Liu L, Wang H, Yang M, Xu B, Xie H, Lin Z, Jin X, Wang P, Liu Y, Sun H, and Liu S
- Subjects
- Animals, Female, Humans, Infant, Male, Cardiomyopathies genetics, Cardiomyopathies pathology, Exome Sequencing, Heart Ventricles pathology, Isolated Noncompaction of the Ventricular Myocardium genetics, Isolated Noncompaction of the Ventricular Myocardium pathology, Mutation, Missense genetics, Myocardium pathology, Myocardium metabolism, Myocardium ultrastructure, Myocytes, Cardiac pathology, Myocytes, Cardiac metabolism, Pedigree, Adaptor Proteins, Signal Transducing genetics, Adaptor Proteins, Signal Transducing metabolism, Zebrafish genetics
- Abstract
Noncompaction of the ventricular myocardium (NVM), the third most diagnosed cardiomyopathy, is characterized by prominent trabeculae and intratrabecular recesses. However, the genetic etiology of 40%-60% of NVM cases remains unknown. Here, we identify two infants with NVM, in a nonconsanguineous family, with a typical clinical presentation of persistent bradycardia since the prenatal period. A homozygous missense variant (R223L) of RCAN family member 3 (RCAN3) is detected in both infants using whole-exome sequencing. In the zebrafish model, marked cardiac dysfunction is detected in rcan3 deficiency (MO-rcan3
ATG -injected) and rcan-/- embryos. Developmental dysplasia of both endocardial and myocardial layers is also detected in rcan3-deficient embryos. RCAN3 R223L variant mRNAs can not rescue heart defects caused by rcan3 knockdown or knockout; however, hRCAN3 mRNAs rescue these phenotypes. RNA-seq experiments show that several genes involved in cardiomyopathies are significantly regulated through multiple signaling pathways in the rcan3-knockdown zebrafish model. In human cardiomyocytes, RCAN3 deficiency results in reduced proliferation and increased apoptosis, together with an abnormal mitochondrial ultrastructure. Thus, we suggest that RCAN3 is a susceptibility gene for cardiomyopathies, especially NVM and that the R223L mutation is a potential loss-of-function variant., Competing Interests: Conflict of interest The authors declare no conflicts of interest., (Copyright © 2024 Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, and Genetics Society of China. Published by Elsevier Ltd. All rights reserved.)- Published
- 2024
- Full Text
- View/download PDF
41. HasFS: optimizing file system consistency mechanism on NVM-based hybrid storage architecture.
- Author
-
Liu, Yubo, Li, Hongbo, Lu, Yutong, Chen, Zhiguang, Xiao, Nong, and Zhao, Ming
- Subjects
- *
CACHE memory , *METADATA , *SYSTEMS design , *NONVOLATILE memory - Abstract
In order to protect the data during system crash, traditional DRAM–DISK architecture file systems (e.g., EXT4) need to synchronize the dirty metadata and data from the memory to disk. At the same time, the disk synchronization may break the consistency of file system upon a crash, so traditional file systems use some mechanisms to guarantee the file system consistency when the dirty metadata and data is synchronized onto persistent storage devices (e.g., HDD and SSD). Journaling is a consistency mechanism widely used by file systems. We observe that the overhead of periodic disk synchronization and journaling is high. Emerging non-volatile memories (NVMs) can be potentially utilized to reduce these overheads. In this paper, we present hybrid architecture for storage file system (HasFS), a file system designed for the DRAM–NVM–DISK architecture. HasFS extends the main memory with NVM and considers NVM as a persistent page cache to eliminate the periodic disk synchronization overhead of dirty data. Then we design an efficient consistency mechanism based on the hybrid memory architecture to provide strong (both metadata and data) consistency guarantee with low overhead. The evaluation demonstrates that HasFS outperforms mainstream DRAM–DISK file systems for many workloads. For instance, HasFS has between 1.6X to 46.6X performance improvement over other tested file systems in random write workload. In particular, HasFS outperforms EXT4 without journal in some cases even though HasFS provides metadata and data consistency guarantees (similar to EXT4 with journal data mode). [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
42. Initial experience with 3D XPoint main memory.
- Author
-
Liu, Jihang and Chen, Shimin
- Subjects
MEMORY ,EXPERIENCE - Abstract
3D XPoint is the first commercially available main memory NVM solution targeting mainstream computer systems. Previous database studies on NVM memory evaluate their proposed techniques mainly on simulated or emulated NVM hardware. In this paper, we report our initial experience experimenting with the real 3D XPoint main memory hardware. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
43. A Methology for Improving the Performance of Paravirtual I/O Systems Based on Fast NVM Devices.
- Author
-
MYOUNGWON OH, HANUL SUNG, SANGGEOL LEE, HYEONSANG EOM, and HEON Y. YEOM
- Subjects
PERFORMANCES - Abstract
Paravirtual I/O systems have been paid much attention to due to their reasonable performance levels. To increase these levels, Non-Volatile Memory (NVM) such as flash memory is considered and used as their storage media alternative to HDD. Although the storage media is fast, the performance of paravirtual I/O systems using the media is much lower than expected. The performance is lowered because the I/O process in the guest and host OSes is serialized and the OSes are run ignoring the processor affinity while the same software layers performing I/O are duplicated in the OSes. We present our methodology to employ towards optimizing the performance of NVM-based paravirtual I/O systems: the use of polling rather than interrupt, and parallel batching in order to maximize the parallelism in performing the sequence of I/O operations, and avoidance of context switches in order to consider the processor affinity. Our experiments with Kernel Virtual Machine (KVM) I/O systems using different NVM storage devices suggest that the use of the methodology can lead to enhancements in throughput by 50% to more than 80% while reducing CPU usage by up to 25% for a microbenchmark program and by up to 100% for workloads in mixed-read/write patterms. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
44. Quantum Dot Gate (QDG) Quantum Dot Channel (QDC) Multistate Logic Non-Volatile Memory (NVM) with High-K Dielectric HfO2 Barriers.
- Author
-
Butterfield, N. R., Mays, R., Khan, B., Gudlavalleti, R., and Jain, F. C.
- Subjects
- *
QUANTUM dots , *QUANTUM gates , *INDIUM gallium zinc oxide , *MOORE'S law , *DIELECTRICS , *QUANTUM wells , *TUNNEL junctions (Materials science) , *MODULATION-doped field-effect transistors - Abstract
This paper presents the theory, fabrication and experimental testing results for a multiple state Non-Volatile Memory (NVM), comprised of hafnium oxide high-k dielectric tunnel and gate barriers as well as a Silicon Quantum Dot Superlattice (QDSL) implemented for the floating gate and inversion channel (QDG) and (QDC) respectively. With the conclusion of Moore's Law for conventional transistor fabrication, regarding the minimum gate size, current efforts in memory cell research and development are focused on bridging the gap between the conventions of the past sixty years and the future of computing. One method of continuing the increasing chip density is to create multistate devices capable of storing and processing additional logic states beyond 1 and 0. Replacing the silicon nitride floating gate of a conventional Flash NVM with QDSL gives rise to minibands that result in greater control over charge levels stored in the QDG and additional intermediate states. Utilizing Hot Carrier Injection (HCI) programming, for the realized device, various magnitudes of gate voltage pulses demonstrated the ability to accurately control the charge levels stored in the QDG. This corresponds to multiple threshold voltage shifts allowing detection of multiple states during read operations. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
45. Implications of NVM Based Storage on Memory Subsystem Management.
- Author
-
Bahn, Hyokyung and Cho, Kyungwoon
- Subjects
DYNAMIC random access memory ,HARD disks ,COMPUTER storage devices ,SYSTEMS software - Abstract
Featured Application: The authors anticipate that memory and storage configurations explored in this article will be helpful in the design of system software for future computer systems with ever-growing memory demands and the limited density of DRAM. Recently, non-volatile memory (NVM) has advanced as a fast storage medium, and legacy memory subsystems optimized for DRAM (dynamic random access memory) and HDD (hard disk drive) hierarchies need to be revisited. In this article, we explore the memory subsystems that use NVM as an underlying storage device and discuss the challenges and implications of such systems. As storage performance becomes close to DRAM performance, existing memory configurations and I/O (input/output) mechanisms should be reassessed. This article explores the performance of systems with NVM based storage emulated by the RAMDisk under various configurations. Through our measurement study, we make the following findings. (1) We can decrease the main memory size without performance penalties when NVM storage is adopted instead of HDD. (2) For buffer caching to be effective, judicious management techniques like admission control are necessary. (3) Prefetching is not effective in NVM storage. (4) The effect of synchronous I/O and direct I/O in NVM storage is less significant than that in HDD storage. (5) Performance degradation due to the contention of multi-threads is less severe in NVM based storage than in HDD. Based on these observations, we discuss a new PC configuration consisting of small memory and fast storage in comparison with a traditional PC consisting of large memory and slow storage. We show that this new memory-storage configuration can be an alternative solution for ever-growing memory demands and the limited density of DRAM memory. We anticipate that our results will provide directions in system software development in the presence of ever-faster storage devices. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
46. User-Space I/O for s-level Storage Devices
- Author
-
Papagiannis, Anastasios, Saloustros, Giorgos, Marazakis, Manolis, Bilas, Angelos, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Taufer, Michela, editor, Mohr, Bernd, editor, and Kunkel, Julian M., editor
- Published
- 2016
- Full Text
- View/download PDF
47. Programming techniques of resistive random-access memory devices for neuromorphic computing
- Author
-
Universitat Politècnica de Catalunya. Departament d'Enginyeria Electrònica, Universitat Politècnica de Catalunya. EFRICS - Efficient and Robust Integrated Circuits and Systems, Machado Panadés, Pau, Manich Bou, Salvador, Gómez Pau, Álvaro, Rodríguez Montañés, Rosa, Bargalló González, Mireia, Campabadal, Francesca, Arumi Delgado, Daniel, Universitat Politècnica de Catalunya. Departament d'Enginyeria Electrònica, Universitat Politècnica de Catalunya. EFRICS - Efficient and Robust Integrated Circuits and Systems, Machado Panadés, Pau, Manich Bou, Salvador, Gómez Pau, Álvaro, Rodríguez Montañés, Rosa, Bargalló González, Mireia, Campabadal, Francesca, and Arumi Delgado, Daniel
- Abstract
Neuromorphic computing offers a promising solution to overcome the von Neumann bottleneck, where the separation between the memory and the processor poses increasing limitations of latency and power consumption. For this purpose, a device with analog switching for weight update is necessary to implement neuromorphic applications. In the diversity of emerging devices postulated as synaptic elements in neural networks, RRAM emerges as a standout candidate for its ability to tune its resistance. The learning accuracy of a neural network is directly related to the linearity and symmetry of the weight update behavior of the synaptic element. However, it is challenging to obtain such a linear and symmetrical behavior with RRAM devices. Thus, extensive research is currently devoted at different levels, from material to device engineering, to improve the linearity and symmetry of the conductance update of RRAM devices. In this work, the experimental results based on different programming pulse conditions of RRAM devices are presented, considering both voltage and current pulses. Their suitability for application as analog RRAM-based synaptic devices for neuromorphic computing is analyzed by computing an asymmetric nonlinearity factor., This work was supported in part by Spanish MCIN/AEI /10.13039/501100011033/FEDER, under Projects PID2022-141391OB-C22 and PID2022-139586NB-C42., Postprint (published version)
- Published
- 2023
48. True random number generator based on RRAM-bias current starved ring oscillator
- Author
-
Universitat Politècnica de Catalunya. Departament d'Enginyeria Electrònica, Universitat Politècnica de Catalunya. EFRICS - Efficient and Robust Integrated Circuits and Systems, Arumi Delgado, Daniel, Manich Bou, Salvador, Gómez Pau, Álvaro, Rodríguez Montañés, Rosa, Bargalló González, Mireia, Campabadal, Francesca, Universitat Politècnica de Catalunya. Departament d'Enginyeria Electrònica, Universitat Politècnica de Catalunya. EFRICS - Efficient and Robust Integrated Circuits and Systems, Arumi Delgado, Daniel, Manich Bou, Salvador, Gómez Pau, Álvaro, Rodríguez Montañés, Rosa, Bargalló González, Mireia, and Campabadal, Francesca
- Abstract
This work presents a RRAM-bias current starved ring oscillator (CSRO) as TRNG, where the cycle-to-cycle variability of a RRAM device is exploited as source of randomness. A simple voltage divider composed of this RRAM and a resistor is considered to bias the gate terminal of the extra transistor of every current starved (CS) inverter of the RO. In this way, the delay of the inverters is modified, deriving an unpredictable oscillation frequency every time the RRAM switches to the HRS. The oscillation frequency is finally leveraged to extract the sequence of random bits. The design is simple and add low area overhead. Experimental measurements are performed to analyze the cycle-to-cycle variability in the HRS. The very same measurements are subsequently used to validate the TRNG by means of electrical simulations. The obtained results passed all the NIST tests without the need for post-processing., This work was supported by the Spanish MCIN/AEI/10.13039/501100011033 under Project PID2019-103869RB-C33. The work of M. B. González was supported by the Ramón y Cajal under Grant RYC2020-030150-I., Peer Reviewed, Postprint (published version)
- Published
- 2023
49. MNEMOSENE++: Scalable multi-tile design with enhanced buffering and VGSOT-MRAM based compute-in-memory crossbar array
- Author
-
Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors, Escuín Blasco, Carlos, García Redondo, Fernando, Zahedi, Mahdi, Ibáñez Marín, Pablo Enrique, Monreal Arnal, Teresa, Viñals Yúfera, Victor, Llaberia Griñó, José M., Myers, James, Ryckaert, Julien, Biswas, Dwaipayan, Catthoor, Francky, Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors, Escuín Blasco, Carlos, García Redondo, Fernando, Zahedi, Mahdi, Ibáñez Marín, Pablo Enrique, Monreal Arnal, Teresa, Viñals Yúfera, Victor, Llaberia Griñó, José M., Myers, James, Ryckaert, Julien, Biswas, Dwaipayan, and Catthoor, Francky
- Abstract
This paper optimizes the MNEMOSENE architecture, a compute-in-memory (CiM) tile design integrating computation and storage for increased efficiency. We identify and address bottlenecks in the Row Data (RD) buffer that cause losses in performance. Our proposed approach includes mitigating these buffering bottlenecks and extending MNEMOSENE’s single-tile design to a multi-tile configuration for improved parallel processing. The proposal is validated through comprehensive analyses exploring the mapping of diverse neural networks evaluated on CiM crossbar arrays based on NVM technologies. These proposed enhancements lead up to 55% reduction in execution time compared to the original single-tile architecture for any general matrix multiplication (GEMM) operation. Our evaluation shows that while ReRAM and PCM offer notable energy advantages, their integration with scaled CMOS is limited, which leads to VGSOT-MRAM emerging as a promising alternative due to its good balance between energy efficiency and superior integration capabilities. The VGSOT-MRAM crossbar arrays provide 12×,49×, and 346× more energy efficiency than PCM, ReRAM, and STT-MRAM ones, respectively. It translates, on average for the considered workload, in 1.5×,3×, and 14.5× better energy efficiency of the entire system., This project has received funding from the ECSEL Joint Undertaking (JU) under grant agreement No 876925, from MCIN/AEI/10.13039/501100011033 (grants PID2019-105660RB-C21 and PID2019-107255GB-C22), and from Aragon Government (T58_23R research group)., Peer Reviewed, Postprint (author's final draft)
- Published
- 2023
50. Adjacent LSTM-Based Page Scheduling for Hybrid DRAM/NVM Memory Systems
- Author
-
Manolis Katsaragakis and Konstantinos Stavrakakis and Dimosthenis Masouros and Lazaros Papadopoulos and Dimitrios Soudris, Katsaragakis, Manolis, Stavrakakis, Konstantinos, Masouros, Dimosthenis, Papadopoulos, Lazaros, Soudris, Dimitrios, Manolis Katsaragakis and Konstantinos Stavrakakis and Dimosthenis Masouros and Lazaros Papadopoulos and Dimitrios Soudris, Katsaragakis, Manolis, Stavrakakis, Konstantinos, Masouros, Dimosthenis, Papadopoulos, Lazaros, and Soudris, Dimitrios
- Abstract
Recent advances in memory technologies have led to the rapid growth of hybrid systems that combine traditional DRAM and Non Volatile Memory (NVM) technologies, as the latter provide lower cost per byte, low leakage power and larger capacities than DRAM, while they can guarantee comparable access latency. Such kind of heterogeneous memory systems impose new challenges in terms of page placement and migration among the alternative technologies of the heterogeneous memory system. In this paper, we present a novel approach for efficient page placement on heterogeneous DRAM/NVM systems. We design an adjacent LSTM-based approach for page placement, which strongly relies on page accesses prediction, while sharing knowledge among pages with behavioral similarity. The proposed approach leads up to 65.5% optimized performance compared to existing approaches, while achieving near-optimal results and saving 20.2% energy consumption on average. Moreover, we propose a new page replacement policy, namely clustered-LRU, achieving up to 8.1% optimized performance, compared to the default Least Recently Used (LRU) policy.
- Published
- 2023
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.