Author: "Kanghee Kim" / Topic: computer - Searchworks@Jio Institute Digital Library Search Results

1. Implementation of Hardware Data Prefetcher Adaptable for Various State-of-the-Art Workload

Author: KyungHwan Song, DongSung Yoon, KangHee Kim, SangBang Choi, and TaeShin Park
Subjects: 010302 applied physics, Computer science, 0103 physical sciences, 0202 electrical engineering, electronic engineering, information engineering, Operating system, Workload, 02 engineering and technology, State (computer science), computer.software_genre, 01 natural sciences, computer, 020202 computer hardware & architecture
Published: 2016

2. UltraShare: FPGA-based Dynamic Accelerator Sharing and Allocation

Author: Siavash Rezaei, Kanghee Kim, and Eli Bozorgzadeh
Subjects: FOS: Computer and information sciences, Scheme (programming language), SIMPLE (military communications protocol), business.industry, Computer science, Controller (computing), 020206 networking & telecommunications, 02 engineering and technology, 020202 computer hardware & architecture, Software, Computer architecture, Hardware Architecture (cs.AR), Scalability, 0202 electrical engineering, electronic engineering, information engineering, business, Field-programmable gate array, Computer Science - Hardware Architecture, Throughput (business), computer, computer.programming_language, Data transmission
Abstract: Despite all the available commercial and open-source frameworks to ease deploying FPGAs in accelerating applications, the current schemes fail to support sharing multiple accelerators among various applications. There are three main features that an accelerator sharing scheme requires to support: exploiting dynamic parallelism of multiple accelerators for a single application, sharing accelerators among multiple applications, and providing a non-blocking congestion-free environment for applications to invoke the accelerators. In this paper, we developed a scalable fully functional hardware controller, called UltraShare, with a supporting software stack that provides a dynamic accelerator sharing scheme through an accelerators grouping mechanism. UltraShare allows software applications to fully utilize FPGA accelerators in a non-blocking congestion-free environment. Our experimental results for a simple scenario of a combination of three streaming accelerators invocation show an improvement of up to 8x in throughput of the accelerators by removing accelerators idle times.
Published: 2019

3. A Survey Study of Life Log and a Proposal of Record-Prevention Method

Author: Kanghee Kim, Jaegeol Yim, and Ilkwon Son
Subjects: Database, Computer science, business.industry, Control (management), Cloud computing, 02 engineering and technology, Function (mathematics), computer.software_genre, Identification (information), Anesthesiology and Pain Medicine, Order (business), Component (UML), 0202 electrical engineering, electronic engineering, information engineering, Code (cryptography), 020201 artificial intelligence & image processing, business, Mobile device, computer
Abstract: The Life Log records computer use, details of mobile device use and sensor values collected from user's activities. Data recorded in life log is analyzed by the cloud computing and turned into useful information. Many researches to provide useful services to human beings making use of the useful information have been widely conducted. This paper surveys life log and proposes a record-preventing method and a device that is facilitated with the record-preventing function. The proposed system consists of the input component, the control component, and the output component. The input component reads data from the GPS receiver, the RFID (Radio-Frequency Identification) reader, the QR code (Quick Response Code) reader, a camera, and sensors. The control component compares the input data with the data stored in the database in order to determine whether the user is within a forbidden area. The control component stops life-logging if the user is within a forbidden area.
Published: 2016

4. Flash Operation Group Scheduling for Supporting QoS of SSD I/O Request Streams

Author: Eungyu Lee, Eyeehyun Nam, Joonwoo Lee, Kanghee Kim, and Sun Won
Subjects: Hardware_MEMORYSTRUCTURES, Differentiated services, Computer science, Quality of service, NVM Express, Controller (computing), Interface (computing), Bandwidth (computing), Operating system, Differentiated service, computer.software_genre, computer, PCI Express
Abstract: As SSDs are increasingly being used as high-performance storage or caches, attention is increasingly paid to the provision of SSDs with Quality-of-Service for I/O request streams of various applications in server systems. Since most SSDs are using the AHCI controller interface on a SATA bus, it is not possible to provide a differentiated service by distinguishing each I/O stream from others within the SSD. However, since a new SSD interface, the NVME controller interface on a PCI Express bus, has been proposed, it is now possible to recognize each I/O stream and schedule I/O requests within the SSD for differentiated services. This paper proposes Flash Operation Group Scheduling within NVME-based flash storage devices, and demonstrates through QEMU-based simulation that we can achieve a proportional bandwidth share for each I/O stream.
Published: 2015

5. Real-time motion control on Android platform

Author: Jeongnam Kang, Dohyeon Kim, Hyeongseok Kang, and Kanghee Kim
Subjects: Computer science, business.industry, 020208 electrical & electronic engineering, 02 engineering and technology, computer.software_genre, Motion control, 020202 computer hardware & architecture, Theoretical Computer Science, Hardware and Architecture, Embedded system, 0202 electrical engineering, electronic engineering, information engineering, Operating system, Robot, Android (operating system), business, computer, Software, Information Systems
Abstract: Recently, it is increasingly important to provide good real-time performance with the Android platform, since it has been used in industrial devices. The Android platform, however, does not provide a tight real-time guarantee that is required by such industrial devices as robots. Until a real-time extension of the Android platform becomes publicly available, application developers need a lightweight application-centric approach to achieve good real-time performance on the existing platform. This paper proposes an application-centric approach requiring no real-time extensions of the Android platform, which centers around multi-core partitioning and partition-aware application design. Following the proposed approach, we present an implementation study of motion control applications where the real-time tasks of the target application run on dedicated processor cores as a native Linux process while the non-real-time tasks run on other cores as an Android process. In the paper, we prove that the proposed approach is enough effective to deal with motion control applications on the existing Android platform with no real-time extensions. Our experiments show that on a quad-core Android board we can achieve such a good real-time performance as 99 % task activation jitters less than 5 µs for a motion control application with four real-time tasks and a period of 500 µs.
Published: 2015

6. Design and Implementation of an Android Application for Real-time Motion Control

Author: Hyeongseok Kang, Dohyeon Kim, Kanghee Kim, Eungyu Lee, and Jeongnam Kang
Subjects: Multi-core processor, business.industry, Computer science, Embedded system, Operating system, Android application, Android (operating system), Daemon, Fieldbus, computer.software_genre, Motion control, business, computer
Abstract: This paper addresses the design and implementation of an Android application for real-time precise motion control. To provide stable real-time performance, we implemented the application in two parts: Android service in the form of a daemon process, which periodically transfers a set of position commands for all motors through a real-time fieldbus, and Android UI application, which generates and delivers the set of position commands to the Android service. To support such a real-time motion control application, we use multi-core partitioning, which partitions the processor cores into a real-time partition to be used by the real-time motion control service and a non-real-time partition to be used by the Android application, and set up a shared buffer between them for communication. Our experiments show that we can obtain a motion control period of 2 ms with 99% task activation jitters less than for a configuration where each of the four threads controls two motors in a group.
Published: 2015

7. A Trace-based Precompile Method for Improving the Response Times of Android Applications

Author: Sunggil Hong and Kanghee Kim
Subjects: Computer science, business.industry, Legacy system, computer.file_format, computer.software_genre, Launch Time, Execution time, Virtual machine, Embedded system, Operating system, Executable, Software_PROGRAMMINGLANGUAGES, Android (operating system), business, computer
Abstract: Recently, to improve the user response times of Android applications, several studies have been proposed to combine the idea of Ahead-of-Time compilation into Dalvik virtual machine, which uses Just-in-Time compilation. The studies, however, require modifications of the Dalvik executables of target applications, thus are difficult to be adopted for legacy applications already deployed. This paper proposes a JITwP(JIT with Precompile) technique that precompiles hot traces at application launch time with no modification of the Dalvik executable. It improves the user response times of target applications by providing precompile hints prepared offline. Our experimental results demonstrate a 4% improvement in terms of execution time for the Web browser application.
Published: 2013

8. Design of Parallel Decimal Multiplier using Limited Range of Signed-Digit Number Encoding

Author: Wan-Oh Yoon, Kanghee Kim, In-Guk Hwang, and Sang-Bang Choi
Subjects: Partial product, Digit number, computer.software_genre, Binary Integer Decimal, Operand, Decimal, CMOS, Multiplier (economics), Compiler, Hardware_ARITHMETICANDLOGICSTRUCTURES, Arithmetic, Algorithm, computer, Mathematics
Abstract: In this paper, parallel decimal fixed-point multiplier which uses the limited range of Singed-Digit number encoding and the reduction step is proposed. The partial products are generated without carry propagation delay by encoding a multiplicand and a multiplier to the limited range of SD number. With the limited range of SD number, the proposed multiplier can improve the partial product reduction step by increasing the number of possible operands for multi-operand SD addition. In order to estimate the proposed parallel decimal multiplier, synthesis is implemented using Design Compiler with SMIC 180nm CMOS technology library. Synthesis results show that the delay of proposed parallel decimal multiplier is reduced by 4.3% and the area by 5.3%, compared to the existing SD parallel decimal multiplier. Despite of the slightly increased delay and area of partial product generation step, the total delay and area are reduced since the partial product reduction step takes the most proportion.
Published: 2013

9. RT-PLRU: A New Paging Scheme for Real--Time Execution of Program Codes on NAND Flash Memory for Portable Media Players

Author: Jong-Chan Kim, Chang-Gun Lee, Kanghee Kim, and Duhee Lee
Subjects: Scheme (programming language), Random access memory, Hardware_MEMORYSTRUCTURES, Flash memory emulator, business.industry, Computer science, Nand flash memory, Process (computing), computer.software_genre, Theoretical Computer Science, Memory management, Computational Theory and Mathematics, Hardware and Architecture, Virtual memory, Operating system, Paging, business, computer, Software, Computer hardware, Computer memory, Flash file system, computer.programming_language
Abstract: NAND flash memory has been widely used as a nonvolatile storage for storing data. However, it is challenging to execute program codes on NAND flash memory, since NAND flash memory only supports page-based reads, not byte-level random reads. This paper proposes an automated process to find the optimal paging strategy called RT-PLRU (Real-Time constrained combination of Pinning and LRU) that allows program codes stored in NAND flash memory to be executed satisfying real-time requirements with minimal usage of RAM. Moreover, the proposed process optimally configure the RT-PLRU in a developer-transparent way without giving any burden to the program developer. The developed technique is specifically applied to a media player program targeting a portable media player (PMP). To the best of our knowledge, this is the first effort to use NAND flash memory as a code storage for storing and executing real-time programs with minimal usage of RAM.
Published: 2011

10. Computer aided design of fault-tolerant application specific programmable processors

Author: Kanghee Kim, Ramesh Karri, and Miodrag Potkonjak
Subjects: Computer science, business.industry, Fault tolerance, computer.software_genre, Theoretical Computer Science, Scheduling (computing), Computational Theory and Mathematics, Application-specific integrated circuit, Hardware and Architecture, Software fault tolerance, Embedded system, Computer Aided Design, business, computer, Software, Digital signal processing
Abstract: Application Specific Programmable Processors (ASPP) provide efficient implementation for any of m specified functionalities. Due to their flexibility and convenient performance-cost trade-offs, ASPPs are being developed by DSP, video, multimedia, and embedded lC manufacturers. In this paper, we present two low-cost approaches to graceful degradation-based permanent fault tolerance of ASPPs. ASPP fault tolerance constraints are incorporated during scheduling, allocation, and assignment phases of behavioral synthesis: Graceful degradation is supported by implementing multiple schedules of the ASPP applications, each with a different throughput constraint. In this paper, we do not consider concurrent error detection. The first ASPP fault tolerance technique minimizes the hardware resources while guaranteeing that the ASPP remains operational in the presence of all k-unit faults. On the other hand, the second fault tolerance technique maximizes the ASPP fault tolerance subject to constraints on the hardware resources. These ASPP fault tolerance techniques impose several unique tasks, such as fault-tolerant scheduling, hardware allocation, and application-to-faulty-unit assignment. We address each of them and demonstrate the effectiveness of the overall approach, the synthesis algorithms, and software implementations on a number of industrial-strength designs.
Published: 2000

11. A wireless multimedia LAN architecture using DCF with shortened contention window for QoS provisioning

Author: Kanghee Kim, Kiseon Kim, and Aftab Ahmad
Subjects: Service quality, Multimedia, business.industry, Computer science, Quality of service, ComputerSystemsOrganization_COMPUTER-COMMUNICATIONNETWORKS, Local area network, Wireless Multimedia Extensions, Throughput, Access control, Network allocation vector, Distributed coordination function, computer.software_genre, Computer Science Applications, Point coordination function, Traffic classification, Modeling and Simulation, Wireless lan, Wireless, The Internet, IEEE 802.11e-2005, Electrical and Electronic Engineering, business, computer, Computer network
Abstract: In this paper, we propose a MAC architecture for IEEE 802.11-like WLANs supporting multimedia services. The main characteristics of the proposed MAC procedure are the period restriction and DCF/SC (distributed coordination function with shortened contention-windows) instead of the PCF (point coordination function) of the IEEE 802.11 standards. We investigate the performance of the proposed MAC mechanism in terms of utilization, throughput, latency and jitter under two multimedia network environments considered by the IEEE 802.11-task group E. These environments are home and enterprise. In the simulated performance, this novel approach, which preserves the traffic classification but increases the channel utilization, thus guarantees QoS (quality of service) of multimedia traffic.
Published: 2003

12. An implementation study of a ghost drive

Author: Eungyu Lee, Joonwoo Lee, Sung Y. Shin, Sung-Ryul Kim, Kanghee Kim, and Hyeongseok Kang
Subjects: business.industry, Computer science, Loadable kernel module, computer.software_genre, Virtual file system, Mandatory access control, Upgrade, Software, Installation, Operating system, business, SSH File Transfer Protocol, Mobile device, computer
Abstract: Recently, it becomes increasingly important to secure user private data in mobile devices. To protect user private data, one possible approach is to implement a secure file storage in the mobile devices based on mandatory access control (MAC), but the device manufacturers seldom implement it because of high pressure of time-to-market, frequent version upgrade of the operating system, and no unanimous agreement in the MAC standard software. In this paper, we propose an implementation study of a secure file storage, called a ghost drive, which can facilitate the implementation of MAC in the mobile devices by unburdening the manufacturers from aggressive instrumentation of the whole operating system. Since our implementation is in form of a loadable kernel module, separated from the main kernel, it can be deployed even to commercial mobile devices already in use by installing it over the air. Our experiments show that the performance of our secure storage implementation is not worse than the original unmodified implementation.
Published: 2012

13. An efficient checkpoint scheme for the fast mount of flash file system

Author: Joongjin Kook, Kanghee Kim, Sanghoon Choi, and Jiman Hong
Subjects: File system, Hardware_MEMORYSTRUCTURES, Flash memory emulator, business.industry, Computer science, computer.software_genre, Mount, Flash memory, Power (physics), Embedded system, Data_FILES, State (computer science), business, Mobile device, computer, Computer hardware, Flash file system
Abstract: The mounting time of file system increases in embedded systems, as the capacity of flash memory keeps increasing rapidly. There is an unexpected power failure or an abnormal shutdown due to the power of mobile devices is supplied by batteries. Previous recovery schemes of the flash file system have limitation to reduce the mounting time in large flash memory and this resources constraint environment. Our goal is to reduce the reconstruction delay in flash file system. We present a state transition rate diagram of checkpoint to obtain the checkpoint interval and compute the interval of a checkpoint from the analysis of the diagram to reduce the scan time.
Published: 2011

14. The performance analysis of ARM NEON technology for mobile platforms

Author: Minwoo Jang, Kanghee Kim, and Kukhyun Kim
Subjects: business.industry, Computer science, chemistry.chemical_element, computer.software_genre, Power (physics), ARM architecture, Neon, chemistry, Feature (computer vision), Embedded system, Compiler, SIMD, Graphics, business, computer, Arm neon, Computer hardware
Abstract: Mobile platforms have compute-bound applications, which will be executed on top of ARM processors in many cases. For such applications, the use of the ARM NEON technology should be considered since it can significantly increase the computing power of the underlying processor for vector operations. The ARM NEON technology is a media processing architecture based on SIMD (Single Instruction Multiple Data) that adds instructions targeted primarily for audio, video, and 3-D graphics processing. In this paper, we evaluate the ARM NEON technology in ARM Cortex-A8 processor for several open-source applications. To do this, we use the auto-vectorization feature of the ARM GCC compiler, which is implemented with the NEON technology. The evaluation criteria include program code size, execution time and power consumption, which are crucial in most embedded systems.
Published: 2011

15. Using NAND flash memory for executing large volume real-time programs in automotive embedded systems

Author: Kanghee Kim, Kwangyoon Cho, Chang-Gun Lee, and Kyoung-Soo We
Subjects: Hardware_MEMORYSTRUCTURES, Flash memory emulator, business.industry, Computer science, Nand flash memory, computer.software_genre, Partition (database), Non-volatile memory, Embedded system, Convex optimization, Operating system, business, computer, Computer memory, Flash file system, Random access
Abstract: For advanced features of next generation vehicles, the real-time programs in automotive embedded systems are dramatically increasing. For such large volume program codes, this paper proposes a novel framework to use high-density and low-cost nonvolatile memory, i.e., NAND flash memory, as a low-cost mean of storing and executing hard real-time programs. Regarding this, one challenge is that NAND flash memory allows only 2KB page-based read operations not per-byte random access, which requires RAM as working storage for code executions. In order to minimize the expensive RAM requirements, the proposed framework optimally partitions the RAM for multiple hard real-time tasks and optimally determines the pinning/LRU combination for each RAM partition such that all task deadlines are deterministically guaranteed. The proposed framework is verified with the actual real-time programs for unmanned autonomous driving. To the best of our knowledge, this is the first work that allows us to use NAND flash memory for hard real-time program executions with the minimal usage of RAM.
Published: 2010

16. Real-Time Program Execution on NAND Flash Memory for Portable Media Players

Author: Eun Yong Ha, Kanghee Kim, Duhee Lee, Jong-Chan Kim, and Chang-Gun Lee
Subjects: Hardware_MEMORYSTRUCTURES, business.industry, Computer science, computer.software_genre, Flash memory, Memory management, Embedded system, Universal memory, Virtual memory, Operating system, Paging, Static random-access memory, business, computer, Real-time operating system, Flash file system
Abstract: NAND flash memory has been widely used as a non-volatile storage for storing data. However, it requires a large amount of SRAM for executing program codes stored in it since it only supports page-based reads, not byte-level random reads. This paper proposes a new paging mechanism called RT-PLRU (real-time constrained combination of pinning and LRU) that allows program codes stored in NAND flash memory to be executed satisfying real-time requirements with minimal usage of SRAM. Moreover, the RT-PLRU is optimally configured in a developer-transparent way without giving any burden to the program developer. The developed technique is specifically applied to a media player program targeting a PMP (portable medial player). To the best of our knowledge, this is the first effort to use NAND flash memory as a code storage for storing and executing real-time programs with minimal usage of SRAM.
Published: 2008

17. Transparent and Selective Real-Time Interrupt Services for Performance Improvement

Author: Joonwon Lee, Yung-Joon Jung, Euiseong Seo, Jinkyu Jeong, Donghwan Kim, Dongsung Kim, Jin-Soo Kim, and Kanghee Kim
Subjects: business.industry, Computer science, computer.software_genre, Embedded operating system, Scheduling (computing), Kernel (image processing), Embedded system, Operating system, Overall performance, Performance improvement, Interrupt, business, Real-time operating system, computer
Abstract: The popularity of mobile and multimedia applications made real-time support a mandatory feature for embedded operating systems. However, the current situation is that the overall performance is significantly degraded due to the real-time support. This paper suggests a novel scheme to minimize the performance degradation in embedded operating systems with real-time support. Especially, we propose transparent and selective real-time interrupt services which transparently monitor the system and postpone interrupt handling that are not relevant to real-time tasks. The proposed scheme was implemented on the Linux 2.6 kernel and the experimental results show that our scheme improves the throughput by up to 86% for Hackbench benchmark while providing almost the same scheduling latency compared to the previous work.
Published: 2007

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

17 results on '"Kanghee Kim"'

1. Implementation of Hardware Data Prefetcher Adaptable for Various State-of-the-Art Workload

2. UltraShare: FPGA-based Dynamic Accelerator Sharing and Allocation

3. A Survey Study of Life Log and a Proposal of Record-Prevention Method

4. Flash Operation Group Scheduling for Supporting QoS of SSD I/O Request Streams

5. Real-time motion control on Android platform

6. Design and Implementation of an Android Application for Real-time Motion Control

7. A Trace-based Precompile Method for Improving the Response Times of Android Applications

8. Design of Parallel Decimal Multiplier using Limited Range of Signed-Digit Number Encoding

9. RT-PLRU: A New Paging Scheme for Real--Time Execution of Program Codes on NAND Flash Memory for Portable Media Players

10. Computer aided design of fault-tolerant application specific programmable processors

11. A wireless multimedia LAN architecture using DCF with shortened contention window for QoS provisioning

12. An implementation study of a ghost drive

13. An efficient checkpoint scheme for the fast mount of flash file system

14. The performance analysis of ARM NEON technology for mobile platforms

15. Using NAND flash memory for executing large volume real-time programs in automotive embedded systems

16. Real-Time Program Execution on NAND Flash Memory for Portable Media Players

17. Transparent and Selective Real-Time Interrupt Services for Performance Improvement

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

17 results on '"Kanghee Kim"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources