21 results on '"Reconfigurable hardware"'
Search Results
2. ENREM: An efficient NFA-based regular expression matching engine on reconfigurable hardware for NIDS
- Author
-
Hieu, Tran Trung, Thinh, Tran Ngoc, and Tomiyama, Shigenori
- Published
- 2013
- Full Text
- View/download PDF
3. Open-architecture system based on a reconfigurable hardware–software multi-agent platform for CNC machines
- Author
-
Morales-Velazquez, Luis, Romero-Troncoso, Rene de Jesus, Osornio-Rios, Roque Alfredo, Herrera-Ruiz, Gilberto, and Cabal-Yepez, Eduardo
- Published
- 2010
- Full Text
- View/download PDF
4. Open-architecture system based on a reconfigurable hardware-software multi-agent platform for CNC machines
- Subjects
Programmable logic array ,Machinery ,Magneto-electric machines ,Control systems ,Digital integrated circuits - Abstract
To link to full-text access for this article, visit this link: http://dx.doi.org/10.1016/j.sysarc.2010.04.009 Byline: Luis Morales-Velazquez (a), Rene de Jesus Romero-Troncoso (a)(b), Roque Alfredo Osornio-Rios (a), Gilberto Herrera-Ruiz (a), Eduardo Cabal-Yepez (b) Keywords: Open-architecture; Multi-agent platform; Hardware-software co-design; FPGA; Intelligent manufacturing systems Abstract: New generation of manufacturing systems endows their intelligence and reconfigurability to the computerized numerical controller (CNC) machines. This paper presents an open-architecture platform based on multi-agent hardware-software units, by developing a novel Multi-Agent Distributed CONtroller (MADCON) system. This system intends to fulfill the requirements of reconfigurability for the next generation of intelligent machines. The design of intelligent drives for this system follows a hardware-software co-design approach using a simple and intuitive structure. The hardware units of the proposed system integrate control and monitoring functions providing an FPGA-based open architecture for reconfigurable applications. On the other hand, software components were developed utilizing the XML structure for system description files, gathering features like a flowchart descriptive language and a graphic user-interface. MADCON was applied to a retrofitted to CNC lathe for control and monitoring in order to validate the proposed architecture towards the development of new generation intelligent manufacturing systems. Author Affiliation: (a) Facultad de Ingenieria, Universidad Autonoma de Queretaro, Cerro de las Campanas s/n, 76010 Queretaro, Qro., Mexico (b) Division de Ingenieria - Salamanca, Universidad de Guanajuato, Carr. Salamanca - Valle de Santiago km 3.5+1.8 Com. de Palo Blanco, 36700 Salamanca, Gto., Mexico Article History: Received 26 September 2009; Revised 23 March 2010; Accepted 29 April 2010
- Published
- 2010
5. Static scheduling techniques for dependent tasks on dynamically reconfigurable devices
- Author
-
Jari Nurmi, Juha-Pekka Soininen, and Yang Qu
- Subjects
task scheduling ,run-time reconfiguration ,reconfiguration ,Computer science ,dynamic reconfiguration ,Distributed computing ,Control reconfiguration ,algorithms ,Multiplexing ,Reconfigurable computing ,Scheduling (computing) ,Random search ,Hardware and Architecture ,Constraint programming ,dynamically reconfigurable hardware ,genetic algorithm ,Time domain ,dynamic reconfigurable systems ,Software ,Reusability - Abstract
Dynamically reconfigurable hardware not only has high silicon reusability, but it can also deliver high performance for computation-intensive tasks. Advanced features such as run-time reconfiguration allow multiple tasks to be mapped onto the same device either simultaneously or multiplexed in time domain. These tasks need to be scheduled optimally or near optimally in order to efficiently utilize the device. It is a NP-hard problem, because task scheduling, allocation and configuration prefetching all need to be considered. In this paper, we target dependent task models and propose three static schedulers that use different problem solving strategies. The first is a heuristic approach developed from traditional list-based schedulers. It presents high efficiency but the least accuracy. The second is based on a full-domain search using constraint programming. It can guarantee to produce optimal solutions but requires significant searching effort. The last is a guided random search technique based on a genetic algorithm, which shows reasonable efficiency and much better accuracy than the heuristic approach.
- Published
- 2007
- Full Text
- View/download PDF
6. A Survey and Taxonomy of FPGA-based Deep Learning Accelerators.
- Author
-
Blaiech, Ahmed Ghazi, Ben Khalifa, Khaled, Valderrama, Carlos, Fernandes, Marcelo A.C., and Bedoui, Mohamed Hedi
- Subjects
- *
DEEP learning , *ARTIFICIAL neural networks , *GATE array circuits , *MACHINE learning , *ADAPTIVE computing systems , *COMPUTER performance , *PARALLEL processing - Abstract
• This paper is original given that serve as directive for researchers in the area of FPGA-based deep learning accelerators. • This paper analyzes the characteristics of existing architectures to finally propose the better development strategies. • The literature used in this paper is very recent (many references are in 2018). Deep learning, the fastest growing segment of Artificial Neural Network (ANN), has led to the emergence of many machine learning applications and their implementation across multiple platforms such as CPUs, GPUs and reconfigurable hardware (Field-Programmable Gate Arrays or FPGAs). However, inspired by the structure and function of ANNs, large-scale deep learning topologies require a considerable amount of parallel processing, memory resources, high throughput and significant processing power. Consequently, in the context of real time hardware systems, it is crucial to find the right trade-off between performance, energy efficiency, fast development, and cost. Although limited in size and resources, several approaches have showed that FPGAs provide a good starting point for the development of future deep learning implementation architectures. Through this paper, we briefly review recent work related to the implementation of deep learning algorithms in FPGAs. We will analyze and compare the design requirements and features of existing topologies to finally propose development strategies and implementation architectures for better use of FPGA-based deep learning topologies. In this context, we will examine the frameworks used in these studies, which will allow testing a lot of topologies to finally arrive at the best implementation alternatives in terms of performance and energy efficiency. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
7. UML-based hardware/software co-design platform for dynamically partially reconfigurable network security systems
- Author
-
Huang, Chun-Hsian, Hsiung, Pao-Ann, and Shen, Jih-Sheng
- Subjects
- *
UNIFIED modeling language , *COMPUTER network security , *COMPUTER input-output equipment design & construction , *SOFTWARE architecture , *EMBEDDINGS (Mathematics) , *MATHEMATICAL models , *ESTIMATION theory , *SPACE exploration , *OUTER space - Abstract
Abstract: The dynamic partial reconfiguration technology of FPGA has made it possible to adapt system functionalities at run-time to changing environment conditions. However, this new dimension of dynamic hardware reconfigurability has rendered existing CAD tools and platforms incapable of efficiently exploring the design space. As a solution, we proposed a novel UML-based hardware/software co-design platform (UCoP) targeting at dynamically partially reconfigurable network security systems (DPRNSS). Computation-intensive network security functions, implemented as reconfigurable hardware functions, can be configured on-demand into a DPRNSS at run-time. Thus, UCoP not only supports dynamic adaptation to different environment conditions, but also increases hardware resource utilization. UCoP supports design space exploration for reconfigurable systems in three folds. Firstly, it provides reusable models of typical reconfigurable systems that can be customized according to user applications. Secondly, UCoP provides a partially reconfigurable hardware task template, using which users can focus on their hardware designs without going through the full partial reconfiguration flow. Thirdly, UCoP provides direct interactions between UML system models and real reconfigurable hardware modules, thus allowing accurate time measurements. Compared to the existing lower-bound and synthesis-based estimation methods, the accurate time measurements using UCoP at a high abstraction level can more efficiently reduce the system development efforts. [Copyright &y& Elsevier]
- Published
- 2010
- Full Text
- View/download PDF
8. Realization of wireless multimedia communication systems on reconfigurable platforms
- Author
-
Masselos, K., Pelkonen, A., Cupak, M., and Blionas, S.
- Subjects
- *
WIRELESS communications , *HARDWARE , *MULTIMEDIA communications , *PROTOTYPES - Abstract
Wireless multimedia communication systems become increasingly more computational intensive and demand for higher flexibility. The realization of these systems on reconfigurable hardware offers a good balance for these requirements. In this paper the suitability of commercially available reconfigurable hardware platforms for the target application domain is evaluated. Based on this evaluation a heterogeneous partly reconfigurable system-on-chip platform is identified as ideal implementation platform for the targeted systems. Systems from different target domains are analysed and different cases where the inclusion of reconfigurable hardware in their realizations would lead to improved quality in terms of implementation efficiency and flexibility are identified. Design methodology requirements for the realization of systems from the target application domain on the targeted platform are analysed and issues not covered by existing methodologies are identified. The principles of a methodology handling these open issues are described. Results from the prototyping of different systems are also presented and show the potentials of a reconfigurable hardware platform, which in the future will lead to reduced costs and increased flexibility of the wireless multimedia communication systems. [Copyright &y& Elsevier]
- Published
- 2003
- Full Text
- View/download PDF
9. VIDGCN: Embracing input data diversity with a configurable graph convolutional network accelerator.
- Author
-
Ming, Hao, Pan, Tingting, Chen, Dong, Ye, Chencheng, Liu, Haikun, Tang, Liting, Liao, Xiaofei, and Jin, Hai
- Subjects
- *
MATRIX multiplications , *GRAPH algorithms - Abstract
Hardware accelerated inference is a promising solution for exploiting graph convolutional networks (GCN) in latency-sensitive applications. Existing accelerators overlook an important barrier to widespread adoption: the input data (i.e., weighted graphs) of GCN inference diverge from scale and sparsity, causing the accelerators optimized for an array of graphs to lose efficiency on other graphs. This paper presents a reconfigurable GCN inference accelerator, VIDGCN, that switches between all possible GCN inference computation schemes to realize timely inference for all input graphs. VIDGCN incorporates an analytical performance model and a reconfigurable hardware design. The performance model allows users to find the optimal computation scheme for any given input graph. The hardware design reuses all the computation units under all computation schemes, and only distributes the data to the units in different ways. Evaluation on seven real-world graphs shows that VIDGCN outperforms state of the art, SGCNAX, by 1.79 × , and consistently yields the ideal amount of memory accesses. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
10. FPGA acceleration of semantic tree reasoning algorithms.
- Author
-
Barba, Jesús, Santofimia, María José, Dondo, Julio, Rincón, Fernando, Caba, Julián, and López, Juan Carlos
- Subjects
- *
FIELD programmable gate arrays , *SEMANTIC computing , *REASONING , *ALGORITHMS , *ARTIFICIAL intelligence , *DATA structures - Abstract
Semantic trees are a particular type of trees widely used in the representation of the concepts and their relations. Therefore, a computational model of the reality can be built and processed by Artificial Intelligence algorithms to infer knowledge, make decisions, etc. In this work, the design of a hardware component to accelerate reasoning operations on semantic trees by means of an FPGA based platform is presented. The target application is common-sense reasoning where marker-passing algorithms work on semantic tree structures; the core of the Scone Knowledge-Based system. On top of the functionality to be implemented, a strategy to deal with the implementation in reconfigurable hardware of dynamic and recursive data structures has been envisioned. Since lists, graphs or trees are the cornerstone in the modelling of computer friendly solutions for complex problems; this proposal contributes to reduce the breach between the software and silicon domains. As a result, an optimized micro-architecture of an FPGA accelerator for marker-passer algorithms integrated into a heterogeneous computing platform, and a smart data mapping procedure have been delivered. The design has been prototyped on a Xilinx ML507 board and compared to an equivalent software implementation, showing a significant reduction in execution times. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
11. A framework for post-silicon realization of arbitrary instruction extensions on reconfigurable data-paths.
- Author
-
Das, Saptarsi, Madhu, Kavitha, Krishna, Madhav, Sivanandan, Nalesh, Merchant, Farhad, Natarajan, Santhi, Biswas, Ipsita, Pulli, Adithya, Nandy, S. K., and Narayan, Ranjani
- Subjects
- *
MICROPROCESSORS , *INFORMATION storage & retrieval systems , *INTEGRATED circuits , *SILICON , *COMPUTATIONAL complexity , *ACQUISITION of data - Abstract
In this paper we present a framework for realizing arbitrary instruction set extensions (IE) that are identified post-silicon. The proposed framework has two components viz., an IE synthesis methodology and the architecture of a reconfigurable data-path for realization of the such IEs. The IE synthesis methodology ensures maximal utilization of resources on the reconfigurable data-path. In this context we present the techniques used to realize IEs for applications that demand high throughput or those that must process data streams. The reconfigurable hardware called HyperCell comprises a reconfigurable execution fabric. The fabric is a collection of interconnected compute units. A typical use case of HyperCell is where it acts as a co-processor with a host and accelerates execution of IEs that are defined post-silicon. We demonstrate the effectiveness of our approach by evaluating the performance of some well-known integer kernels that are realized as IEs on HyperCell. Our methodology for realizing IEs through HyperCells permits overlapping of potentially all memory transactions with computations. We show significant improvement in performance for streaming applications over general purpose processor based solutions, by fully pipelining the data-path. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
12. Hardware security platform for multicast communications.
- Author
-
Granado-Criado, José M., Vega-Rodríguez, Miguel A., Sánchez-Pérez, Juan M., and Gómez-Pulido, Juan A.
- Subjects
- *
COMPUTER input-output equipment , *COMPUTER security , *MULTICASTING (Computer networks) , *TELECOMMUNICATION , *DATA encryption , *ALGORITHMS - Abstract
Abstract: Secure multicast applications of multimedia contents, such as Internet TV, pay per view, satellite TV, etc., need to maintain a high number of keys. In these applications, a user contracts a group of channels or even specific content (films, sports, etc.) which do not have to coincide with the services contracted by other users, so different keys are needed to encrypt the contents. These keys must be recalculated, encrypted and redistributed when a user joins or unjoins a specific group in order to prevent users who do not belong to a group from being able to access the contents. Original algorithms generate only one group key for all users, so this key must be recalculated and resent when a user joins or unjoins in the user group. This is an important problem, because a group key could be changed even when one content is performing. This paper presents a high performance implementation of one of the most employed algorithms of group key maintenance, the LKH algorithm, using reconfigurable hardware and a very high and realistic number of users (8,388,609). The performance obtained by this study improves a lot other results found in the literature in terms of both performance and number of users. [Copyright &y& Elsevier]
- Published
- 2014
- Full Text
- View/download PDF
13. MobileFBP: Designing portable reconfigurable applications for heterogeneous systems.
- Author
-
Hung, Shih-Hao, Tzeng, Tien-Tzong, Wu, Jyun-De, Tsai, Min-Yu, Lu, Yi-Chih, Shieh, Jeng-Peng, Tu, Chia-Heng, and Ho, Wen-Jen
- Subjects
- *
SYSTEMS design , *ADAPTIVE computing systems , *PORTABLE computers , *ELECTRIC power , *EMBEDDED computer systems , *COMPUTER input-output equipment - Abstract
Abstract: Power-efficiency has been a key issue for today’s application and system design, ranging from embedded systems to data centers. While application-specific designs and optimizations may improve the power efficiency, it requires significant efforts to co-design the hardware and software, which are difficult to re-use. On the hardware front, the trend of heterogeneous computing enables custom designs for specific applications by integrating different types of processors and reconfigurable hardware to handle compute-intensive tasks. However, what is still missing is an elegant application framework, i.e., a programming environment and a runtime system, to develop portable applications which can offload tasks or be reconfigured dynamically to run on a variety of systems efficiently. Our ongoing work, MobileFBP, provides an application framework which aims to support heterogeneous and reconfigurable systems. Using the framework, the developers build portable applications with a dataflow programming paradigm, and the MobileFBP runtime system dynamically schedules the task components to run on available computing resources locally or remotely based on the application profiles. We hope that this ability produces high-level portable applications and reduces the efforts and skills needed for the developers to optimize their applications on a range of systems. This paper describes this work and presents our preliminary results. [Copyright &y& Elsevier]
- Published
- 2014
- Full Text
- View/download PDF
14. Model-based platform-specific co-design methodology for dynamically partially reconfigurable systems with hardware virtualization and preemption
- Author
-
Huang, Chun-Hsian, Hsiung, Pao-Ann, and Shen, Jih-Sheng
- Subjects
- *
MATHEMATICAL models , *SYSTEMS design , *COMPUTER input-output equipment , *ADAPTIVE computing systems , *COMPUTER software , *COMPUTER architecture , *AUTOMATION - Abstract
Abstract: To facilitate the development of the dynamically partially reconfigurable system (DPRS), we propose a model-based platform-specific co-design (MPC) methodology for DPRS with hardware virtualization and preemption. For DPRS analysis and validation, a model-based verification and estimation framework is proposed to make model-driven architecture (MDA) more realistic and applicable to the DPRS design. Considering inherent characteristics of DPRS and real-time system requirements, a semi-automatic model translator converts the UML models of DPRS into timed automata models with transition urgency semantics for model checking. Furthermore, a UML-based hardware/software co-design platform (UCoP) can support the direct interaction between the UML models and the real hardware architecture. Compared to the existing estimation methods, UCoP can provide accurate and efficient platform-specific verification and estimation. We also propose a hierarchical design that consists of a hardware virtualization mechanism for dynamically linking the device nodes, kernel modules, and on-demand reconfigurable hardware functions and a hardware preemption mechanism for further increasing the utilization of hardware resources per unit time. Further, we realize a dynamically partially reconfigurable network security system (DPRNSS) to show the applicability and practicability of the MPC methodology. The DPRNSS cannot only dynamically adapt some of its hardware functions at run-time to meet different system requirements, but also determine which mechanism will be used. Our experiments also demonstrate that the hardware virtualization mechanism can save the overall system execution time up to 12.8% and the hardware preemption mechanism can reduce up to 41.3% of the time required by reconfiguration-based methods. [Copyright &y& Elsevier]
- Published
- 2010
- Full Text
- View/download PDF
15. Genetic algorithms for hardware–software partitioning and optimal resource allocation
- Author
-
Purnaprajna, Madhura, Reformat, Marek, and Pedrycz, Witold
- Subjects
- *
GENETIC algorithms , *COMBINATORIAL optimization , *RESOURCE management , *SYSTEMS development - Abstract
Abstract: A scheme for time and power efficient embedded system design, using hardware and software components, is presented. Our objective is to reduce the execution time and the power consumed by the system, leading to the simultaneous multi-objective minimization of time and power. The goal of suitably partitioning the system into hardware and software components is achieved using Genetic Algorithms (GA). Multiple tests were conducted to confirm the consistency of the results obtained and the versatile nature of the objective functions. An enhanced resource constrained scheduling algorithm is used to determine the system performance. To emulate the characteristics of practical systems, the influence of inter-processor communication is examined. The suitability of introducing a reconfigurable hardware resource over pre-configured hardware is explored for the same objectives. The distinct difference in the task to resource mapping with the variation in design objective is studied. Further, the procedure to allocate optimal number of resources based on the design objective is proposed. The implementation is constrained for power and time individually, with GA being used to arrive at the resource count to suit the objective. The results obtained are compared by varying the time and power constraints. The test environment is developed using randomly generated task graphs. Exhaustive sets of tests are performed on the set design objectives to validate the proposed solution. [Copyright &y& Elsevier]
- Published
- 2007
- Full Text
- View/download PDF
16. A configurable system-on-chip architecture for embedded and real-time applications: concepts, design and realization
- Author
-
Wallner, Sebastian
- Subjects
- *
INTEGRATED circuits , *COMPUTER systems , *EMBEDDED computer systems , *SYSTEMS design - Abstract
Abstract: This paper presents a Configurable System-on-Chip (CSoC) architecture that includes programmable and reconfigurable hardware to cope with the flexibility and real-time signal processing demands in future telecommunication and multimedia systems. A programmable micro Task Controller (mTC) with a small instruction set and a novel pipelined configuration technique with descriptors as configuration templates allows a dynamic use of physical processing resources. The CSoC architecture provides a micro-task based programming model, approves a library-based design approach to reduce developing time and costs and allows forward compatibility to other architecture families. It is shown to be easy scalable to future VLSI technologies where over a hundred processing cells on a single chip will be feasible to deal with the inherent dynamics of future applications and system requirements. Several mappings of commonly used signal processing algorithms and implementation results are given for a standard cell ASIC design realization in 0.18μm 6-layer UMC CMOS technology. [Copyright &y& Elsevier]
- Published
- 2005
- Full Text
- View/download PDF
17. Performance of reconfigurable architectures for image-processing applications
- Author
-
Benitez, Domingo
- Subjects
- *
COMPUTER architecture , *COMPUTER interfaces , *COMPUTER systems , *MICROPROCESSORS , *ADAPTIVE computing systems - Abstract
Reconfigurable architectures combine a programmable–visible interface and the high-level aspects of a computer’s design. The goal of this work is to explore the architectural behaviour of remote reconfigurable systems that are part of general-purpose computers. Our approach analyses various issues arising from the connection of processors with FPGA-based microarchitecture to an existing commodity microprocessor via a standard bus. The quantitative evaluation considers image-processing applications and shows that the maximum performance depends on the amount of data processed by the reconfigurable hardware. Taking images with 256 × 256 pixels, a moderate FPGA capacity of 1E+5 logic blocks provides two orders of magnitude of performance improvement over a Pentium III processor for most of our benchmarks. However, the performance benefits exhibited by reconfigurable architectures may be deeply influenced by some design parameters. This paper studies the impact of hardware capacity, reconfiguration time, memory organisation, and bus bandwidth on the performance achieved by FPGA-based systems. Those image-processing benchmarks that can exhibit high-performance improvement would require about 150 memory banks of 256 bytes each and a bus bandwidth as high as 30 GB/s. This quantitative approach can be applied to the design of high-performance reconfigurable coprocessors for multimedia applications. [Copyright &y& Elsevier]
- Published
- 2003
- Full Text
- View/download PDF
18. IoT Device security through dynamic hardware isolation with cloud-Based update.
- Author
-
Hategekimana, Festus, Whitaker, Taylor JL, Hossain Pantho, Md Jubaer, and Bobda, Christophe
- Subjects
- *
DENIAL of service attacks , *FIELD programmable gate arrays , *HARDWARE , *DEFENSE in depth (Computer security) - Abstract
This work proposes a novel approach to provide comprehensive security to IoT devices. Our approach is based on a reconfigurable hardware-based isolation and protection mechanism (IPM) that operates as a dynamic separation unit between devices and network, far from potential software manipulation. The IPM analyses communications for malicious activities and prevents damage to the IoT device. The IPM leverages a central cloud-based authority to broaden the scope of traffic analysis beyond that of a singular IoT device. The central server evaluates logs from all IPM-protected IoT devices to improve their defense mechanisms and periodically upgrade device IPMs through a remote secure provisioning mechanism. The IPM achieves a 98.68% detection rate when evaluated against a Neptune DoS attack. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
19. Virtualization of reconfigurable coprocessors in HPRC systems with multicore architecture
- Author
-
Gonzalez, Ivan, Lopez-Buedo, Sergio, Sutter, Gustavo, Sanchez-Roman, Diego, Gomez-Arribas, Francisco J., and Aracil, Javier
- Subjects
- *
ADAPTIVE computing systems , *MULTICORE processors , *COMPUTER architecture , *COPROCESSORS , *COMPUTER input-output equipment , *COMPUTER software execution , *COMPUTER software , *CENTRAL processing units - Abstract
Abstract: HPRC (High-Performance Reconfigurable Computing) systems include multicore processors and reconfigurable devices acting as custom coprocessors. Due to economic constraints, the number of reconfigurable devices is usually smaller than the number of processor cores, thus preventing that a 1:1 mapping between cores and coprocessors could be achieved. This paper presents a solution to this problem, based on the virtualization of reconfigurable coprocessors. A Virtual Coprocessor Monitor (VCM) has been devised for the XtremeData XD2000i In-Socket Accelerator, and a thread-safe API is available for user applications to communicate with the VCM. Two reference applications, an IDEA cipher and an Euler CFD solver, have been implemented in order to validate the proposed architecture and execution model. Results show that the benefits arising from coprocessor virtualization outperform its overhead, specially when code has a significant software weight. [Copyright &y& Elsevier]
- Published
- 2012
- Full Text
- View/download PDF
20. Performance of reconfigurable architectures for image-processing applications
- Author
-
Domingo Benítez
- Subjects
Coprocessor ,Computer science ,business.industry ,Pentium ,Memory organisation ,Reconfigurable computing ,Microarchitecture ,law.invention ,Microprocessor ,Memory bank ,Computer architecture ,Hardware and Architecture ,law ,Embedded system ,business ,Field-programmable gate array ,Software - Abstract
Reconfigurable architectures combine a programmable-visible interface and the high-level aspects of a computer's design. The goal of this work is to explore the architectural behaviour of remote reconfigurable systems that are part of general-purpose computers. Our approach analyses various issues arising from the connection of processors with FPGA-based microarchitecture to an existing commodity microprocessor via a standard bus. The quantitative evaluation considers image-processing applications and shows that the maximum performance depends on the amount of data processed by the reconfigurable hardware. Taking images with 256 × 256 pixels, a moderate FPGA capacity of 1E+5 logic blocks provides two orders of magnitude of performance improvement over a Pentium III processor for most of our benchmarks. However, the performance benefits exhibited by reconfigurable architectures may be deeply influenced by some design parameters. This paper studies the impact of hardware capacity, reconfiguration time, memory organisation, and bus bandwidth on the performance achieved by FPGA-based systems. Those image-processing benchmarks that can exhibit high-performance improvement would require about 150 memory banks of 256 bytes each and a bus bandwidth as high as 30 GB/s. This quantitative approach can be applied to the design of high-performance reconfigurable coprocessors for multimedia applications.
- Published
- 2003
- Full Text
- View/download PDF
21. A high level FPGA-based abstract machine for image processing
- Author
-
P. Donachy, Danny Crookes, Khaled Benkrid, K. Alotaibi, and Ahmed Bouridane
- Subjects
business.industry ,Computer science ,Image processing ,Reconfigurable computing ,Abstract machine ,Instruction set ,Hardware and Architecture ,High-level programming language ,Embedded system ,Digital image processing ,Programming paradigm ,Field-programmable gate array ,business ,Software - Abstract
Image processing requires high computational power, plus the ability to experiment with algorithms. Recently, reconfigurable hardware devices in the form of field programmable gate arrays (FPGAs) have been proposed as a way of obtaining high performance at an economical price. At present, however, users must program FPGAs at a very low level and have a detailed knowledge of the architecture of the device being used. They do not therefore facilitate easy development of, or experimentation with, image processing algorithms. To try to reconcile the dual requirements of high performance and ease of development, this paper reports on the design and realisation of an FPGA based image processing machine and its associated high level programming model. This abstract programming model allows an application developer to concentrate on the image processing algorithm in hand rather than on its hardware implementation. The abstract machine is based on a PC host system with a PCI-bus add-on card containing Xilinx XC6200 series FPGA(s). The machine's high level instruction set is based on the operators of image algebra. XC6200 series FPGA configurations have been developed to implement each high level instruction.
- Published
- 1999
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.