13 results on '"Yury Shikunov"'
Search Results
2. Design Validation of Recurrent Signal Processor FPGA prototype
- Author
-
Georgy Orlov, Yury Shikunov, Dmitry Khilko, and Yury Stepchenkov
- Subjects
Digital signal processor ,Correctness ,Process (engineering) ,business.industry ,Computer science ,Data flow diagram ,Software ,Embedded system ,VHDL ,business ,computer ,Digital signal processing ,FPGA prototype ,computer.programming_language - Abstract
This paper describes the final stage of the FPGA prototype development of a recurrent signal processor. During the development of this prototype, a set of tools was created, based on which design verification was carried out. We describe the development process and the prototype validation methodology on a class of DSP tasks using a demo task of isolated word recognition. Taking the previously developed tools and methods for verifying software and hardware models, we have developed a specialized design validation tool. This solution made it possible to ensure the uniformity of the validation process for various types of architecture implementation and to establish the correctness of their operation.
- Published
- 2021
- Full Text
- View/download PDF
3. Self-Timed Storage Register Soft Error Tolerance Improvement
- Author
-
Yury Rogdestvenski, Yury Diachenko, Denis Diachenko, Yury Shikunov, and Yury Stepchenkov
- Subjects
C-element ,Pipeline transport ,Soft error ,business.industry ,Computer science ,Encoding (memory) ,Pipeline (computing) ,State (computer science) ,business ,Computer hardware ,Electronic circuit ,Coding (social sciences) - Abstract
The paper examines the self-timed (ST) pipeline register’s tolerance to soft errors in the pipeline stage’s combinational part and in itself. It aims to analyze the known storage register bit’s circuit cases and improve its soft error tolerance. The use of failure tolerant ST coding, which treats the anti-spacer state as a spacer, increases the ST pipeline’s failure tolerance level. Layout techniques of spacing the dual-rail signal component sources at a distance of more than 2 µm from each other reduce the number of failure types in the ST circuits. In particular, switching the dual-rail signal from a correct working state to an inverse one becomes unrealizable. Circuitry techniques, including cross-connections and local feedback, prevent the bit of the ST-pipeline register from sticking in the anti-spacer state and significantly increase the register’s insensitivity to single soft errors. The use of a DICE-like C-element with two in-phase outputs instead of a known circuitry solution ensures the immunity of the register bit for soft errors inside it. All proposed techniques improve the ST circuit’s soft error tolerance level from 76% to 95%.
- Published
- 2021
- Full Text
- View/download PDF
4. Improvement of Self-Timed Pipeline Immunity of Soft Errors
- Author
-
Yury Diachenko, Denis Diachenko, Yury Shikunov, Yury Stepchenkov, and Yury Rogdestvenski
- Subjects
Pipeline transport ,Digital electronics ,Computer science ,business.industry ,Pipeline (computing) ,Hardware_ARITHMETICANDLOGICSTRUCTURES ,business ,Hardware_REGISTER-TRANSFER-LEVELIMPLEMENTATION ,Reliability engineering - Abstract
The paper presents the results of a study of self-timed (ST) digital circuits' soft-error tolerance. Practical ST circuits have a pipeline structure. The combinational parts of the ST pipeline are naturally immune to 72% of short-term soft errors. The proposed circuitry and layout methods increase the ST pipeline combinational part's failure tolerance to 98% and higher. ST pipeline stage register is the most susceptible to soft errors. A typical variant of the ST pipeline register bit unit based on C-elements has a failure tolerance of 83%. The proposed register bit implementation cases increase the failure tolerance of the ST pipeline up to 98%.
- Published
- 2021
- Full Text
- View/download PDF
5. DSP Filter Kernels Preliminary Benchmarking for Recurrent Data-flow Architecture
- Author
-
Yury Stepchenkov, Dmitry Khilko, Georgy Orlov, and Yury Shikunov
- Subjects
Finite impulse response ,business.industry ,Computer science ,Filter (signal processing) ,Convolution ,Adaptive filter ,symbols.namesake ,Computer engineering ,symbols ,business ,Infinite impulse response ,Digital signal processing ,Dataflow architecture ,Von Neumann architecture - Abstract
The article discusses the results of a preliminary assessment of the data-flow recurrent architecture's performance on a subset of digital signal processing key tasks. Various implementations of vector convolution, FIR filters, IIR filters, adaptive filter, and 256-Point-In-Place FFT are evaluated. The implementation of the listed algorithms is based on the TMS320C55x DSP Library. The preliminary results showed that the performance of the recurrent architecture based on data-flow principles is not inferior to the TMS320C55x, based on von Neumann principles, in terms of the number of computation cycles. Architecture improvement suggestions are presented.
- Published
- 2021
- Full Text
- View/download PDF
6. Testing and optimization of Recurrent Signal Processor
- Author
-
Dmitry Khilko, Yury Stepchenkov, Georgy Orlov, and Yury Shikunov
- Subjects
0106 biological sciences ,Digital signal processor ,business.industry ,Computer science ,Pipeline (computing) ,Test-driven development ,010603 evolutionary biology ,01 natural sciences ,Data flow diagram ,010602 entomology ,Software ,Computer engineering ,Component (UML) ,Architecture ,business ,Digital signal processing - Abstract
This paper covers the optimization research for the novel data-flow computational architecture called Hybrid Architecture of Recurrent Signal Processor. The testing methodology, based on the shift towards Test-Driven Development of architecture models, is provided. We cover the toolset developed to unify the methodology for both software and hardware models. The testing results are analyzed, and the issues are formulated. We propose the pipeline extension by splitting our largest component down. We show the new pipeline utilization ratio this solution provides.
- Published
- 2020
- Full Text
- View/download PDF
7. Self-Timed Multiply-add-subtract Unit Alternates
- Author
-
Denis Diachenko, Yury Shikunov, Yury Stepchenkov, Yury Diachenko, and Yury Rogdestvenski
- Subjects
Adder ,Computer science ,Subtractor ,Subtraction ,Multiplier (economics) ,Booth's multiplication algorithm ,Hardware_ARITHMETICANDLOGICSTRUCTURES ,Arithmetic ,Operand ,IEEE floating point ,Coding (social sciences) - Abstract
Paper presents the results of a study of the self-timed fused multiply-add-subtract unit (FMAS) alternates. All FMAS alternates comply with the IEEE 754 standard and use the modified Booth algorithm to multiply two input 64-bit operands, followed by the addition and subtraction of the third operand. They differ from each other by internal signals self-timed coding: dual-rail, redundant ternary, or redundant quaternary code. The paper analyzes and compares their features, offers their optimized pipeline implementations and recommendations for their use. FMAS alternates have approximately the same performance but different hardware costs and layout sizes.
- Published
- 2020
- Full Text
- View/download PDF
8. Advanced Indication of the Self-Timed Circuits
- Author
-
Yury Stepchenkov, Yury Diachenko, Yury Rogdestvenski, Denis Diachenko, and Yury Shikunov
- Subjects
0106 biological sciences ,Basis (linear algebra) ,Computer science ,Transistor ,010603 evolutionary biology ,01 natural sciences ,law.invention ,010602 entomology ,CMOS ,law ,Factor (programming language) ,Hardware_INTEGRATEDCIRCUITS ,Control signal ,Electronic engineering ,Current (fluid) ,computer ,Hardware_LOGICDESIGN ,computer.programming_language ,Electronic circuit - Abstract
Paper discusses a problem of the CMOS self-timed circuits' indication. Large number of indicating signals in the multi-bit computational devices and registers requires an additional hardware and time for their combining and forming a single control signal that provides a request-acknowledge interaction between interconnected self-timed functional blocks. Indication subcircuit performs this. Multi-input hysteretic triggers allows for accelerating indication subcircuit by factor of 1.1 − 1.6 and reducing its complexity in several times in comparison to standard implementation basis on static and semi-static Muller's elements. A penalty for this is some short-circuit current in the worst case.
- Published
- 2019
- Full Text
- View/download PDF
9. Modeling and debugging tools development for recurrent architecture
- Author
-
Dmitry Khilko, Yury Stepchenkov, Yury Shikunov, and George Orlov
- Subjects
business.industry ,Modeling language ,Computer science ,media_common.quotation_subject ,Reconfigurability ,Domain (software engineering) ,Software ,Debugging ,Task analysis ,Architecture ,business ,Software engineering ,Digital signal processing ,media_common - Abstract
An unconventional multi-core recurrent data-flow architecture, that is being developed at Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences (FRS CSC RAS)was successfully tested on digital signal processing domain both at the model level and on a hardware sample. Based on the test results, several mechanisms had been identified that required improvement and a decision was made to investigate the architecture on other subject domains. Software and main architectural blocks debugging are carried out with the specially developed hardware and software modeling tools. The active extension and debugging of the architecture by using these tools revealed a number of shortcomings of the existing software. To eliminate these shortcomings, two problems have to be solved: to provide a high degree of reconfigurability of the architecture's imitational model (to debug its mechanisms)and implement a symbolic modeling mode (to debug its software). The redesigning results of modeling and debugging tools for recurrent data-flow architecture are discussed in the article.
- Published
- 2019
- Full Text
- View/download PDF
10. Energy Efficient Speed-Independent 64-bit Fused Multiply-Add Unit
- Author
-
Yury Rogdestvenski, Yury Shikunov, Yury Stepchenkov, Yury Diachenko, and Dmitry Stepchenkov
- Subjects
0106 biological sciences ,Adder ,Multiply–accumulate operation ,business.industry ,Computer science ,010603 evolutionary biology ,01 natural sciences ,IEEE floating point ,Pipeline transport ,010602 entomology ,Multiplier (economics) ,Booth's multiplication algorithm ,Hardware_ARITHMETICANDLOGICSTRUCTURES ,business ,Redundant code ,Computer hardware ,Efficient energy use - Abstract
The results of a Speed-Independent Fused Multiply-Add (SIFMA) unit pipeline implementation research are presented. SIFMA is compliant with IEEE 754 Standard. A criterion of the SIFMA pipeline's maximum performance is formulated. A method of reducing hardware costs of SIFMA multiplier by 1.5-2 times depending on its features is offered. The multiplier utilizes a modified Booth algorithm using self-timed redundant code. A new energy efficient self-timed organization of an input and output FIFO was developed. It provides less complexity versus a previous SIFMA implementation on base of semi-dense register.
- Published
- 2019
- Full Text
- View/download PDF
11. Hybrid Multi-Core Recurrent Architecture Approbation on FPGA
- Author
-
N.V. Morozov, Georgy Orlov, Yury Stepchenkov, Yury Shikunov, and Dmitry Khilko
- Subjects
0106 biological sciences ,Nios II ,Signal processing ,Multi-core processor ,business.industry ,Computer science ,010603 evolutionary biology ,01 natural sciences ,010602 entomology ,Word recognition ,Architecture ,Processing core ,business ,Field-programmable gate array ,Digital signal processing ,Computer hardware - Abstract
This paper provides approbation results of the multi-core hybrid architecture for recurrent signal processing (HARSP) as a hardware sample. The prototype has been designed primarily to check architecture's integrity and universality on digital signal processing domain and to verify the hardware implementation of its imitational model, while operational frequency has not been as relevant. Hardware sample has been implemented on FPGA basis with Cyclone V GT Development Kit. Every data-flow processor implements fixed-point 16-bit processing core while the control level is implemented via generated NIOS II processor. The isolated word recognition with a high confidence threshold (at least 95% with a noise level of 15 dB) has been selected as the subject area. We compare HARSP efficiency against specialized TI C66x DSP by implementing the subset of BTDImark2000 algorithms, using computational steps amount the comparison metric.
- Published
- 2019
- Full Text
- View/download PDF
12. Recurrent mechanism developments in the data-flow computer architecture
- Author
-
Dmitry Khilko, Yury Shikunov, and Yury Stepchenkov
- Subjects
Data flow diagram ,Multi-core processor ,symbols.namesake ,Computer architecture ,Computer science ,Feature (computer vision) ,Data redundancy ,symbols ,Overhead (computing) ,Operand ,Viterbi algorithm ,Dataflow architecture - Abstract
This paper covers non-conventional recurrent data-flow architecture, its features, and implementation aspects. Recurrence — the main feature of the new architecture efficiently solves data redundancy problem, typical for data-flow architectures while increasing performance. Conventional recurrence implementation has an overhead of configuration operand insertion that provides required functional fields (tags). Functional capabilities expansion of the architecture mechanism implementing this feature resulted in further efficiency by eliminating said overhead in some instances. We cover enhancements implemented in multicore recurrent data-flow architecture, designed to increase the versatility of recurrent computational process utilization. We compare Viterbi algorithm implementations with and without enhancements.
- Published
- 2018
- Full Text
- View/download PDF
13. Data redundancy problems in data-flow computing and solutions implemented on the recurrent architecture
- Author
-
Dmitry Shikunov, Dmitry Khilko, Yury Shikunov, and Yury Stepchenkov
- Subjects
Cooley–Tukey FFT algorithm ,Theoretical computer science ,Computer science ,010401 analytical chemistry ,04 agricultural and veterinary sciences ,01 natural sciences ,0104 chemical sciences ,Data modeling ,Data flow diagram ,Computer engineering ,Data redundancy ,040103 agronomy & agriculture ,Redundancy (engineering) ,0401 agriculture, forestry, and fisheries ,Algorithm design ,Architecture ,Data compression - Abstract
This paper covers one of the main disadvantages of data-flow computational model — high overhead costs associated with storing and processing large volumes of tag data. Overcoming this disadvantage using various ways of data compression comes with a number of problems described in this article. To solve given problems new recurrent data-flow computational model was created as well as architecture based on that model. This paper describes key features and mechanics of the new model and architecture allowing us to reduce data redundancy in memory storage. Efficiency of developed mechanics is shown in implementation of fast Fourier transform algorithm.
- Published
- 2017
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.