Author: "Fedorov, Igor" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Fedorov, Igor"' showing total 336 results

1. D$\epsilon$pS: Delayed $\epsilon$-Shrinking for Faster Once-For-All Training

Author: Annavajjala, Aditya, Khare, Alind, Agrawal, Animesh, Fedorov, Igor, Latapie, Hugo, Lee, Myungjin, and Tumanov, Alexey
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: CNNs are increasingly deployed across different hardware, dynamic environments, and low-power embedded devices. This has led to the design and training of CNN architectures with the goal of maximizing accuracy subject to such variable deployment constraints. As the number of deployment scenarios grows, there is a need to find scalable solutions to design and train specialized CNNs. Once-for-all training has emerged as a scalable approach that jointly co-trains many models (subnets) at once with a constant training cost and finds specialized CNNs later. The scalability is achieved by training the full model and simultaneously reducing it to smaller subnets that share model weights (weight-shared shrinking). However, existing once-for-all training approaches incur huge training costs reaching 1200 GPU hours. We argue this is because they either start the process of shrinking the full model too early or too late. Hence, we propose Delayed $\epsilon$-Shrinking (D$\epsilon$pS) that starts the process of shrinking the full model when it is partially trained (~50%) which leads to training cost improvement and better in-place knowledge distillation to smaller models. The proposed approach also consists of novel heuristics that dynamically adjust subnet learning rates incrementally (E), leading to improved weight-shared knowledge distillation from larger to smaller subnets as well. As a result, D$\epsilon$pS outperforms state-of-the-art once-for-all training techniques across different datasets including CIFAR10/100, ImageNet-100, and ImageNet-1k on accuracy and cost. It achieves 1.83% higher ImageNet-1k top1 accuracy or the same accuracy with 1.3x reduction in FLOPs and 2.5x drop in training cost (GPU*hrs), Comment: Accepted to the 18th European Conference on Computer Vision (ECCV 2024)
Published: 2024

2. SpinQuant: LLM quantization with learned rotations

Author: Liu, Zechun, Zhao, Changsheng, Fedorov, Igor, Soran, Bilge, Choudhary, Dhruv, Krishnamoorthi, Raghuraman, Chandra, Vikas, Tian, Yuandong, and Blankevoort, Tijmen
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition
Abstract: Post-training quantization (PTQ) techniques applied to weights, activations, and the KV cache greatly reduce memory usage, latency, and power consumption of Large Language Models (LLMs), but may lead to large quantization errors when outliers are present. Rotating activation or weight matrices helps remove outliers and benefits quantization. In this work, we identify a collection of applicable rotation parameterizations that lead to identical outputs in full-precision Transformer architectures while enhancing quantization accuracy. In addition, we find that some random rotations lead to much better quantization than others, with an up to 13 points difference in downstream zero-shot reasoning performance. As a result, we propose SpinQuant, a novel approach that incorporates learned rotation matrices for optimal quantized network accuracy. With 4-bit quantization of weight, activation, and KV-cache, SpinQuant narrows the accuracy gap on zero-shot reasoning tasks with full precision to merely 2.9 points on the LLaMA-2 7B model, surpassing LLM-QAT by 19.1 points and SmoothQuant by 25.0 points. Furthermore, SpinQuant also outperforms concurrent work QuaRot, which applies random rotations to remove outliers. In particular, for LLaMA-3 8B models that are hard to quantize, SpinQuant reduces the gap to full precision by up to 45.1% relative to QuaRot.
Published: 2024
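
Note on record 2: the abstract's claim that rotations "lead to identical outputs in full-precision" can be checked with a toy calculation. The sketch below is only an illustration under stated assumptions: it uses a fixed 2-D rotation by 45 degrees rather than a learned rotation, the values of `x` and `W` are made up, and the helper names `matmul` and `transpose` are hypothetical. It is not the SpinQuant implementation.

```python
import math

def matmul(a, b):
    # Plain nested-list matrix product: (n x k) times (k x m).
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(m):
    return [list(row) for row in zip(*m)]

# Fixed orthogonal rotation (assumption: 45 degrees, chosen for illustration).
theta = math.pi / 4
R = [[math.cos(theta), -math.sin(theta)],
     [math.sin(theta),  math.cos(theta)]]

x = [[8.0, 0.0]]                 # made-up activation with an outlier channel
W = [[0.5, -1.0], [2.0, 0.25]]   # made-up weight matrix

y_ref = matmul(x, W)             # original full-precision output
x_rot = matmul(x, R)             # rotated activation
W_rot = matmul(transpose(R), W)  # weights absorb the inverse rotation
y_rot = matmul(x_rot, W_rot)     # equals x W because R R^T = I
```

In this toy case the rotated activation spreads the outlier across both channels (its largest magnitude drops from 8 to about 5.66) while `y_rot` matches `y_ref` up to floating-point error, which is the property that makes the rotated tensors easier to quantize without changing the full-precision result.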

3. MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Author: Liu, Zechun, Zhao, Changsheng, Iandola, Forrest, Lai, Chen, Tian, Yuandong, Fedorov, Igor, Xiong, Yunyang, Chang, Ernie, Shi, Yangyang, Krishnamoorthi, Raghuraman, Lai, Liangzhen, and Chandra, Vikas
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: This paper addresses the growing need for efficient large language models (LLMs) on mobile devices, driven by increasing cloud costs and latency concerns. We focus on designing top-quality LLMs with fewer than a billion parameters, a practical choice for mobile deployment. Contrary to prevailing belief emphasizing the pivotal role of data and parameter quantity in determining model quality, our investigation underscores the significance of model architecture for sub-billion scale LLMs. Leveraging deep and thin architectures, coupled with embedding sharing and grouped-query attention mechanisms, we establish a strong baseline network denoted as MobileLLM, which attains a remarkable 2.7%/4.3% accuracy boost over preceding 125M/350M state-of-the-art models. Additionally, we propose an immediate block-wise weight-sharing approach with no increase in model size and only marginal latency overhead. The resultant models, denoted as MobileLLM-LS, demonstrate a further accuracy enhancement of 0.7%/0.8% than MobileLLM 125M/350M. Moreover, MobileLLM model family shows significant improvements compared to previous sub-billion models on chat benchmarks, and demonstrates close correctness to LLaMA-v2 7B in API calling tasks, highlighting the capability of small models for common on-device use cases., Comment: ICML 2024. Code is available at https://github.com/facebookresearch/MobileLLM
Published: 2024

4. SiGeo: Sub-One-Shot NAS via Information Theory and Geometry of Loss Landscape

Author: Zheng, Hua, Liu, Kuang-Hung, Fedorov, Igor, Zhang, Xin, Chen, Wen-Yen, and Wen, Wei
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Neural Architecture Search (NAS) has become a widely used tool for automating neural network design. While one-shot NAS methods have successfully reduced computational requirements, they often require extensive training. On the other hand, zero-shot NAS utilizes training-free proxies to evaluate a candidate architecture's test performance but has two limitations: (1) inability to use the information gained as a network improves with training and (2) unreliable performance, particularly in complex domains like RecSys, due to the multi-modal data inputs and complex architecture configurations. To synthesize the benefits of both methods, we introduce a "sub-one-shot" paradigm that serves as a bridge between zero-shot and one-shot NAS. In sub-one-shot NAS, the supernet is trained using only a small subset of the training data, a phase we refer to as "warm-up." Within this framework, we present SiGeo, a proxy founded on a novel theoretical framework that connects the supernet warm-up with the efficacy of the proxy. Extensive experiments have shown that SiGeo, with the benefit of warm-up, consistently outperforms state-of-the-art NAS proxies on various established NAS benchmarks. When a supernet is warmed up, it can achieve comparable performance to weight-sharing one-shot NAS methods, but with a significant reduction ($\sim 60$\%) in computational costs., Comment: 24 pages, 7 figures
Published: 2023

5. Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale

Author: Wen, Wei, Liu, Kuang-Hung, Fedorov, Igor, Zhang, Xin, Yin, Hang, Chu, Weiwei, Hassani, Kaveh, Sun, Mengying, Liu, Jiang, Wang, Xu, Jiang, Lin, Chen, Yuxin, Zhang, Buyun, Liu, Xi, Cheng, Dehua, Chen, Zhengxing, Zhao, Guang, Han, Fangqiu, Yang, Jiyan, Hao, Yuchen, Xiong, Liang, and Chen, Wen-Yen
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Information Retrieval
Abstract: Neural Architecture Search (NAS) has demonstrated its efficacy in computer vision and potential for ranking systems. However, prior work focused on academic problems, which are evaluated at small scale under well-controlled fixed baselines. In industry system, such as ranking system in Meta, it is unclear whether NAS algorithms from the literature can outperform production baselines because of: (1) scale - Meta ranking systems serve billions of users, (2) strong baselines - the baselines are production models optimized by hundreds to thousands of world-class engineers for years since the rise of deep learning, (3) dynamic baselines - engineers may have established new and stronger baselines during NAS search, and (4) efficiency - the search pipeline must yield results quickly in alignment with the productionization life cycle. In this paper, we present Rankitect, a NAS software framework for ranking systems at Meta. Rankitect seeks to build brand new architectures by composing low level building blocks from scratch. Rankitect implements and improves state-of-the-art (SOTA) NAS methods for comprehensive and fair comparison under the same search space, including sampling-based NAS, one-shot NAS, and Differentiable NAS (DNAS). We evaluate Rankitect by comparing to multiple production ranking models at Meta. We find that Rankitect can discover new models from scratch achieving competitive tradeoff between Normalized Entropy loss and FLOPs. When utilizing search space designed by engineers, Rankitect can generate better models than engineers, achieving positive offline evaluation and online A/B test at Meta scale., Comment: Wei Wen and Kuang-Hung Liu contribute equally
Published: 2023

6. DistDNAS: Search Efficient Feature Interactions within 2 Hours

Author: Zhang, Tunhou, Wen, Wei, Fedorov, Igor, Liu, Xi, Zhang, Buyun, Han, Fangqiu, Chen, Wen-Yen, Han, Yiping, Yan, Feng, Li, Hai, and Chen, Yiran
Subjects: Computer Science - Information Retrieval, Computer Science - Machine Learning
Abstract: Search efficiency and serving efficiency are two major axes in building feature interactions and expediting the model development process in recommender systems. On large-scale benchmarks, searching for the optimal feature interaction design requires extensive cost due to the sequential workflow on the large volume of data. In addition, fusing interactions of various sources, orders, and mathematical operations introduces potential conflicts and additional redundancy toward recommender models, leading to sub-optimal trade-offs in performance and serving cost. In this paper, we present DistDNAS as a neat solution to brew swift and efficient feature interaction design. DistDNAS proposes a supernet to incorporate interaction modules of varying orders and types as a search space. To optimize search efficiency, DistDNAS distributes the search and aggregates the choice of optimal interaction modules on varying data dates, achieving over 25x speed-up and reducing search cost from 2 days to 2 hours. To optimize serving efficiency, DistDNAS introduces a differentiable cost-aware loss to penalize the selection of redundant interaction modules, enhancing the efficiency of discovered feature interactions in serving. We extensively evaluate the best models crafted by DistDNAS on a 1TB Criteo Terabyte dataset. Experimental evaluations demonstrate 0.001 AUC improvement and 60% FLOPs saving over current state-of-the-art CTR models.
Published: 2023

7. PerfSAGE: Generalized Inference Performance Predictor for Arbitrary Deep Learning Models on Edge Devices

Author: Chai, Yuji, Tripathy, Devashree, Zhou, Chuteng, Gope, Dibakar, Fedorov, Igor, Matas, Ramon, Brooks, David, Wei, Gu-Yeon, and Whatmough, Paul
Subjects: Computer Science - Machine Learning, Computer Science - Performance
Abstract: The ability to accurately predict deep neural network (DNN) inference performance metrics, such as latency, power, and memory footprint, for an arbitrary DNN on a target hardware platform is essential to the design of DNN based models. This ability is critical for the (manual or automatic) design, optimization, and deployment of practical DNNs for a specific hardware deployment platform. Unfortunately, these metrics are slow to evaluate using simulators (where available) and typically require measurement on the target hardware. This work describes PerfSAGE, a novel graph neural network (GNN) that predicts inference latency, energy, and memory footprint on an arbitrary DNN TFlite graph (TFL, 2017). In contrast, previously published performance predictors can only predict latency and are restricted to pre-defined construction rules or search spaces. This paper also describes the EdgeDLPerf dataset of 134,912 DNNs randomly sampled from four task search spaces and annotated with inference performance metrics from three edge hardware platforms. Using this dataset, we train PerfSAGE and provide experimental results that demonstrate state-of-the-art prediction accuracy with a Mean Absolute Percentage Error of <5% across all targets and model search spaces. These results: (1) Outperform previous state-of-art GNN-based predictors (Dudziak et al., 2020), (2) Accurately predict performance on accelerators (a shortfall of non-GNN-based predictors (Zhang et al., 2021)), and (3) Demonstrate predictions on arbitrary input graphs without modifications to the feature extractor.
Published: 2023

8. Restructurable Activation Networks

Author: Bhardwaj, Kartikeya, Ward, James, Tung, Caleb, Gope, Dibakar, Meng, Lingchuan, Fedorov, Igor, Chalfin, Alex, Whatmough, Paul, and Loh, Danny
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: Is it possible to restructure the non-linear activation functions in a deep network to create hardware-efficient models? To address this question, we propose a new paradigm called Restructurable Activation Networks (RANs) that manipulate the amount of non-linearity in models to improve their hardware-awareness and efficiency. First, we propose RAN-explicit (RAN-e) -- a new hardware-aware search space and a semi-automatic search algorithm -- to replace inefficient blocks with hardware-aware blocks. Next, we propose a training-free model scaling method called RAN-implicit (RAN-i) where we theoretically prove the link between network topology and its expressivity in terms of number of non-linear units. We demonstrate that our networks achieve state-of-the-art results on ImageNet at different scales and for several types of hardware. For example, compared to EfficientNet-Lite-B0, RAN-e achieves a similar accuracy while improving Frames-Per-Second (FPS) by 1.5x on Arm micro-NPUs. On the other hand, RAN-i demonstrates up to 2x reduction in #MACs over ConvNexts with a similar or better accuracy. We also show that RAN-i achieves nearly 40% higher FPS than ConvNext on Arm-based datacenter CPUs. Finally, RAN-i based object detection networks achieve a similar or higher mAP and up to 33% higher FPS on datacenter CPUs compared to ConvNext based models. The code to train and evaluate RANs and the pretrained networks are available at https://github.com/ARM-software/ML-restructurable-activation-networks., Comment: This work was presented at an Arm AI virtual tech talk. Video is available at https://www.youtube.com/watch?v=EUqFNE28Kq4
Published: 2022

9. Magnitude-aware Probabilistic Speaker Embeddings

Author: Kuzmin, Nikita, Fedorov, Igor, and Sholokhov, Alexey
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Machine Learning, Computer Science - Sound
Abstract: Recently, hyperspherical embeddings have established themselves as a dominant technique for face and voice recognition. Specifically, Euclidean space vector embeddings are learned to encode person-specific information in their direction while ignoring the magnitude. However, recent studies have shown that the magnitudes of the embeddings extracted by deep neural networks may indicate the quality of the corresponding inputs. This paper explores the properties of the magnitudes of the embeddings related to quality assessment and out-of-distribution detection. We propose a new probabilistic speaker embedding extractor using the information encoded in the embedding magnitude and leverage it in the speaker verification pipeline. We also propose several quality-aware diarization methods and incorporate the magnitudes in those. Our results indicate significant improvements over magnitude-agnostic baselines both in speaker verification and diarization tasks., Comment: Accepted to Odyssey 2022: The Speaker and Language Recognition Workshop, camera-ready version
Published: 2022

10. UDC: Unified DNAS for Compressible TinyML Models

Author: Fedorov, Igor, Matas, Ramon, Tann, Hokchhay, Zhou, Chuteng, Mattina, Matthew, and Whatmough, Paul
Subjects: Computer Science - Machine Learning
Abstract: Deploying TinyML models on low-cost IoT hardware is very challenging, due to limited device memory capacity. Neural processing unit (NPU) hardware addresses the memory challenge by using model compression to exploit weight quantization and sparsity to fit more parameters in the same footprint. However, designing compressible neural networks (NNs) is challenging, as it expands the design space across which we must make balanced trade-offs. This paper demonstrates Unified DNAS for Compressible (UDC) NNs, which explores a large search space to generate state-of-the-art compressible NNs for NPU. ImageNet results show UDC networks are up to $3.35\times$ smaller (iso-accuracy) or 6.25% more accurate (iso-model size) than previous work.
Published: 2022

11. First principles study of structural, electronic and mechanical properties of crystalline glucose under pressure

Author: Fedorov, Igor A.
Published: 2024

12. MicroNets: Neural Network Architectures for Deploying TinyML Applications on Commodity Microcontrollers

Author: Banbury, Colby, Zhou, Chuteng, Fedorov, Igor, Navarro, Ramon Matas, Thakker, Urmish, Gope, Dibakar, Reddi, Vijay Janapa, Mattina, Matthew, and Whatmough, Paul N.
Subjects: Computer Science - Machine Learning
Abstract: Executing machine learning workloads locally on resource constrained microcontrollers (MCUs) promises to drastically expand the application space of IoT. However, so-called TinyML presents severe technical challenges, as deep neural network inference demands a large compute and memory budget. To address this challenge, neural architecture search (NAS) promises to help design accurate ML models that meet the tight MCU memory, latency and energy constraints. A key component of NAS algorithms is their latency/energy model, i.e., the mapping from a given neural network architecture to its inference latency/energy on an MCU. In this paper, we observe an intriguing property of NAS search spaces for MCU model design: on average, model latency varies linearly with model operation (op) count under a uniform prior over models in the search space. Exploiting this insight, we employ differentiable NAS (DNAS) to search for models with low memory usage and low op count, where op count is treated as a viable proxy to latency. Experimental results validate our methodology, yielding our MicroNet models, which we deploy on MCUs using Tensorflow Lite Micro, a standard open-source NN inference runtime widely used in the TinyML community. MicroNets demonstrate state-of-the-art results for all three TinyMLperf industry-standard benchmark tasks: visual wake words, audio keyword spotting, and anomaly detection. Models and training scripts can be found at github.com/ARM-software/ML-zoo., Comment: 10 pages, 8 figures, 3 tables
Published: 2020

13. MANGO: A Python Library for Parallel Hyperparameter Tuning

Author: Sandha, Sandeep Singh, Aggarwal, Mohit, Fedorov, Igor, and Srivastava, Mani
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Tuning hyperparameters for machine learning algorithms is a tedious task, one that is typically done manually. To enable automated hyperparameter tuning, recent works have started to use techniques based on Bayesian optimization. However, to practically enable automated tuning for large scale machine learning training pipelines, significant gaps remain in existing libraries, including lack of abstractions, fault tolerance, and flexibility to support scheduling on any distributed computing framework. To address these challenges, we present Mango, a Python library for parallel hyperparameter tuning. Mango enables the use of any distributed scheduling framework, implements intelligent parallel search strategies, and provides rich abstractions for defining complex hyperparameter search spaces that are compatible with scikit-learn. Mango is comparable in performance to Hyperopt, another widely used library. Mango is available open-source and is currently used in production at Arm Research to provide state-of-art hyperparameter tuning capabilities., Comment: 5 pages, 3 figures, ICASSP Conference
Published: 2020

14. TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids

Author: Fedorov, Igor, Stamenovic, Marko, Jensen, Carl, Yang, Li-Chia, Mandell, Ari, Gan, Yiming, Mattina, Matthew, and Whatmough, Paul N.
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Machine Learning, Computer Science - Sound, Statistics - Machine Learning
Abstract: Modern speech enhancement algorithms achieve remarkable noise suppression by means of large recurrent neural networks (RNNs). However, large RNNs limit practical deployment in hearing aid hardware (HW) form-factors, which are battery powered and run on resource-constrained microcontroller units (MCUs) with limited memory capacity and compute capability. In this work, we use model compression techniques to bridge this gap. We define the constraints imposed on the RNN by the HW and describe a method to satisfy them. Although model compression techniques are an active area of research, we are the first to demonstrate their efficacy for RNN speech enhancement, using pruning and integer quantization of weights/activations. We also demonstrate state update skipping, which reduces the computational load. Finally, we conduct a perceptual evaluation of the compressed models to verify audio quality on human raters. Results show a reduction in model size and operations of 11.9$\times$ and 2.9$\times$, respectively, over the baseline for compressed models, without a statistical difference in listening preference and only exhibiting a loss of 0.55dB SDR. Our model achieves a computational latency of 2.39ms, well within the 10ms target and 351$\times$ better than previous work., Comment: First four authors contributed equally. For audio samples, see https://github.com/BoseCorp/efficient-neural-speech-enhancement
Published: 2020

15. Pushing the limits of RNN Compression

Author: Thakker, Urmish, Fedorov, Igor, Beu, Jesse, Gope, Dibakar, Zhou, Chu, Dasika, Ganesh, and Mattina, Matthew
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Recurrent Neural Networks (RNN) can be difficult to deploy on resource constrained devices due to their size. As a result, there is a need for compression techniques that can significantly compress RNNs without negatively impacting task accuracy. This paper introduces a method to compress RNNs for resource constrained environments using Kronecker product (KP). KPs can compress RNN layers by 16-38x with minimal accuracy loss. We show that KP can beat the task accuracy achieved by other state-of-the-art compression techniques (pruning and low-rank matrix factorization) across 4 benchmarks spanning 3 different applications, while simultaneously improving inference run-time., Comment: 6 pages. arXiv admin note: substantial text overlap with arXiv:1906.02876
Published: 2019

16. Compressing RNNs for IoT devices by 15-38x using Kronecker Products

Author: Thakker, Urmish, Beu, Jesse, Gope, Dibakar, Zhou, Chu, Fedorov, Igor, Dasika, Ganesh, and Mattina, Matthew
Subjects: Computer Science - Machine Learning, Computer Science - Neural and Evolutionary Computing, Statistics - Machine Learning
Abstract: Recurrent Neural Networks (RNN) can be difficult to deploy on resource constrained devices due to their size. As a result, there is a need for compression techniques that can significantly compress RNNs without negatively impacting task accuracy. This paper introduces a method to compress RNNs for resource constrained environments using Kronecker product (KP). KPs can compress RNN layers by 15-38x with minimal accuracy loss. By quantizing the resulting models to 8-bits, we further push the compression factor to 50x. We show that KP can beat the task accuracy achieved by other state-of-the-art compression techniques across 5 benchmarks spanning 3 different applications, while simultaneously improving inference run-time. We show that the KP compression mechanism does introduce an accuracy loss, which can be mitigated by a proposed hybrid KP (HKP) approach. Our HKP algorithm provides fine-grained control over the compression ratio, enabling us to regain accuracy lost during compression by adding a small number of model parameters.
Published: 2019
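
Note on record 16: the idea of storing a weight matrix as a Kronecker product of two small factors can be sketched as follows. This is a minimal illustration, assuming tiny made-up factors `A` and `B` and a hypothetical helper `kron`. It only shows where the parameter saving comes from and does not reproduce the paper's layers, shapes, or its 15-38x figures.

```python
def kron(a, b):
    # Kronecker product of two nested-list matrices:
    # result[i*rows_b + k][j*cols_b + l] = a[i][j] * b[k][l]
    rows = []
    for a_row in a:
        for b_row in b:
            rows.append([x * y for x in a_row for y in b_row])
    return rows

# Two small factors (made-up values); only these would need to be stored.
A = [[1, 2],
     [3, 4]]
B = [[0, 1],
     [1, 0]]

W = kron(A, B)  # the full 4 x 4 matrix the factors stand in for

stored_params = len(A) * len(A[0]) + len(B) * len(B[0])  # 4 + 4 = 8
full_params = len(W) * len(W[0])                         # 4 * 4 = 16
compression = full_params / stored_params                # 2.0 in this toy case
```

The saving grows with matrix size: an (m1*m2) x (n1*n2) matrix needs only m1*n1 + m2*n2 stored values, so larger factors give much higher ratios than the 2x seen in this toy case.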

17. SpArSe: Sparse Architecture Search for CNNs on Resource-Constrained Microcontrollers

Author: Fedorov, Igor, Adams, Ryan P., Mattina, Matthew, and Whatmough, Paul N.
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: The vast majority of processors in the world are actually microcontroller units (MCUs), which find widespread use performing simple control tasks in applications ranging from automobiles to medical devices and office equipment. The Internet of Things (IoT) promises to inject machine learning into many of these every-day objects via tiny, cheap MCUs. However, these resource-impoverished hardware platforms severely limit the complexity of machine learning models that can be deployed. For example, although convolutional neural networks (CNNs) achieve state-of-the-art results on many visual recognition tasks, CNN inference on MCUs is challenging due to severe finite memory limitations. To circumvent the memory challenge associated with CNNs, various alternatives have been proposed that do fit within the memory budget of an MCU, albeit at the cost of prediction accuracy. This paper challenges the idea that CNNs are not suitable for deployment on MCUs. We demonstrate that it is possible to automatically design CNNs which generalize well, while also being small enough to fit onto memory-limited MCUs. Our Sparse Architecture Search method combines neural architecture search with pruning in a single, unified approach, which learns superior models on four popular IoT datasets. The CNNs we find are more accurate and up to $4.35\times$ smaller than previous approaches, while meeting the strict MCU working memory constraint.
Published: 2019

18. Improving Taxation of Small and Medium Enterprises in Russia

Author: Alexandrova, Nadezhda V., Blinov, Vladimir G., Ivanov, Nikolai V., Karpov, Alexey V., Fedorov, Igor Z., Pisello, Anna Laura, Editorial Board Member, Hawkes, Dean, Editorial Board Member, Bougdah, Hocine, Editorial Board Member, Rosso, Federica, Editorial Board Member, Abdalla, Hassan, Editorial Board Member, Boemi, Sofia-Natalia, Editorial Board Member, Mohareb, Nabil, Editorial Board Member, Mesbah Elkaffas, Saleh, Editorial Board Member, Bozonnet, Emmanuel, Editorial Board Member, Pignatta, Gloria, Editorial Board Member, Mahgoub, Yasser, Editorial Board Member, De Bonis, Luciano, Editorial Board Member, Kostopoulou, Stella, Editorial Board Member, Pradhan, Biswajeet, Editorial Board Member, Abdul Mannan, Md., Editorial Board Member, Alalouch, Chaham, Editorial Board Member, O. Gawad, Iman, Editorial Board Member, Nayyar, Anand, Editorial Board Member, Amer, Mourad, Series Editor, Bogoviz, Aleksei V., editor, and Popkova, Elena G., editor
Published: 2022

19. Modern Problems of Ensuring the Economic Security of the State

Author: Ivanov, Mikhail G., Ivanova, Oles ya M., Gorodnova, Olga N., Fedorov, Igor Z., Markelov, Alexander G., Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Bogoviz, Aleksei V., editor, Suglobov, Alexander E., editor, Maloletko, Alexander N., editor, and Kaurova, Olga V., editor
Published: 2022

20. Risk Assessment of the Operation of Aviation Maintenance Personnel Trained on Virtual Reality Simulators

Author: Ainakulov, Zharas, Pirmanov, Ildar, Koshekov, Kayrat, Astapenko, Natalya, Fedorov, Igor, Zuev, Dmitriy, and Kurmankulova, Gulzhan
Subjects: digital model, virtual reality, algorithm, 3d models, aviation technology, Transportation and communication, K4011-4343
Abstract: Conducting a safety briefing is essential to educate aircraft maintenance personnel, who very often encounter various unexpected and dangerous incidents. Their reaction to situations should be quick and adequate. For training aircraft maintenance tasks that cannot be practiced in real life due to high cost, danger, time or effort, virtual training seems like an obvious choice. This paper is devoted to the development of a calculation algorithm for assessing the risk of actions taken at the aircraft repair site, which was implemented in the training version of the virtual reality (VR) simulation. It includes a number of factors and elements that form the simulation scenario, influencing the degree of its complexity and the assessment of the performance of each exercise. Various components of the algorithm are presented, which allow assessing the skills of students of aviation specialist courses. The criterion for the acceptability of the developed algorithm is the correct assessment of the student’s skills in the course of training.
Published: 2022

21. Multimodal Sparse Bayesian Dictionary Learning

Author: Fedorov, Igor and Rao, Bhaskar D.
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning
Abstract: This paper addresses the problem of learning dictionaries for multimodal datasets, i.e. datasets collected from multiple data sources. We present an algorithm called multimodal sparse Bayesian dictionary learning (MSBDL). MSBDL leverages information from all available data modalities through a joint sparsity constraint. The underlying framework offers a considerable amount of flexibility to practitioners and addresses many of the shortcomings of existing multimodal dictionary learning approaches. In particular, the procedure includes the automatic tuning of hyperparameters and is unique in that it allows the dictionaries for each data modality to have different cardinality, a significant feature in cases when the dimensionality of data differs across modalities. MSBDL is scalable and can be used in supervised learning settings. Theoretical results relating to the convergence of MSBDL are presented and the numerical results provide evidence of the superior performance of MSBDL on synthetic and real datasets compared to existing methods.
Published: 2018

22. Re-Weighted Learning for Sparsifying Deep Neural Networks

Author: Fedorov, Igor and Rao, Bhaskar D.
Subjects: Computer Science - Learning
Abstract: This paper addresses the topic of sparsifying deep neural networks (DNN's). While DNN's are powerful models that achieve state-of-the-art performance on a large number of tasks, the large number of model parameters poses serious storage and computational challenges. To combat these difficulties, a growing line of work focuses on pruning network weights without sacrificing performance. We propose a general affine scaling transformation (AST) algorithm to sparsify DNN's. Our approach follows in the footsteps of popular sparse recovery techniques, which have yet to be explored in the context of DNN's. We describe a principled framework for transforming densely connected DNN's into sparsely connected ones without sacrificing network performance. Unlike existing methods, our approach is able to learn sparse connections at each layer simultaneously, and achieves comparable pruning results on the architecture tested.
Published: 2018

23. Theoretical study of the interaction of lactose with collagen, the electronic and mechanical properties of crystalline lactose

Author: Fedorov, Igor A., Voroshilin, Roman A., and Prosekov, Alexander Y.
Published: 2022
Full Text: View/download PDF

24. Relevance Subject Machine: A Novel Person Re-identification Framework

Author: Fedorov, Igor, Giri, Ritwik, Rao, Bhaskar D., and Nguyen, Truong Q.
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We propose a novel method called the Relevance Subject Machine (RSM) to solve the person re-identification (re-id) problem. RSM falls under the category of Bayesian sparse recovery algorithms and uses the sparse representation of the input video under a pre-defined dictionary to identify the subject in the video. Our approach focuses on the multi-shot re-id problem, which is the prevalent problem in many video analytics applications. RSM captures the essence of the multi-shot re-id problem by constraining the support of the sparse codes for each input video frame to be the same. Our proposed approach is also robust enough to deal with time varying outliers and occlusions by introducing a sparse, non-stationary noise term in the model error. We provide a novel Variational Bayesian based inference procedure along with an intuitive interpretation of the proposed update rules. We evaluate our approach over several commonly used re-id datasets and show superior performance over current state-of-the-art algorithms. Specifically, for ILIDS-VID, a recent large scale re-id dataset, RSM shows significant improvement over all published approaches, achieving an 11.5% (absolute) improvement in rank 1 accuracy over the closest competing algorithm considered. Comment: Submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence
Published: 2017

25. Rectified Gaussian Scale Mixtures and the Sparse Non-Negative Least Squares Problem

Author: Nalci, Alican, Fedorov, Igor, Al-Shoukairi, Maher, Liu, Thomas T, and Rao, Bhaskar D
Subjects: Non-negative least squares, sparse Bayesian learning, sparse signal recovery, rectified Gaussian scale mixtures, Networking & Telecommunications
Abstract: In this paper, we develop a Bayesian evidence maximization framework to solve the sparse non-negative least squares problem (S-NNLS). We introduce a family of probability densities referred to as the Rectified Gaussian Scale Mixture (R-GSM), to model the sparsity enforcing prior distribution for the signal of interest. The R-GSM prior encompasses a variety of heavy-tailed distributions such as the rectified Laplacian and rectified Student-t distributions with a proper choice of the mixing density. We utilize the hierarchical representation induced by the R-GSM prior and develop an evidence maximization framework based on the Expectation-Maximization (EM) algorithm. Using the EM-based method, we estimate the hyper-parameters and obtain a point estimate for the solution of interest. We refer to this proposed method as rectified Sparse Bayesian Learning (R-SBL). We provide four EM-based R-SBL variants that offer a range of options to trade-off computational complexity to the quality of the E-step computation. These methods include the Markov Chain Monte Carlo EM, linear minimum mean square estimation, approximate message passing and a diagonal approximation. Using numerical experiments, we show that the proposed R-SBL method outperforms existing S-NNLS solvers in terms of both signal and support recovery, and is very robust against the structure of the design matrix.
Published: 2018

26. A unified framework for sparse non-negative least squares using multiplicative updates and the non-negative matrix factorization problem

Author: Fedorov, Igor, Nalci, Alican, Giri, Ritwik, Rao, Bhaskar D, Nguyen, Truong Q, and Garudadri, Harinath
Subjects: Information and Computing Sciences, Communications Engineering, Engineering, Computer Vision and Multimedia Computation, Sparsity, Non-negativity, Dictionary learning, Technology, Networking & Telecommunications
Abstract: We study the sparse non-negative least squares (S-NNLS) problem. S-NNLS occurs naturally in a wide variety of applications where an unknown, non-negative quantity must be recovered from linear measurements. We present a unified framework for S-NNLS based on a rectified power exponential scale mixture prior on the sparse codes. We show that the proposed framework encompasses a large class of S-NNLS algorithms and provide a computationally efficient inference procedure based on multiplicative update rules. Such update rules are convenient for solving large sets of S-NNLS problems simultaneously, which is required in contexts like sparse non-negative matrix factorization (S-NMF). We provide theoretical justification for the proposed approach by showing that the local minima of the objective function being optimized are sparse and the S-NNLS algorithms presented are guaranteed to converge to a set of stationary points of the objective function. We then extend our framework to S-NMF, showing that our framework leads to many well known S-NMF algorithms under specific choices of prior and providing a guarantee that a popular subclass of the proposed algorithms converges to a set of stationary points of the objective function. Finally, we study the performance of the proposed approaches on synthetic and real-world data.
Published: 2018

27. Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale

Author: Wen, Wei, primary, Liu, Kuang-Hung, additional, Fedorov, Igor, additional, Zhang, Xin, additional, Yin, Hang, additional, Chu, Weiwei, additional, Hassani, Kaveh, additional, Sun, Mengying, additional, Liu, Jiang, additional, Wang, Xu, additional, Jiang, Lin, additional, Chen, Yuxin, additional, Zhang, Buyun, additional, Liu, Xi, additional, Cheng, Dehua, additional, Chen, Zhengxing, additional, Zhao, Guang, additional, Han, Fangqiu, additional, Yang, Jiyan, additional, Hao, Yuchen, additional, Xiong, Liang, additional, and Chen, Wen-Yen, additional
Published: 2024
Full Text: View/download PDF

28. Computer Simulation of the Structural Properties of Energetic Materials Using High Performance Computing

Author: Fedorov, Igor A., Reyn, Tatyana S., Karabtsev, Sergei N., Kacprzyk, Janusz, Series Editor, Pal, Nikhil R., Advisory Editor, Bello Perez, Rafael, Advisory Editor, Corchado, Emilio S., Advisory Editor, Hagras, Hani, Advisory Editor, Kóczy, László T., Advisory Editor, Kreinovich, Vladik, Advisory Editor, Lin, Chin-Teng, Advisory Editor, Lu, Jie, Advisory Editor, Melin, Patricia, Advisory Editor, Nedjah, Nadia, Advisory Editor, Nguyen, Ngoc Thanh, Advisory Editor, Wang, Jun, Advisory Editor, Silhavy, Radek, editor, Silhavy, Petr, editor, and Prokopova, Zdenka, editor
Published: 2020
Full Text: View/download PDF

29. Robust Bayesian Method for Simultaneous Block Sparse Signal Recovery with Applications to Face Recognition

Author: Fedorov, Igor, Giri, Ritwik, Rao, Bhaskar D., and Nguyen, Truong Q.
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we present a novel Bayesian approach to recover simultaneously block sparse signals in the presence of outliers. The key advantage of our proposed method is the ability to handle non-stationary outliers, i.e. outliers which have time varying support. We validate our approach with empirical results showing the superiority of the proposed method over competing approaches in synthetic data experiments as well as the multiple measurement face recognition problem. Comment: To appear in ICIP 2016
Published: 2016

30. A Unified Framework for Sparse Non-Negative Least Squares using Multiplicative Updates and the Non-Negative Matrix Factorization Problem

Author: Fedorov, Igor, Nalci, Alican, Giri, Ritwik, Rao, Bhaskar D., Nguyen, Truong Q., and Garudadri, Harinath
Subjects: Statistics - Machine Learning
Abstract: We study the sparse non-negative least squares (S-NNLS) problem. S-NNLS occurs naturally in a wide variety of applications where an unknown, non-negative quantity must be recovered from linear measurements. We present a unified framework for S-NNLS based on a rectified power exponential scale mixture prior on the sparse codes. We show that the proposed framework encompasses a large class of S-NNLS algorithms and provide a computationally efficient inference procedure based on multiplicative update rules. Such update rules are convenient for solving large sets of S-NNLS problems simultaneously, which is required in contexts like sparse non-negative matrix factorization (S-NMF). We provide theoretical justification for the proposed approach by showing that the local minima of the objective function being optimized are sparse and the S-NNLS algorithms presented are guaranteed to converge to a set of stationary points of the objective function. We then extend our framework to S-NMF, showing that our framework leads to many well known S-NMF algorithms under specific choices of prior and providing a guarantee that a popular subclass of the proposed algorithms converges to a set of stationary points of the objective function. Finally, we study the performance of the proposed approaches on synthetic and real-world data. Comment: To appear in Signal Processing
Published: 2016

31. Rectified Gaussian Scale Mixtures and the Sparse Non-Negative Least Squares Problem

Author: Nalci, Alican, Fedorov, Igor, Al-Shoukairi, Maher, Liu, Thomas T., and Rao, Bhaskar D.
Subjects: Computer Science - Learning, Statistics - Machine Learning
Abstract: In this paper, we develop a Bayesian evidence maximization framework to solve the sparse non-negative least squares (S-NNLS) problem. We introduce a family of probability densities referred to as the Rectified Gaussian Scale Mixture (R-GSM) to model the sparsity enforcing prior distribution for the solution. The R-GSM prior encompasses a variety of heavy-tailed densities such as the rectified Laplacian and rectified Student-t distributions with a proper choice of the mixing density. We utilize the hierarchical representation induced by the R-GSM prior and develop an evidence maximization framework based on the Expectation-Maximization (EM) algorithm. Using the EM based method, we estimate the hyper-parameters and obtain a point estimate for the solution. We refer to the proposed method as rectified sparse Bayesian learning (R-SBL). We provide four R-SBL variants that offer a range of options for computational complexity and the quality of the E-step computation. These methods include the Markov chain Monte Carlo EM, linear minimum mean-square-error estimation, approximate message passing and a diagonal approximation. Using numerical experiments, we show that the proposed R-SBL method outperforms existing S-NNLS solvers in terms of both signal and support recovery performance, and is also very robust against the structure of the design matrix. Comment: Under Review by IEEE Transactions on Signal Processing
Published: 2016

32. Modern Problems of Ensuring the Economic Security of the State

Author: Ivanov, Mikhail G., primary, Ivanova, Oles'ya M., additional, Gorodnova, Olga N., additional, Fedorov, Igor Z., additional, and Markelov, Alexander G., additional
Published: 2021
Full Text: View/download PDF

33. Deep Integration of INS and DP: from Theory to Experiments

Author: Ambrosovskaya, Elena, Romaev, Dmitry, Proskurnikov, Anton, Loginov, Andrey, Mordvintsev, Alexander, Miroshnikov, Alexander, and Fedorov, Igor
Published: 2021
Full Text: View/download PDF

34. Photosynthetic activity of plankton in saline lakes under climatic fluctuations conditions

Author: Tsybekmitova Gazhit and Fedorov Igor
Subjects: Environmental sciences, GE1-350
Abstract: Climate change causes the alternation of high-water and low-water years, which is reflected in morphometric characteristics of lakes. Changes in the depth and volume of water bodies affect the structural and functional activity of the ecosystem, including the growth and development of primary producers. The photosynthetic activity of water body plankton was shown on the example of Dabasa-nor lake and Kudzhertai lake located in the steppe territory of the South-Eastern Transbaikalia. The studied lakes differ in their hydrochemical composition. The content of chlorophyll a in Dabasa-nor lake is lower than in Kudzhertai lake. The degraded form of chlorophyll a, pheophytin, is present only in Dabasa-nor lake, which currently has the lowest water level. Carotenoids, as more stable forms of pigments, dominate in the pigment diversity of lake plankton. Chlorophyll c was not detected in the plankton of either lake. Factor analysis of the main components showed the interdependence of a number of abiotic agents with the pigment characteristic of phytoplankton. Thus, under the current conditions of the period of lake water availability recovery after a long dry period, the photosynthetic activity of the water body ecosystem is experiencing certain difficulties.
Published: 2023
Full Text: View/download PDF

35. Physicochemical properties of l- and dl-valine: first-principles calculations

Author: Fedorov, Igor, Korabel’nikov, Dmitry, Nguyen, Chuong, and Prosekov, Alexander
Published: 2020
Full Text: View/download PDF

36. Determination of nanoparticle size using Rayleigh approximation and Mie theory

Author: Niskanen, Ilpo, Forsberg, Viviane, Zakrisson, Daniel, Reza, Salim, Hummelgård, Magnus, Andres, Britta, Fedorov, Igor, Suopajärvi, Terhi, Liimatainen, Henrikki, and Thungström, Göran
Published: 2019
Full Text: View/download PDF

37. Pressure effect on the band structure and topological properties of the electron density of pyrene: First-principles calculations

Author: Fedorov, Igor A.
Published: 2019
Full Text: View/download PDF

38. The Method of Determining the Minimum Permissible Rim Thickness of Solid-Rolled Wheels of Freight Cars in Operation

Author: Vorob'ev, Aleksandr, primary, Fedorov, Igor', additional, Konogray, Ol'ga, additional, Metlyakova, Sof'ya, additional, and Migulin, Nikita, additional
Published: 2023
Full Text: View/download PDF

39. Computer Simulation of the Structural Properties of Energetic Materials Using High Performance Computing

Author: Fedorov, Igor A., primary, Reyn, Tatyana S., additional, and Karabtsev, Sergei N., additional
Published: 2020
Full Text: View/download PDF

40. Towards a Volunteered Geographic Information-Facilitated Visual Analytics Pipeline to Improve Impact-Based Weather Warning Systems

Author: Vrotsou, Katerina, primary, Navarra, Carlo, additional, Kucher, Kostiantyn, additional, Fedorov, Igor, additional, Schück, Fredrik, additional, Unger, Jonas, additional, and Neset, Tina-Simone, additional
Published: 2023
Full Text: View/download PDF

41. First-principles study of band structures of anthracene and tetracene under pressure

Author: Fedorov, Igor A.
Published: 2017
Full Text: View/download PDF

42. Chapter 17 - Dictionaries in machine learning

Author: Kreutz-Delgado, Kenneth, Rao, Bhaskar, Fedorov, Igor, and Das, Srinjoy
Published: 2024
Full Text: View/download PDF

43. Tuning the Electronic and Optical Properties of Two-Dimensional Graphene-like C2N Nanosheet by Strain Engineering

Author: Phuc, Huynh V., Tuan, Vu V., Hieu, Nguyen N., Ilyasov, Victor V., Fedorov, Igor A., Hoi, Bui D., Phuong, Le T. T., Hieu, Nguyen V., Feddi, Elmustapha, and Nguyen, Chuong V.
Published: 2018
Full Text: View/download PDF

44. Towards a Volunteered Geographic Information-Facilitated Visual Analytics Pipeline to Improve Impact-Based Weather Warning Systems

Author: Vrotsou, Katerina, Navarra, Carlo, Kucher, Kostiantyn, Fedorov, Igor, Schück, Fredrik, Unger, Jonas, and Neset, Tina-Simone
Published: 2023
Full Text: View/download PDF

45. THE CATEGORY «INTEGRATION» IN INTERNATIONAL EDUCATIONAL LAW

Author: Fedorov Igor
Subjects: integration, law of integration, international educational law, Law
Abstract: The article presents a brief overview of the development of the category «integration» in doctrine. The existing identification of law of integration with integration is noted. The application of the category in certain areas of international interaction, in particular, in international educational law, is evaluated. The author shares the definition of international educational integration proposed in the scientific literature and considers the peculiarities of integrative interaction of states and other subjects in the educational sphere described in research.
Published: 2018

46. Methods of calculating landslide volume using remote sensing data

Author: Razakova Maira, Kuzmin Alexandr, Fedorov Igor, Yergaliev Rustam, and Ainakulov Zharas
Subjects: Environmental sciences, GE1-350
Abstract: The paper considers the issues of calculating the volume of the landslide from remote sensing data. The main methods of obtaining information during research are field observations. The most important results of field studies are quantitative estimates, such as the volume of the embankment resulting from a landslide, morphometric indicators, etc. The study of a remote object was carried out by remote methods using aerial photographs in the Ile Alatau foothills at 1,600 meters above sea level. The obtained materials from the mudflow survey will be useful in developing solutions to mitigate the effects of disasters and in the design of measures for engineering protection from landslides.
Published: 2020
Full Text: View/download PDF

47. Structured Learning with Scale Mixture Priors

Author: Fedorov, Igor
Subjects: Electrical engineering
Abstract: Sparsity plays an essential role in a number of modern algorithms. This thesis examines how we can incorporate additional structural information within the sparsity profile and formulate a richer class of learning approaches. The focus is on Bayesian techniques for promoting sparsity and developing novel priors and inference schemes. The thesis begins by showing how structured sparsity can be used to recover simultaneously block sparse signals in the presence of outliers. The approach is validated with empirical results on synthetic data experiments as well as the multiple measurement face recognition problem. In the next portion of the thesis, the focus is on how structured sparsity can be used to extend approaches for dictionary learning. Dictionary learning refers to decomposing a data matrix into the product of a dictionary and coefficient matrix, subject to a sparsity constraint on the coefficient matrix. Chapter 3 studies structure in the form of non-negativity constraints on the unknowns, which is referred to as the sparse non-negative least squares (S-NNLS) problem. It presents a unified framework for S-NNLS based on a novel prior on the sparse codes and provides an efficient multiplicative inference procedure. It then extends the framework to sparse non-negative matrix factorization (S-NMF) and proves that the proposed approach is guaranteed to converge to a set of stationary points for both the S-NNLS and a subclass of the S-NMF problems. Finally, Chapter 4 addresses the problem of learning dictionaries for multimodal datasets. It presents the multimodal sparse Bayesian dictionary learning (MSBDL) algorithm. The MSBDL algorithm is able to leverage information from all available data modalities through a joint sparsity constraint on each modality’s sparse codes without restricting the coefficients themselves to be equal.
The proposed framework offers a considerable amount of flexibility to practitioners and addresses many of the shortcomings of existing multimodal dictionary learning approaches. Unlike existing approaches, MSBDL allows the dictionaries for each data modality to have different cardinality. In addition, MSBDL can be used in numerous scenarios, from small datasets to extensive datasets with large dimensionality. MSBDL can also be used in supervised settings.
Published: 2018

48. The effect of hydrostatic compression on the crystal structure of glycinium phosphite

Author: Bogdanov, Nikita E., primary, Korabel'nikov, Dmitry V., additional, Fedorov, Igor A., additional, Zakharov, Boris A., additional, and Boldyreva, Elena V., additional
Published: 2022
Full Text: View/download PDF

49. Hydrostatic pressure effects on structural and electronic properties of TATB from first principles calculations

Author: Fedorov, Igor A. and Zhuravlev, Yuriy N.
Published: 2014
Full Text: View/download PDF

50. Chemical composition and formation conditions of NaHCO3 type of waters in the eastern Transbaikalia

Author: Borzenko Svetlana, Drebot Valeriia, and Fedorov Igor
Subjects: Environmental sciences, GE1-350
Abstract: The chemical composition of the supergene zone groundwater in the Eastern Transbaikalia is considered. The main geochemical processes for steppe landscapes are identified, and the causes of some components' concentrations in groundwater are determined. To understand the formation conditions of NaHCO3 water, equilibria of this water with the main minerals of the region’s rocks were calculated.
Published: 2019
Full Text: View/download PDF
