Author: "Ghaffari, Alireza" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Ghaffari, Alireza"' showing total 177 results

Start Over Author "Ghaffari, Alireza"

177 results on '"Ghaffari, Alireza"'

1. OAC: Output-adaptive Calibration for Accurate Post-training Quantization

Author: Edalati, Ali, Ghaffari, Alireza, Asgharian, Masoud, Hou, Lu, Chen, Boxing, and Nia, Vahid Partovi
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language
Abstract: Deployment of Large Language Models (LLMs) has major computational costs, due to their rapidly expanding size. Compression of LLMs reduces the memory footprint, latency, and energy required for their inference. Post-training Quantization (PTQ) techniques have been developed to compress LLMs while avoiding expensive re-training. Most PTQ approaches formulate the quantization error based on a layer-wise $\ell_2$ loss, ignoring the model output. Then, each layer is calibrated using its layer-wise Hessian to update the weights towards minimizing the $\ell_2$ quantization error. The Hessian is also used for detecting the most salient weights to quantization. Such PTQ approaches are prone to accuracy drop in low-precision quantization. We propose Output-adaptive Calibration (OAC) to incorporate the model output in the calibration process. We formulate the quantization error based on the distortion of the output cross-entropy loss. OAC approximates the output-adaptive Hessian for each layer under reasonable assumptions to reduce the computational complexity. The output-adaptive Hessians are used to update the weight matrices and detect the salient weights towards maintaining the model output. Our proposed method outperforms the state-of-the-art baselines such as SpQR and BiLLM, especially, at extreme low-precision (2-bit and binary) quantization., Comment: 20 pages, 4 figures
Published: 2024

2. AdpQ: A Zero-shot Calibration Free Adaptive Post Training Quantization Method for LLMs

Author: Ghaffari, Alireza, Younesian, Sharareh, Nia, Vahid Partovi, Chen, Boxing, and Asgharian, Masoud
Subjects: Computer Science - Computation and Language
Abstract: The ever-growing computational complexity of Large Language Models (LLMs) necessitates efficient deployment strategies. The current state-of-the-art approaches for Post-training Quantization (PTQ) often require calibration to achieve the desired accuracy. This paper presents AdpQ, a novel zero-shot adaptive PTQ method for LLMs that achieves the state-of-the-art performance in low-precision quantization (e.g. 3-bit) without requiring any calibration data. Inspired by Adaptive LASSO regression model, our proposed approach tackles the challenge of outlier activations by separating salient weights using an adaptive soft-thresholding method. Guided by Adaptive LASSO, this method ensures that the quantized weights distribution closely follows the originally trained weights and eliminates the need for calibration data entirely, setting our method apart from popular approaches such as SpQR and AWQ. Furthermore, our method offers an additional benefit in terms of privacy preservation by eliminating any calibration or training data. We also delve deeper into the information-theoretic underpinnings of the proposed method. We demonstrate that it leverages the Adaptive LASSO to minimize the Kullback-Leibler divergence between the quantized weights and the originally trained weights. This minimization ensures the quantized model retains the Shannon information content of the original model to a great extent, guaranteeing efficient deployment without sacrificing accuracy or information. Our results achieve the same accuracy as the existing methods on various LLM benchmarks while the quantization time is reduced by at least 10x, solidifying our contribution to efficient and privacy-preserving LLM deployment.
Published: 2024

3. Mitigating Outlier Activations in Low-Precision Fine-Tuning of Language Models

Author: Ghaffari, Alireza, Yu, Justin, Nejad, Mahsa Ghazvini, Asgharian, Masoud, Chen, Boxing, and Nia, Vahid Partovi
Subjects: Computer Science - Computation and Language
Abstract: Low-precision fine-tuning of language models has gained prominence as a cost-effective and energy-efficient approach to deploying large-scale models in various applications. However, this approach is susceptible to the existence of outlier values in activation. The outlier values in the activation can negatively affect the performance of fine-tuning language models in the low-precision regime since they affect the scaling factor and thus make representing smaller values harder. This paper investigates techniques for mitigating outlier activation in low-precision integer fine-tuning of the language models. Our proposed novel approach enables us to represent the outlier activation values in 8-bit integers instead of floating-point (FP16) values. The benefit of using integers for outlier values is that it enables us to use operator tiling to avoid performing 16-bit integer matrix multiplication to address this problem effectively. We provide theoretical analysis and supporting experiments to demonstrate the effectiveness of our approach in improving the robustness and performance of low-precision fine-tuned language models.
Published: 2023

4. Simultaneous rotary and linear displacement sensor based on soft pneumatic sensing chambers

Author: Ghaffari, Alireza and Hojjat, Yousef
Published: 2024
Full Text: View/download PDF

5. Statistical Hardware Design With Multi-model Active Learning

Author: Ghaffari, Alireza, Asgharian, Masoud, and Savaria, Yvon
Subjects: Computer Science - Hardware Architecture, Computer Science - Machine Learning
Abstract: With the rising complexity of numerous novel applications that serve our modern society comes the strong need to design efficient computing platforms. Designing efficient hardware is, however, a complex multi-objective problem that deals with multiple parameters and their interactions. Given that there are a large number of parameters and objectives involved in hardware design, synthesizing all possible combinations is not a feasible method to find the optimal solution. One promising approach to tackle this problem is statistical modeling of a desired hardware performance. Here, we propose a model-based active learning approach to solve this problem. Our proposed method uses Bayesian models to characterize various aspects of hardware performance. We also use transfer learning and Gaussian regression bootstrapping techniques in conjunction with active learning to create more accurate models. Our proposed statistical modeling method provides hardware models that are sufficiently accurate to perform design space exploration as well as performance prediction simultaneously. We use our proposed method to perform design space exploration and performance prediction for various hardware setups, such as micro-architecture design and OpenCL kernels for FPGA targets. Our experiments show that the number of samples required to create performance models significantly reduces while maintaining the predictive power of our proposed statistical models. For instance, in our performance prediction setting, the proposed method needs 65% fewer samples to create the model, and in the design space exploration setting, our proposed method can find the best parameter settings by exploring less than 50 samples., Comment: added a reference for GRP subsampling and corrected typos
Published: 2023

6. On the Convergence of Stochastic Gradient Descent in Low-precision Number Formats

Author: Cacciola, Matteo, Frangioni, Antonio, Asgharian, Masoud, Ghaffari, Alireza, and Nia, Vahid Partovi
Subjects: Computer Science - Machine Learning, Mathematics - Numerical Analysis
Abstract: Deep learning models are dominating almost all artificial intelligence tasks such as vision, text, and speech processing. Stochastic Gradient Descent (SGD) is the main tool for training such models, where the computations are usually performed in single-precision floating-point number format. The convergence of single-precision SGD is normally aligned with the theoretical results of real numbers since they exhibit negligible error. However, the numerical error increases when the computations are performed in low-precision number formats. This provides compelling reasons to study the SGD convergence adapted for low-precision computations. We present both deterministic and stochastic analysis of the SGD algorithm, obtaining bounds that show the effect of number format. Such bounds can provide guidelines as to how SGD convergence is affected when constraints render the possibility of performing high-precision computations remote.
Published: 2023

7. EuclidNets: An Alternative Operation for Efficient Inference of Deep Learning Models

Author: Li, Xinlin, Parazeres, Mariana, Oberman, Adam, Ghaffari, Alireza, Asgharian, Masoud, and Nia, Vahid Partovi
Subjects: Computer Science - Machine Learning, I.2.6
Abstract: With the advent of deep learning application on edge devices, researchers actively try to optimize their deployments on low-power and restricted memory devices. There are established compression method such as quantization, pruning, and architecture search that leverage commodity hardware. Apart from conventional compression algorithms, one may redesign the operations of deep learning models that lead to more efficient implementation. To this end, we propose EuclidNet, a compression method, designed to be implemented on hardware which replaces multiplication, $xw$, with Euclidean distance $(x-w)^2$. We show that EuclidNet is aligned with matrix multiplication and it can be used as a measure of similarity in case of convolutional layers. Furthermore, we show that under various transformations and noise scenarios, EuclidNet exhibits the same performance compared to the deep learning models designed with multiplication operations.
Published: 2022

8. Towards Fine-tuning Pre-trained Language Models with Integer Forward and Backward Propagation

Author: Tayaranian, Mohammadreza, Ghaffari, Alireza, Tahaei, Marzieh S., Rezagholizadeh, Mehdi, Asgharian, Masoud, and Nia, Vahid Partovi
Subjects: Computer Science - Machine Learning
Abstract: The large number of parameters of some prominent language models, such as BERT, makes their fine-tuning on downstream tasks computationally intensive and energy hungry. Previously researchers were focused on lower bit-width integer data types for the forward propagation of language models to save memory and computation. As for the backward propagation, however, only 16-bit floating-point data type has been used for the fine-tuning of BERT. In this work, we use integer arithmetic for both forward and back propagation in the fine-tuning of BERT. We study the effects of varying the integer bit-width on the model's metric performance. Our integer fine-tuning uses integer arithmetic to perform forward propagation and gradient computation of linear, layer-norm, and embedding layers of BERT. We fine-tune BERT using our integer training method on SQuAD v1.1 and SQuAD v2., and GLUE benchmark. We demonstrate that metric performance of fine-tuning 16-bit integer BERT matches both 16-bit and 32-bit floating-point baselines. Furthermore, using the faster and more memory efficient 8-bit integer data type, integer fine-tuning of BERT loses an average of 3.1 points compared to the FP32 baseline., Comment: Accepted in EACL2023
Published: 2022

9. Is Integer Arithmetic Enough for Deep Learning Training?

Author: Ghaffari, Alireza, Tahaei, Marzieh S., Tayaranian, Mohammadreza, Asgharian, Masoud, and Nia, Vahid Partovi
Subjects: Computer Science - Machine Learning, Computer Science - Computational Complexity
Abstract: The ever-increasing computational complexity of deep learning models makes their training and deployment difficult on various cloud and edge platforms. Replacing floating-point arithmetic with low-bit integer arithmetic is a promising approach to save energy, memory footprint, and latency of deep learning models. As such, quantization has attracted the attention of researchers in recent years. However, using integer numbers to form a fully functional integer training pipeline including forward pass, back-propagation, and stochastic gradient descent is not studied in detail. Our empirical and mathematical results reveal that integer arithmetic seems to be enough to train deep learning models. Unlike recent proposals, instead of quantization, we directly switch the number representation of computations. Our novel training method forms a fully integer training pipeline that does not change the trajectory of the loss and accuracy compared to floating-point, nor does it need any special hyper-parameter tuning, distribution adjustment, or gradient clipping. Our experimental results show that our proposed method is effective in a wide variety of tasks such as classification (including vision transformers), object detection, and semantic segmentation., Comment: final camera ready submitted to NeurIPS
Published: 2022

10. Rethinking Pareto Frontier for Performance Evaluation of Deep Neural Networks

Author: Nia, Vahid Partovi, Ghaffari, Alireza, Zolnouri, Mahdi, and Savaria, Yvon
Subjects: Computer Science - Machine Learning, Computer Science - Neural and Evolutionary Computing, 68T07 (primery), 90C08 (secondary)
Abstract: Performance optimization of deep learning models is conducted either manually or through automatic architecture search, or a combination of both. On the other hand, their performance strongly depends on the target hardware and how successfully the models were trained. We propose to use a multi-dimensional Pareto frontier to re-define the efficiency measure of candidate deep learning models, where several variables such as training cost, inference latency, and accuracy play a relative role in defining a dominant model. Furthermore, a random version of the multi-dimensional Pareto frontier is introduced to mitigate the uncertainty of accuracy, latency, and throughput of deep learning models in different experimental setups. These two complementary methods can be combined to perform objective benchmarking of deep learning models. Our proposed method is applied to a wide range of deep image classification models trained on ImageNet data. Our method combines competing variables with stochastic nature in a single relative efficiency measure. This allows ranking deep learning models that run efficiently on different hardware, and combining inference efficiency with training efficiency objectively.
Published: 2022

11. CNN2Gate: Toward Designing a General Framework for Implementation of Convolutional Neural Networks on FPGA

Author: Ghaffari, Alireza and Savaria, Yvon
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Signal Processing
Abstract: Convolutional Neural Networks (CNNs) have a major impact on our society because of the numerous services they provide. On the other hand, they require considerable computing power. To satisfy these requirements, it is possible to use graphic processing units (GPUs). However, high power consumption and limited external IOs constrain their usability and suitability in industrial and mission-critical scenarios. Recently, the number of researches that utilize FPGAs to implement CNNs are increasing rapidly. This is due to the lower power consumption and easy reconfigurability offered by these platforms. Because of the research efforts put into topics such as architecture, synthesis and optimization, some new challenges are arising to integrate such hardware solutions to high-level machine learning software libraries. This paper introduces an integrated framework (CNN2Gate) that supports compilation of a CNN model for an FPGA target. CNN2Gate exploits the OpenCL synthesis workflow for FPGAs offered by commercial vendors. CNN2Gate is capable of parsing CNN models from several popular high-level machine learning libraries such as Keras, Pytorch, Caffe2 etc. CNN2Gate extracts computation flow of layers, in addition to weights and biases and applies a "given" fixed-point quantization. Furthermore, it writes this information in the proper format for OpenCL synthesis tools that are then used to build and run the project on FPGA. CNN2Gate performs design-space exploration using a reinforcement learning agent and fits the design on different FPGAs with limited logic resources automatically. This paper reports results of automatic synthesis and design-space exploration of AlexNet and VGG-16 on various Intel FPGA platforms. CNN2Gate achieves a latency of 205 ms for VGG-16 and 18 ms for AlexNet on the FPGA.
Published: 2020

12. EuclidNets: An Alternative Operation for Efficient Inference of Deep Learning Models

Author: Li, Xinlin, Parazeres, Mariana, Oberman, Adam, Ghaffari, Alireza, Asgharian, Masoud, and Nia, Vahid Partovi
Published: 2023
Full Text: View/download PDF

13. Hot-Melt Extrusion-Based Dexamethasone–PLGA Implants: Physicochemical, Physicomechanical, and Surface Morphological Properties and In Vitro Release Corrected for Drug Degradation.

Author: Ghaffari, Alireza, Matter, Brock A., Hartman, Rachel R., Bourne, David W. A., Wang, Yan, Choi, Stephanie, and Kompella, Uday B.
Subjects: *DRUG delivery systems, *GENERIC products, *SURFACE roughness, *SURFACE properties, *PHARMACODYNAMICS
Abstract: Developing bioequivalent (BE) generic products of complex dosage forms like intravitreal implants (IVIs) of corticosteroids such as dexamethasone prepared using hot-melt extrusion (HME), based on biodegradable poly (lactide-co-glycolide) (PLGA) polymers, can be challenging. A better understanding of the relationship between the physicochemical and physicomechanical properties of IVIs and their effect on drug release and ocular bioavailability is crucial to develop novel BE approaches. It is possible that the key physicochemical and physicomechanical properties of IVIs such as drug properties, implant surface roughness, mechanical strength and toughness, and implant erosion could vary for different compositions, resulting in changes in drug release. Therefore, this study investigated the hypothesis that biodegradable ophthalmic dexamethasone-loaded implants with 20% drug and 80% PLGA polymer(s) prepared using single-pass hot-melt extrusion (HME) differ in physicochemical and/or physicomechanical properties and drug release depending on their PLGA polymer composition. Acid end-capped PLGA was mixed with an ester end-capped PLGA to make three formulations: HME-1, HME-2, and HME-3, containing 100%, 80%, and 60% w/w of the acid end-capped PLGA. Further, this study compared the drug release between independent batches of each composition. In vitro release tests (IVRTs) indicated that HME-1 implants can be readily distinguished by their release profiles from HME-2 and HME-3, with the release being similar for HME-2 and HME-3. In the early stages, drug release generally correlated well with polymer composition and implant properties, with the release increasing with PLGA acid content (for day-1 release, R2 = 0.80) and/or elevated surface roughness (for day-1 and day-14 release, R2 ≥ 0.82). Further, implant mechanical strength and toughness correlated inversely with PLGA acid content and day-1 drug release. Drug release from independent batches was similar for each composition. The findings of this project could be helpful for developing generic PLGA polymer-based ocular implant products. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

14. Mitigating Outlier Activations in Low-Precision Fine-Tuning of Language Models

Author: Ghaffari, Alireza, primary, Yu, Justin, additional, Nejad, Mahsa, additional, Asgharian, Masoud, additional, Chen, Boxing, additional, and Nia, Vahid, additional
Published: 2024
Full Text: View/download PDF

15. The Association Between QTc-Interval and Blood Glucose with Plasma Acetylcholinesterase in Organophosphate (OP)-Poisoned Patients.

Author: ZaareNahandi, Maryam, Ostadi, Ali, khodayari, Mahdi, Azimi, Amin, Mohammadi, Ahad Banagozar, Ghaffari, Alireza, and Mohammadi, Ali Banagozar
Subjects: CHOLINESTERASES, PEARSON correlation (Statistics), CROSS-sectional method, STATISTICAL correlation, LONG QT syndrome, T-test (Statistics), HOSPITAL care, DESCRIPTIVE statistics, CHI-squared test, BLOOD sugar, ORGANOPHOSPHORUS compounds, DATA analysis software, CONFIDENCE intervals, REGRESSION analysis
Published: 2024
Full Text: View/download PDF

16. The use of Aster remotely sensed information to quantify gully networks in southern Italy

Author: Ghaffari, Alireza
Subjects: 551.302
Published: 2006

17. Artificial intelligence in cardiovascular medicine: An updated review of the literature.

Author: Zargarzadeh, Arian, Javanshir, Elnaz, Ghaffari, Alireza, Mosharkesh, Erfan, and Anari, Babak
Subjects: CARDIOVASCULAR disease treatment, CARDIOVASCULAR disease prevention, ATRIAL fibrillation diagnosis, CARDIOVASCULAR disease diagnosis, PRIVACY, CARDIAC hypertrophy, PULMONARY hypertension, ARTIFICIAL intelligence, MEDICAL screening, MEDICAL ethics, MEDICAL records, EARLY diagnosis, HEART failure
Abstract: Screening and early detection of cardiovascular disease (CVD) are crucial for managing progress and preventing related morbidity. In recent years, several studies have reported the important role of Artificial intelligence (AI) technology and its integration into various medical sectors. AI applications are able to deal with the massive amounts of data (medical records, ultrasounds, medications, and experimental results) generated in medicine and identify novel details that would otherwise be forgotten in the mass of healthcare data sets. Nowadays, AI algorithms are currently used to improve diagnosis of some CVDs including heart failure, atrial fibrillation, hypertrophic cardiomyopathy and pulmonary hypertension. This review summarized some AI concepts, critical execution requirements, obstacles, and new applications for CVDs. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

18. Statistical Hardware Design With Multimodel Active Learning

Author: Ghaffari, Alireza, Asgharian, Masoud, and Savaria, Yvon
Abstract: With the rising complexity of numerous novel applications that serve our modern society comes the strong need to design efficient computing platforms. Designing efficient hardware is, however, a complex multiobjective problem that deals with multiple parameters and their interactions. Given that there is a large number of parameters and objectives involved in hardware design, synthesizing all possible combinations is not a feasible method to find the optimal solution. One promising approach to tackle this problem is statistical modeling of a desired hardware performance. Here, we propose a model-based active learning approach to solve this problem. Our proposed method uses Bayesian models to characterize various aspects of hardware performance. We also use acrlong TL and Gaussian regression bootstrapping techniques in conjunction with active learning to create more accurate models. Our proposed statistical modeling method provides hardware models that are sufficiently accurate to perform design space exploration (DSE) as well as performance prediction simultaneously. We use our proposed method to perform DSE and performance prediction for various hardware setups, such as micro-architecture design and OpenCL kernels for FPGA targets. Our experiments show that the number of samples required to create performance models significantly reduces while maintaining the predictive power of our proposed statistical models. For instance, in our performance prediction setting, the proposed method needs 65% fewer samples to create the model, and in the DSE setting, our proposed method can find the best parameter settings by exploring fewer than 50 samples.
Published: 2024
Full Text: View/download PDF

19. Dexamethasone Degradation in Aqueous Medium and Implications for Correction of In Vitro Release from Sustained Release Delivery Systems

Author: Matter, Brock, Ghaffari, Alireza, Bourne, David, Wang, Yan, Choi, Stephanie, and Kompella, Uday B.
Published: 2019
Full Text: View/download PDF

20. A Novel Absolute Rotary Encoder Based on Soft Pneumatic Sensing Chambers

Author: Ghaffari, Alireza, primary and Hojjat, Yousef, additional
Published: 2023
Full Text: View/download PDF

21. DR-PIFO: A Dynamic Ranking Packet Scheduler Using a Push-In-First-Out Queue

Author: Elbediwy, Mostafa, primary, Pontikakis, Bill, additional, Ghaffari, Alireza, additional, David, Jean-Pierre, additional, and Savaria, Yvon, additional
Published: 2023
Full Text: View/download PDF

22. On the Convergence of Stochastic Gradient Descent in Low-Precision Number Formats

Author: Cacciola, Matteo, primary, Frangioni, Antonio, additional, Asgharian, Masoud, additional, Ghaffari, Alireza, additional, and Nia, Vahid, additional
Published: 2023
Full Text: View/download PDF

23. Statistical Hardware Design With Multi-Model Active Learning

Author: Ghaffari, Alireza, primary, Asgharian, Masoud, additional, and Savaria, Yvon, additional
Published: 2023
Full Text: View/download PDF

24. Development of a stability-indicating high performance liquid chromatography method for assay of erythromycin ethylsuccinate in powder for oral suspension dosage form

Author: Kamarei, Fahimeh, Movaghari, Farshid, Ghaffari, Alireza, Bozchalooi, Iman Soltani, Zamani, Ali, and Jabbari, Ali
Published: 2014
Full Text: View/download PDF

25. 2‐D analytical magnetic model for optimal design of an outer rotor permanent magnet synchronous machine

Author: Vahaj, Amir Abbas, primary, Rahideh, Akbar, additional, Zamani Faradonbeh, Vahid, additional, Salehi, Ali Reza, additional, Ghaffari, Alireza, additional, Shahnazari, Mostafa, additional, and Lubin, Thierry, additional
Published: 2022
Full Text: View/download PDF

26. A Brief History of Elephantiasis

Author: Golzari, Samad E. J., Kazemi, Abolhassan, Ghaffari, Alireza, and Ghabili, Kamyar
Published: 2012
Full Text: View/download PDF

27. 2‐D analytical magnetic model for optimal design of an outer rotor permanent magnet synchronous machine.

Author: Vahaj, Amir Abbas, Rahideh, Akbar, Zamani Faradonbeh, Vahid, Salehi, Ali Reza, Ghaffari, Alireza, Shahnazari, Mostafa, and Lubin, Thierry
Subjects: PERMANENT magnets, FINITE element method, MAGNETISM, MAGNETIC cores, MATHEMATICAL optimization, ELECTRIC power consumption
Abstract: The objective of this paper is to investigate the optimal design of a 120 W outer‐rotor permanent magnet synchronous machine for air‐conditioning applications to reduce the electricity consumption and machine costs. To this end, a 2‐D analytical magnetic model, combined with a teaching–learning based optimisation algorithm is employed. The objectives are to simultaneously maximise the machine efficiency and minimise the material cost. Constraints such as the required output torque, predefined machine volume, maximum allowable torque ripple and unbalanced magnetic force and limitations to avoid iron core magnetic saturation are imposed simultaneously. To find the flux density in iron parts, a new method based on the subdomain technique has been presented. To evaluate the analytical model in terms of accuracy and speed, the results of the 2‐D analytical approach are compared with those of the 2‐D and 3‐D linear/non‐linear finite element method. Finally, a prototype of the optimised machine is fabricated and the superiority of the presented approach is demonstrated. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

28. Tailoring Release Profiles of BCS Class II Drugs Using Controlled Release Amorphous Solid Dispersion Beads with Membrane-Reservoir Design: Effect of Pore Former and Coating Levels

Author: Lugtu-Pe, Jamie Anne, primary, Lin, Bing Ying, additional, Chen, Kuan, additional, Ghaffari, Alireza, additional, Kane, Anil, additional, and Wu, Xiao Yu, additional
Published: 2021
Full Text: View/download PDF

29. Lack of Bioequivalence between Two Aciclovir Tablets in Healthy Subjects

Author: Amini, Hossein, Javan, Mohammad, Gazerani, Parisa, Ghaffari, Alireza, and Ahmadiani, Abolhassan
Published: 2008
Full Text: View/download PDF

30. CNN2Gate: an implementation of convolutional neural networks inference on FPGAs with automated design space exploration

Author: Ghaffari, Alireza, Savaria, Yvon, Ghaffari, Alireza, and Savaria, Yvon
Abstract: Convolutional Neural Networks (CNNs) have a major impact on our society, because of the numerous services they provide. These services include, but are not limited to image classification, video analysis, and speech recognition. Recently, the number of researches that utilize FPGAs to implement CNNs are increasing rapidly. This is due to the lower power consumption and easy reconfigurability that are offered by these platforms. Because of the research efforts put into topics, such as architecture, synthesis, and optimization, some new challenges are arising for integrating suitable hardware solutions to high-level machine learning software libraries. This paper introduces an integrated framework (CNN2Gate), which supports compilation of a CNN model for an FPGA target. CNN2Gate is capable of parsing CNN models from several popular high-level machine learning libraries, such as Keras, Pytorch, Caffe2, etc. CNN2Gate extracts computation flow of layers, in addition to weights and biases, and applies a “given” fixed-point quantization. Furthermore, it writes this information in the proper format for the FPGA vendor’s OpenCL synthesis tools that are then used to build and run the project on FPGA. CNN2Gate performs design-space exploration and fits the design on different FPGAs with limited logic resources automatically. This paper reports results of automatic synthesis and design-space exploration of AlexNet and VGG-16 on various Intel FPGA platforms.
Published: 2020

31. Preparation and characterization of free mixed-film of pectin/chitosan/Eudragit ® RS intended for sigmoidal drug delivery

Author: Ghaffari, Alireza, Navaee, Kian, Oskoui, Mahvash, Bayati, Khosrow, and Rafiee-Tehrani, Morteza
Published: 2007
Full Text: View/download PDF

32. Efficient Design Space Exploration of OpenCL Kernels for FPGA Targets Using Black Box Optimization

Author: Ghaffari, Alireza, primary and Savaria, Yvon, additional
Published: 2021
Full Text: View/download PDF

33. CNN2Gate: An Implementation of Convolutional Neural Networks Inference on FPGAs with Automated Design Space Exploration

Author: Ghaffari, Alireza, primary and Savaria, Yvon, additional
Published: 2020
Full Text: View/download PDF

34. Research trend in partnering in construction journals

Author: Ghaffari, Alireza, primary and Jane, R, additional
Published: 2012
Full Text: View/download PDF

35. Concept of partnering in construction projects

Author: Ghaffari, Alireza, primary and Jane, R, additional
Published: 2012
Full Text: View/download PDF

36. Developing a model for profit sharing amongst partners in consulting engineering companies of construction projects

Author: Ghaffari, Alireza, primary and Jain, R, additional
Published: 2012
Full Text: View/download PDF

37. Comparison between 2‐D and 0‐D analytical models for slotless double‐sided inner armature linear permanent magnet synchronous machines

Author: Ghaffari, Alireza, primary, Rahideh, Akbar, additional, Ghaffari, Hamidreza, additional, Vahaj, Amirabbas, additional, and Mahmoudi, Amin, additional
Published: 2020
Full Text: View/download PDF

38. Manufacturing of Dexamethasone–Poly(d,l-Lactide-co-Glycolide) Implants Using Hot-Melt Extrusion: Within- and Between-Batch Product Performance Comparisons

Author: Kelley, Ryan A., primary, Ghaffari, Alireza, additional, Wang, Yan, additional, Choi, Stephanie, additional, Taylor, Jonathan R., additional, Hartman, Rachel R., additional, and Kompella, Uday B., additional
Published: 2020
Full Text: View/download PDF

39. 2-D ANALYTICAL MODEL FOR SLOTLESS DOUBLE-SIDED OUTER ARMATURE PERMANENT-MAGNET LINEAR MOTOR

Author: Ghaffari, Alireza, primary, Khalili, Farzaneh, additional, Vahaj, Amir Abbas, additional, Ghaffari, Hamidreza, additional, and Mahmoudi, Amin, additional
Published: 2020
Full Text: View/download PDF

40. 2-D Analytical Model for Outer-Rotor Consequent-Pole Brushless PM Machines

Author: Ghaffari, Alireza, primary, Rahideh, Akbar, additional, Moayed-Jahromi, Hossein, additional, Vahaj, AmirAbbas, additional, Mahmoudi, Amin, additional, and Soong, Wen Liang, additional
Published: 2019
Full Text: View/download PDF

41. Single Layer Extended Release Two-in-One Guaifenesin Matrix Tablet: Formulation Method, Optimization, Release Kinetics Evaluation and Its Comparison with Mucinex® Using Box-Behnken Design

Author: Morovati, Amirhosein, Ghaffari, Alireza, Erfani jabarian, Lale, and Mehramizi, Ali
Subjects: High dose modified release tablets, Binary mixture, Original Article, Highly water-soluble drug, Poor compressibility, Bilayer tablets
Abstract: Guaifenesin, a highly water-soluble active (50 mg/mL), classified as a BCS class I drug. Owing to its poor flowability and compressibility, formulating tablets especially high-dose one, may be a challenge. Direct compression may not be feasible. Bilayer tablet technology applied to Mucinex®, endures challenges to deliver a robust formulation. To overcome challenges involved in bilayer-tablet manufacturing and powder compressibility, an optimized single layer tablet prepared by a binary mixture (Two-in-one), mimicking the dual drug release character of Mucinex® was purposed. A 3-factor, 3-level Box-Behnken design was applied to optimize seven considered dependent variables (Release “%” in 1, 2, 4, 6, 8, 10 and 12 h) regarding different levels of independent one (X1: Cetyl alcohol, X2: Starch 1500®, X3: HPMC K100M amounts). Two granule portions were prepared using melt and wet granulations, blended together prior to compression. An optimum formulation was obtained (X1: 37.10, X2: 2, X3: 42.49 mg). Desirability function was 0.616. F2 and f1 between release profiles of Mucinex® and the optimum formulation were 74 and 3, respectively. An n-value of about 0.5 for both optimum and Mucinex® formulations showed diffusion (Fickian) control mechanism. However, HPMC K100M rise in 70 mg accompanied cetyl alcohol rise in 60 mg led to first order kinetic (n = 0.6962). The K values of 1.56 represented an identical burst drug releases. Cetyl alcohol and starch 1500® modulated guaifenesin release from HPMC K100M matrices, while due to their binding properties, improved its poor flowability and compressibility, too.
Published: 2017

42. Preparation and characterization of free mixed-film of pectin/chitosan/Eudragit® RS intended for sigmoidal drug delivery

Author: Ghaffari, Alireza, Navaee, Kian, Oskoui, Mahvash, Bayati, Khosrow, and Rafiee-Tehrani, Morteza
Published: 2007
Full Text: View/download PDF

43. Exact Two-Dimensional Analytical Calculations for Magnetic Field, Electromagnetic Torque, UMF, Back-EMF, and Inductance of Outer Rotor Surface Inset Permanent Magnet Machines

Author: Vahaj, AmirAbbas, primary, Rahideh, Akbar, additional, Moayed-Jahromi, Hossein, additional, and Ghaffari, AliReza, additional
Published: 2019
Full Text: View/download PDF

44. Toward High-Performance Implementation of 5G SCMA Algorithms

Author: Ghaffari, Alireza, primary, Leonardon, Mathieu, additional, Cassagne, Adrien, additional, Leroux, Camille, additional, and Savaria, Yvon, additional
Published: 2019
Full Text: View/download PDF

45. Analytical Model of Slotless Brushless PM Linear Motors Considering Different Magnetization Patterns

Author: Rahideh, A., primary, Ghaffari, Alireza, additional, Barzegar, Arman, additional, and Mahmoudi, Amin, additional
Published: 2018
Full Text: View/download PDF

46. Development of controlled release amorphous solid dispersions (CRASD) using polyvinyl acetate-based release retarding materials: Effect of dosage form design

Author: Lugtu-Pe, Jamie Anne, primary, Ghaffari, Alireza, additional, Chen, Kuan, additional, Kane, Anil, additional, and Wu, Xiao Yu, additional
Published: 2018
Full Text: View/download PDF

47. Microextraction of Furosemide from Human Serum and Its Fluorimetric Determination

Author: Rezazadeh, Akbar, primary, Amjadi, Mohammad, additional, L Manzoori, Jamshid, additional, Ghaffari, Alireza, additional, and Jouyban, Abolghasem, additional
Published: 2018
Full Text: View/download PDF

48. Improving performance of SCMA MPA decoders using estimation of conditional probabilities

Author: Ghaffari, Alireza, primary, Leonardon, Mathieu, additional, Savaria, Yvon, additional, Jego, Christophe, additional, and Leroux, Camille, additional
Published: 2017
Full Text: View/download PDF

49. Electrowetting based actuation of droplets with PDMS dielectric

Author: Ghaffari, Alireza, primary, Hojjat, Yousef, additional, Sadeghian, Hesam, additional, and Mokhtari, Mohammad, additional
Published: 2016
Full Text: View/download PDF

50. Complications and Carcinogenic Effects of Mustard Gas - a Systematic Review and Meta-Analysis in Iran

Author: Panahi, Yunes, primary, Gholami, Nasrin, additional, Ghojazadeh, Morteza, additional, Moslemi, Farnaz, additional, Naghavi-Behzad, Mohammad, additional, Azami-Aghdash, Saber, additional, Ghaffari, Alireza, additional, and Piri, Reza, additional
Published: 2015
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

177 results on '"Ghaffari, Alireza"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources