Author: "Hwang, Wen-Liang" / Publication Year Range: Last 3 years - Searchworks@Jio Institute Digital Library Search Results

Your search for '"Hwang, Wen-Liang"' returned 16 results

1. Convergence rates of stochastic gradient method with independent sequences of step-size and momentum weight

Author: Hwang, Wen-Liang
Subjects: Computer Science - Machine Learning, Mathematics - Optimization and Control
Abstract: In large-scale learning algorithms, the momentum term is usually included in the stochastic sub-gradient method to improve the learning speed because it can navigate ravines efficiently to reach a local minimum. However, step-size and momentum weight hyper-parameters must be appropriately tuned to optimize convergence. We thus analyze the convergence rate using stochastic programming with Polyak's acceleration of two commonly used step-size learning rates: "diminishing-to-zero" and "constant-and-drop" (where the sequence is divided into stages and a constant step-size is applied at each stage) under strongly convex functions over a compact convex set with bounded sub-gradients. For the former, we show that the convergence rate can be written as a product of exponential in step-size and polynomial in momentum weight. Our analysis justifies the convergence of using the default momentum weight setting and the diminishing-to-zero step-size sequence in large-scale machine learning software. For the latter, we present the condition for the momentum weight sequence to converge at each stage.
Published: 2024

2. RIP sensing matrices construction for sparsifying dictionaries with application to MRI imaging

Author: Ho, Jinn, Hwang, Wen-Liang, and Heinecke, Andreas
Subjects: Electrical Engineering and Systems Science - Signal Processing
Abstract: Practical applications of compressed sensing often restrict the choice of its two main ingredients. They may (i) prescribe using particular redundant dictionaries for certain classes of signals to become sparsely represented, or (ii) dictate specific measurement mechanisms which exploit certain physical principles. On the problem of RIP measurement matrix design in compressed sensing with redundant dictionaries, we give a simple construction to derive sensing matrices whose compositions with a prescribed dictionary satisfy the RIP with high probability in the $k \log(n/k)$ regime. Our construction thus provides recovery guarantees usually only attainable for sensing matrices from random ensembles with sparsifying orthonormal bases. Moreover, we use the dictionary factorization idea that our construction rests on in the application of magnetic resonance imaging, in which the sensing matrix is also prescribed by quantum mechanical principles. We propose a recovery algorithm based on transforming the acquired measurements such that the compressed sensing theory for RIP embeddings can be utilized to recover wavelet coefficients of the target image, and show its performance on examples from the fastMRI dataset.
Published: 2024

3. Generalization bounds for regression and classification on adaptive covering input domains

Author: Hwang, Wen-Liang
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning
Abstract: Our main focus is on the generalization bound, which serves as an upper limit for the generalization error. Our analysis delves into regression and classification tasks separately to ensure a thorough examination. We assume the target function is real-valued and Lipschitz continuous for regression tasks. We use the 2-norm and a root-mean-square-error (RMSE) variant to measure the disparities between predictions and actual values. In the case of classification tasks, we treat the target function as a one-hot classifier, representing a piece-wise constant function, and employ 0/1 loss for error measurement. Our analysis underscores the differing sample complexity required to achieve a concentration inequality of generalization bounds, highlighting the variation in learning efficiency for regression and classification tasks. Furthermore, we demonstrate that the generalization bounds for regression and classification functions are inversely proportional to a polynomial of the number of parameters in a network, with the degree depending on the hypothesis class and the network architecture. These findings emphasize the advantages of over-parameterized networks and elucidate the conditions for benign overfitting in such systems.
Published: 2024

4. Directional proximal point method for convex optimization

Author: Hwang, Wen-Liang and Yueh, Chang-Wei
Subjects: Mathematics - Optimization and Control
Abstract: The use of proximal point operators for optimization can be computationally expensive when the dimensionality of a function (i.e., the number of variables) is high. In this study, we sought to reduce the cost of calculating proximal point operators by developing a directional operator in which the proximal regularization of a function along a specific direction is penalized. We used this operator in a novel approach to optimization, referred to as the directional proximal point method (Direction PPM). When using Direction PPM, the key to achieving convergence is the selection of direction sequences for directional proximal point operators. In this paper, we present the conditions/assumptions by which to derive directions capable of achieving global convergence for convex functions. Considered a light version of PPM, Direction PPM uses scalar optimization to derive a stable step-size via a direction envelope function and an auxiliary method to derive a direction sequence that satisfies the assumptions. This makes Direction PPM adaptable to a larger class of functions. Through applications to differentiable convex functions, we demonstrate that negative gradient directions at the current iterates could conceivably be used to achieve this end. We provide experimental results to illustrate the efficacy of Direction PPM in practice.
Published: 2023

5. Representation and decomposition of functions in DAG-DNNs and structural network pruning

Author: Hwang, Wen-Liang
Subjects: Computer Science - Machine Learning
Abstract: The conclusions provided by deep neural networks (DNNs) must be carefully scrutinized to determine whether they are universal or architecture dependent. The term DAG-DNN refers to a graphical representation of a DNN in which the architecture is expressed as a directed acyclic graph (DAG), on which arcs are associated with functions. The level of a node denotes the maximum number of hops between the input node and the node of interest. In the current study, we demonstrate that DAG-DNNs can be used to derive all functions defined on various sub-architectures of the DNN. We also demonstrate that the functions defined in a DAG-DNN can be derived via a sequence of lower-triangular matrices, each of which provides the transition of functions defined in sub-graphs up to nodes at a specified level. The lifting structure associated with lower-triangular matrices makes it possible to perform the structural pruning of a network in a systematic manner. The fact that decomposition is universally applicable to all DNNs means that network pruning could theoretically be applied to any DNN, regardless of the underlying architecture. We demonstrate that it is possible to obtain the winning ticket (sub-network and initialization) for a weak version of the lottery ticket hypothesis, based on the fact that the sub-network with initialization can achieve training performance on par with that of the original network using the same number of iterations or fewer.
Published: 2023

6. Global Convergence in Learning Fully-Connected ReLU Networks Via Un-rectifying Based on the Augmented Lagrangian Approach

Author: Tung, Shih-Shuo, Chung, Ming-Yu, Ho, Jinn, and Hwang, Wen-Liang
Published: 2024
Full Text: View/download PDF

7. Deriving RIP sensing matrices for sparsifying dictionaries

Author: Ho, Jinn and Hwang, Wen-Liang
Subjects: Computer Science - Information Theory
Abstract: Compressive sensing involves the inversion of a mapping $SD \in \mathbb{R}^{m \times n}$, where $m < n$, $S$ is a sensing matrix, and $D$ is a sparsifying dictionary. The restricted isometry property is a powerful sufficient condition for the inversion that guarantees the recovery of high-dimensional sparse vectors from their low-dimensional embedding into a Euclidean space via convex optimization. However, determining whether $SD$ has the restricted isometry property for a given sparsifying dictionary is an NP-hard problem, hampering the application of compressive sensing. This paper provides a novel approach to resolving this problem. We demonstrate that it is possible to derive a sensing matrix for any sparsifying dictionary with a high probability of retaining the restricted isometry property. In numerical experiments with sensing matrices for K-SVD, Parseval K-SVD, and wavelets, our recovery performance was comparable to that of benchmarks obtained using Gaussian and Bernoulli random sensing matrices for sparse vectors.
Published: 2022

8. Analysis of function approximation and stability of general DNNs in directed acyclic graphs using un-rectifying analysis

Author: Hwang, Wen-Liang and Tung, Shih-Shuo
Subjects: Computer Science - Machine Learning
Abstract: A general lack of understanding pertaining to deep feedforward neural networks (DNNs) can be attributed partly to a lack of tools with which to analyze the composition of non-linear functions, and partly to a lack of mathematical models applicable to the diversity of DNN architectures. In this paper, we made a number of basic assumptions pertaining to activation functions, non-linear transformations, and DNN architectures in order to use the un-rectifying method to analyze DNNs via directed acyclic graphs (DAGs). DNNs that satisfy these assumptions are referred to as general DNNs. Our construction of an analytic graph was based on an axiomatic method in which DAGs are built from the bottom up through the application of atomic operations to basic elements in accordance with regulatory rules. This approach allows us to derive the properties of general DNNs via mathematical induction. We show that, using the proposed approach, some properties that hold true for general DNNs can be derived. This analysis advances our understanding of network functions and could promote further theoretical insights if the host of analytical tools for graphs can be leveraged. Comment: 26 pages, 14 figures
Published: 2022

9. Unconstrained optimization using the directional proximal point method

Author: Chung, Ming-Yu, Ho, Jinn, and Hwang, Wen-Liang
Subjects: Mathematics - Optimization and Control, 90C25, 90C26, G.1.6
Abstract: This paper presents a directional proximal point method (DPPM) to derive the minimum of any $C^1$-smooth function $f$. The proposed method requires that the function possess a local convex segment along the descent direction at any non-critical point (referred to as a DLC direction at the point). The proposed DPPM can determine a DLC direction by solving a two-dimensional quadratic optimization problem, regardless of the dimensionality of the function variables. Along that direction, the DPPM then updates by solving a one-dimensional optimization problem. This gives the DPPM an advantage over competing methods when dealing with large-scale problems involving a large number of variables. We show that the DPPM converges to critical points of $f$. We also provide conditions under which the entire DPPM sequence converges to a single critical point. For strongly convex quadratic functions, we demonstrate that the rate at which the error sequence converges to zero can be R-superlinear, regardless of the dimension of variables. Comment: 29 pages, 12 figures
Published: 2022

10. Increasing the Speed of Banking Cybersecurity Systems Based on Intelligent Data Analysis and Artificial Intelligence Algorithms for Predicting Cyberattacks. I

Author: Khilenko, V., Akhmetov, B., Berdibayev, R., Lakhno, V., Harchenko, Yu., Hwang, Wen-Liang, and Khylenko, Jr., V.
Published: 2023
Full Text: View/download PDF

11. Image Restoration and Noise Reduction with Context-Dependent Wavelet Graph and ADMM Optimization

Author: Ho, Jinn, primary, Tung, Shih-Shuo, additional, and Hwang, Wen-Liang, additional
Published: 2022
Full Text: View/download PDF

12. RIP Sensing Matrices Construction for Sparsifying Dictionaries with Application to MRI Imaging

Author: Ho, J., primary, Hwang, Wen-Liang, additional, and Heinecke, A., additional
Published: 2024
Full Text: View/download PDF

13. Keynote Speech I: Representation and decomposition of functions in DAG-DNNs and structural network pruning

Author: Hwang, Wen-Liang, primary
Published: 2023
Full Text: View/download PDF

14. Analysis of Function Approximation and Stability of General DNNs in Directed Acyclic Graphs Using Un-Rectifying Analysis

Author: Hwang, Wen-Liang, primary and Tung, Shih-Shuo, additional
Published: 2023
Full Text: View/download PDF

15. Depth Extraction from a Single Image and Its Application

Author: Hwang, Wen-Liang and Tung, Shih-Shuo
Subjects: Computers / Information Technology
Abstract: In this chapter, a method for generating depth maps is presented. To generate the depth map from an image, the proposed approach applies a sequence of blurring and deblurring operations to a point to determine the depth of that point. The proposed method makes no assumptions with regard to the properties of the scene in resolving depth ambiguity in complex images. Since applications involving depth map manipulation can be achieved by obtaining all-in-focus images through a deblurring operation and then blurring the obtained images, we present methods to derive all-in-focus images from our depth maps. Furthermore, 2D to 3D conversion can also be achieved from the estimated depth map. Several demonstrations in this chapter show the performance and applications of the estimated depth map.
Published: 2022

16. Null Space Component Analysis of One-Shot Single-Channel Source Separation Problem.

Author: Hwang, Wen-Liang and Ho, Jinn
Subjects: *BLIND source separation, *TIME-frequency analysis, *NOISE measurement, *ELECTROCARDIOGRAPHY, *TASK analysis
Abstract: Extracting multiple unknown sources from a single observation of a single channel is an ill-posed problem encountered in a variety of applications. This paper characterizes the ambiguity of solutions to the source separation problem, and then proposes a novel adaptive-operator-based approach to deriving solutions based on a combination of separation operators and domain-specific knowledge related to sources. The proposed scheme involves transforming the original problem into a new problem, in which data-dependent operators and the unknown sources are variables to be optimized. We demonstrate that a solution to the proposed optimization problem must reside in the null spaces of the operators, and any such solution also provides an optimal value to the original problem. We then demonstrate the applicability of the proposed method to the separation of sparse sources as well as AM-FM sources. Note that the proposed scheme outperformed corresponding state-of-the-art methods in noiseless as well as noisy environments. Finally, we demonstrate the efficacy of the proposed scheme in separation tasks based on real-world ECG data (i.e., extracting fetal ECG signals from noisy observations in which maternal and fetal ECG recordings are superimposed) and electrical data (i.e., separating singularities from harmonic components in an observation of noisy data related to surges in electrical current). [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF
