Author: "Yang, Jerry Zhijian" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Yang, Jerry Zhijian"' showing total 117 results

Start Over Author "Yang, Jerry Zhijian"

117 results on '"Yang, Jerry Zhijian"'

1. Deep Transfer Learning: Model Framework and Error Analysis

Author: Jiao, Yuling, Lin, Huazhen, Luo, Yuchen, and Yang, Jerry Zhijian
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: This paper presents a framework for deep transfer learning, which aims to leverage information from multi-domain upstream data with a large number of samples $n$ to a single-domain downstream task with a considerably smaller number of samples $m$, where $m \ll n$, in order to enhance performance on downstream task. Our framework has several intriguing features. First, it allows the existence of both shared and specific features among multi-domain data and provides a framework for automatic identification, achieving precise transfer and utilization of information. Second, our model framework explicitly indicates the upstream features that contribute to downstream tasks, establishing a relationship between upstream domains and downstream tasks, thereby enhancing interpretability. Error analysis demonstrates that the transfer under our framework can significantly improve the convergence rate for learning Lipschitz functions in downstream supervised tasks, reducing it from $\tilde{O}(m^{-\frac{1}{2(d+2)}}+n^{-\frac{1}{2(d+2)}})$ ("no transfer") to $\tilde{O}(m^{-\frac{1}{2(d^*+3)}} + n^{-\frac{1}{2(d+2)}})$ ("partial transfer"), and even to $\tilde{O}(m^{-1/2}+n^{-\frac{1}{2(d+2)}})$ ("complete transfer"), where $d^* \ll d$ and $d$ is the dimension of the observed data. Our theoretical findings are substantiated by empirical experiments conducted on image classification datasets, along with a regression dataset.
Published: 2024

2. Unsupervised Transfer Learning via Adversarial Contrastive Training

Author: Duan, Chenguang, Jiao, Yuling, Lin, Huazhen, Ma, Wensen, and Yang, Jerry Zhijian
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning
Abstract: Learning a data representation for downstream supervised learning tasks under unlabeled scenario is both critical and challenging. In this paper, we propose a novel unsupervised transfer learning approach using adversarial contrastive training (ACT). Our experimental results demonstrate outstanding classification accuracy with both fine-tuned linear probe and K-NN protocol across various datasets, showing competitiveness with existing state-of-the-art self-supervised learning methods. Moreover, we provide an end-to-end theoretical guarantee for downstream classification tasks in a misspecified, over-parameterized setting, highlighting how a large amount of unlabeled data contributes to prediction accuracy. Our theoretical findings suggest that the testing error of downstream tasks depends solely on the efficiency of data augmentation used in ACT when the unlabeled sample size is sufficiently large. This offers a theoretical understanding of learning downstream tasks with a small sample size.
Published: 2024

3. DRM Revisited: A Complete Error Analysis

Author: Jiao, Yuling, Li, Ruoxuan, Wu, Peiying, Yang, Jerry Zhijian, and Zhang, Pingwen
Subjects: Mathematics - Numerical Analysis, Computer Science - Machine Learning
Abstract: In this work, we address a foundational question in the theoretical analysis of the Deep Ritz Method (DRM) under the over-parameteriztion regime: Given a target precision level, how can one determine the appropriate number of training samples, the key architectural parameters of the neural networks, the step size for the projected gradient descent optimization procedure, and the requisite number of iterations, such that the output of the gradient descent process closely approximates the true solution of the underlying partial differential equation to the specified precision?
Published: 2024

4. Solving the inverse source problem of the fractional Poisson equation by MC-fPINNs

Author: Sheng, Rui, Wu, Peiying, Yang, Jerry Zhijian, and Yuan, Cheng
Subjects: Mathematics - Numerical Analysis, 68T07, 65M12, 62G05
Abstract: In this paper, we effectively solve the inverse source problem of the fractional Poisson equation using MC-fPINNs. We construct two neural networks $ u_{NN}(x;\theta )$ and $f_{NN}(x;\psi)$ to approximate the solution $u^{*}(x)$ and the forcing term $f^{*}(x)$ of the fractional Poisson equation. To optimize these two neural networks, we use the Monte Carlo sampling method mentioned in MC-fPINNs and define a new loss function combining measurement data and the underlying physical model. Meanwhile, we present a comprehensive error analysis for this method, along with a prior rule to select the appropriate parameters of neural networks. Several numerical examples are given to demonstrate the great precision and robustness of this method in solving high-dimensional problems up to 10D, with various fractional order $\alpha$ and different noise levels of the measurement data ranging from 1$\%$ to 10$\%$.
Published: 2024

5. Characteristic Learning for Provable One Step Generation

Author: Ding, Zhao, Duan, Chenguang, Jiao, Yuling, Li, Ruoxuan, Yang, Jerry Zhijian, and Zhang, Pingwen
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Mathematics - Numerical Analysis, Mathematics - Statistics Theory
Abstract: We propose the characteristic generator, a novel one-step generative model that combines the efficiency of sampling in Generative Adversarial Networks (GANs) with the stable performance of flow-based models. Our model is driven by characteristics, along which the probability density transport can be described by ordinary differential equations (ODEs). Specifically, We estimate the velocity field through nonparametric regression and utilize Euler method to solve the probability flow ODE, generating a series of discrete approximations to the characteristics. We then use a deep neural network to fit these characteristics, ensuring a one-step mapping that effectively pushes the prior distribution towards the target distribution. In the theoretical aspect, we analyze the errors in velocity matching, Euler discretization, and characteristic fitting to establish a non-asymptotic convergence rate for the characteristic generator in 2-Wasserstein distance. To the best of our knowledge, this is the first thorough analysis for simulation-free one step generative models. Additionally, our analysis refines the error analysis of flow-based generative models in prior works. We apply our method on both synthetic and real datasets, and the results demonstrate that the characteristic generator achieves high generation quality with just a single evaluation of neural network.
Published: 2024

6. A Stabilized Physics Informed Neural Networks Method for Wave Equations

Author: Jiao, Yuling, Liu, Yuhui, Yang, Jerry Zhijian, and Yuan, Cheng
Subjects: Mathematics - Numerical Analysis, Mathematical Physics, 68T07, 65M12, 62G05
Abstract: In this article, we propose a novel Stabilized Physics Informed Neural Networks method (SPINNs) for solving wave equations. In general, this method not only demonstrates theoretical convergence but also exhibits higher efficiency compared to the original PINNs. By replacing the $L^2$ norm with $H^1$ norm in the learning of initial condition and boundary condition, we theoretically proved that the error of solution can be upper bounded by the risk in SPINNs. Based on this, we decompose the error of SPINNs into approximation error, statistical error and optimization error. Furthermore, by applying the approximating theory of $ReLU^3$ networks and the learning theory on Rademacher complexity, covering number and pseudo-dimension of neural networks, we present a systematical non-asymptotic convergence analysis on our method, which shows that the error of SPINNs can be well controlled if the number of training samples, depth and width of the deep neural networks have been appropriately chosen. Two illustrative numerical examples on 1-dimensional and 2-dimensional wave equations demonstrate that SPINNs can achieve a faster and better convergence than classical PINNs method.
Published: 2024

7. Deep conditional distribution learning via conditional F\'ollmer flow

Author: Chang, Jinyuan, Ding, Zhao, Jiao, Yuling, Li, Ruoxuan, and Yang, Jerry Zhijian
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning
Abstract: We introduce an ordinary differential equation (ODE) based deep generative method for learning conditional distributions, named Conditional F\"ollmer Flow. Starting from a standard Gaussian distribution, the proposed flow could approximate the target conditional distribution very well when the time is close to 1. For effective implementation, we discretize the flow with Euler's method where we estimate the velocity field nonparametrically using a deep neural network. Furthermore, we also establish the convergence result for the Wasserstein-2 distance between the distribution of the learned samples and the target conditional distribution, providing the first comprehensive end-to-end error analysis for conditional distribution learning via ODE flow. Our numerical experiments showcase its effectiveness across a range of scenarios, from standard nonparametric conditional density estimation problems to more intricate challenges involving image data, illustrating its superiority over various existing conditional density estimation methods., Comment: The original title of this paper is "Deep Conditional Generative Learning: Model and Error Analysis"
Published: 2024

8. Semi-Supervised Deep Sobolev Regression: Estimation, Variable Selection and Beyond

Author: Ding, Zhao, Duan, Chenguang, Jiao, Yuling, and Yang, Jerry Zhijian
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning, 62G05, 62G08, 65N21
Abstract: We propose SDORE, a semi-supervised deep Sobolev regressor, for the nonparametric estimation of the underlying regression function and its gradient. SDORE employs deep neural networks to minimize empirical risk with gradient norm regularization, allowing computation of the gradient norm on unlabeled data. We conduct a comprehensive analysis of the convergence rates of SDORE and establish a minimax optimal rate for the regression function. Crucially, we also derive a convergence rate for the associated plug-in gradient estimator, even in the presence of significant domain shift. These theoretical findings offer valuable prior guidance for selecting regularization parameters and determining the size of the neural network, while showcasing the provable advantage of leveraging unlabeled data in semi-supervised learning. To the best of our knowledge, SDORE is the first provable neural network-based approach that simultaneously estimates the regression function and its gradient, with diverse applications including nonparametric variable selection and inverse problems. The effectiveness of SDORE is validated through an extensive range of numerical simulations and real data analysis.
Published: 2024

9. Non-asymptotic Approximation Error Bounds of Parameterized Quantum Circuits

Author: Yu, Zhan, Chen, Qiuhao, Jiao, Yuling, Li, Yinan, Lu, Xiliang, Wang, Xin, and Yang, Jerry Zhijian
Subjects: Quantum Physics, Computer Science - Machine Learning
Abstract: Parameterized quantum circuits (PQCs) have emerged as a promising approach for quantum neural networks. However, understanding their expressive power in accomplishing machine learning tasks remains a crucial question. This paper investigates the expressivity of PQCs for approximating general multivariate function classes. Unlike previous Universal Approximation Theorems for PQCs, which are either nonconstructive or rely on parameterized classical data processing, we explicitly construct data re-uploading PQCs for approximating multivariate polynomials and smooth functions. We establish the first non-asymptotic approximation error bounds for these functions in terms of the number of qubits, quantum circuit depth, and number of trainable parameters. Notably, we demonstrate that for approximating functions that satisfy specific smoothness criteria, the quantum circuit size and number of trainable parameters of our proposed PQCs can be smaller than those of deep ReLU neural networks. We further validate the approximation capability of PQCs through numerical experiments. Our results provide a theoretical foundation for designing practical PQCs and quantum neural networks for machine learning tasks that can be implemented on near-term quantum devices, paving the way for the advancement of quantum machine learning., Comment: 32 pages including appendix. To appear at NeurIPS 2024
Published: 2023

10. Current density impedance imaging with PINNs

Author: Duan, Chenguang, Jiao, Yuling, Lu, Xiliang, and Yang, Jerry Zhijian
Subjects: Mathematics - Numerical Analysis, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: In this paper, we introduce CDII-PINNs, a computationally efficient method for solving CDII using PINNs in the framework of Tikhonov regularization. This method constructs a physics-informed loss function by merging the regularized least-squares output functional with an underlying differential equation, which describes the relationship between the conductivity and voltage. A pair of neural networks representing the conductivity and voltage, respectively, are coupled by this loss function. Then, minimizing the loss function provides a reconstruction. A rigorous theoretical guarantee is provided. We give an error analysis for CDII-PINNs and establish a convergence rate, based on prior selected neural network parameters in terms of the number of samples. The numerical simulations demonstrate that CDII-PINNs are efficient, accurate and robust to noise levels ranging from $1\%$ to $20\%$.
Published: 2023

11. Deep Neural Network Approximation of Composition Functions: with application to PINNs

Author: Duan, Chenguang, Jiao, Yuling, Lu, Xiliang, Yang, Jerry Zhijian, and Yuan, Cheng
Subjects: Mathematics - Numerical Analysis, Physics - Computational Physics, 68T07, 65N99
Abstract: In this paper, we focus on approximating a natural class of functions that are compositions of smooth functions. Unlike the low-dimensional support assumption on the covariate, we demonstrate that composition functions have an intrinsic sparse structure if we assume each layer in the composition has a small degree of freedom. This fact can alleviate the curse of dimensionality in approximation errors by neural networks. Specifically, by using mathematical induction and the multivariate Faa di Bruno formula, we extend the approximation theory of deep neural networks to the composition functions case. Furthermore, combining recent results on the statistical error of deep learning, we provide a general convergence rate analysis for the PINNs method in solving elliptic equations with compositional solutions. We also present two simple illustrative numerical examples to demonstrate the effect of the intrinsic sparse structure in regression and solving PDEs., Comment: There are errors in the crucial Lemma 3.1, which is a result from our previous work that has not undergone peer review. During the refinement of this manuscript, one of our colleagues pointed out a potential mistake in the proof of this result, indicating that certain corrections are needed to ensure its correctness. To uphold academic rigor, we decide to withdraw the paper at this time
Published: 2023

12. GAS: A Gaussian Mixture Distribution-Based Adaptive Sampling Method for PINNs

Author: Jiao, Yuling, Li, Di, Lu, Xiliang, Yang, Jerry Zhijian, and Yuan, Cheng
Subjects: Computer Science - Machine Learning, Physics - Computational Physics, 68T07, 65N99
Abstract: With the recent study of deep learning in scientific computation, the Physics-Informed Neural Networks (PINNs) method has drawn widespread attention for solving Partial Differential Equations (PDEs). Compared to traditional methods, PINNs can efficiently handle high-dimensional problems, but the accuracy is relatively low, especially for highly irregular problems. Inspired by the idea of adaptive finite element methods and incremental learning, we propose GAS, a Gaussian mixture distribution-based adaptive sampling method for PINNs. During the training procedure, GAS uses the current residual information to generate a Gaussian mixture distribution for the sampling of additional points, which are then trained together with historical data to speed up the convergence of the loss and achieve higher accuracy. Several numerical simulations on 2D and 10D problems show that GAS is a promising method that achieves state-of-the-art accuracy among deep solvers, while being comparable with traditional numerical solvers.
Published: 2023
Full Text: View/download PDF

13. Stability and convergence analysis of high-order numerical schemes with DtN-type absorbing boundary conditions for nonlocal wave equations

Author: Wang, Jihong, Yang, Jerry Zhijian, and Zhang, Jiwei
Subjects: Mathematics - Numerical Analysis
Abstract: The stability and convergence analysis of high-order numerical approximations for the one- and two-dimensional nonlocal wave equations on unbounded spatial domains are considered. We first use the quadrature-based finite difference schemes to discretize the spatially nonlocal operator, and apply the explicit difference scheme to approximate the temporal derivative to achieve a fully discrete infinity system. After that, we construct the Dirichlet-to-Neumann (DtN)-type absorbing boundary conditions (ABCs) to reduce the infinite discrete system into a finite discrete system. To do so, we first adopt the idea in [Du, Zhang and Zheng, \emph{Commun. Comput. Phys.}, 24(4):1049--1072, 2018 and Du, Han, Zhang and Zheng, \emph{SIAM J. Sci. Comp.}, 40(3):A1430--A1445, 2018] to derive the Dirichlet-to-Dirichlet (DtD)-type mappings for one- and two-dimensional cases, respectively. We then use the discrete nonlocal Green's first identity to achieve the discrete DtN-type mappings from the DtD-type mappings. The resulting DtN-type mappings make it possible to perform the stability and convergence analysis of the reduced problem. Numerical experiments are provided to demonstrate the accuracy and effectiveness of the proposed approach., Comment: 26 pages, 4 figures
Published: 2022

14. A sturcture-preserving, upwind-SAV scheme for the degenerate Cahn--Hilliard equation with applications to simulating surface diffusion

Author: Huang, Qiong-Ao, Jiang, Wei, Yang, Jerry Zhijian, and Yuan, Cheng
Subjects: Mathematics - Numerical Analysis, Physics - Computational Physics
Abstract: This paper establishes a structure-preserving numerical scheme for the Cahn--Hilliard equation with degenerate mobility. First, by applying a finite volume method with upwind numerical fluxes to the degenerate Cahn--Hilliard equation rewritten by the scalar auxiliary variable (SAV) approach, we creatively obtain an unconditionally bound-preserving, energy-stable and fully-discrete scheme, which, for the first time, addresses the boundedness of the classical SAV approach under $H^{-1}$-gradient flow. Then, a dimensional-splitting technique is introduced in high-dimensional cases, which greatly reduces the computational complexity while preserves original structural properties. Numerical experiments are presented to verify the bound-preserving and energy-stable properties of the proposed scheme. Finally, by applying the proposed structure-preserving scheme, we numerically demonstrate that surface diffusion can be approximated by the Cahn--Hilliard equation with degenerate mobility and Flory--Huggins potential when the absolute temperature is sufficiently low, which agrees well with the theoretical result by using formal asymptotic analysis.wn theoretically by formal matched asymptotics.
Published: 2022

15. An error analysis for deep binary classification with sigmoid loss

Author: Li, Changshi, Jiao, Yuling, and Yang, Jerry Zhijian
Published: 2024
Full Text: View/download PDF

16. A Gaussian mixture distribution-based adaptive sampling method for physics-informed neural networks

Author: Jiao, Yuling, Li, Di, Lu, Xiliang, Yang, Jerry Zhijian, and Yuan, Cheng
Published: 2024
Full Text: View/download PDF

17. Current density impedance imaging with PINNs

Author: Duan, Chenguang, Huang, Junjun, Jiao, Yuling, Lu, Xiliang, and Yang, Jerry Zhijian
Published: 2024
Full Text: View/download PDF

18. Analysis of Deep Ritz Methods for Laplace Equations with Dirichlet Boundary Conditions

Author: Duan, Chenguang, Jiao, Yuling, Lai, Yanming, Lu, Xiliang, Quan, Qimeng, and Yang, Jerry Zhijian
Subjects: Mathematics - Numerical Analysis
Abstract: Deep Ritz methods (DRM) have been proven numerically to be efficient in solving partial differential equations. In this paper, we present a convergence rate in $H^{1}$ norm for deep Ritz methods for Laplace equations with Dirichlet boundary condition, where the error depends on the depth and width in the deep neural networks and the number of samples explicitly. Further we can properly choose the depth and width in the deep neural networks in terms of the number of training samples. The main idea of the proof is to decompose the total error of DRM into three parts, that is approximation error, statistical error and the error caused by the boundary penalty. We bound the approximation error in $H^{1}$ norm with $\mathrm{ReLU}^{2}$ networks and control the statistical error via Rademacher complexity. In particular, we derive the bound on the Rademacher complexity of the non-Lipschitz composition of gradient norm with $\mathrm{ReLU}^{2}$ network, which is of immense independent interest. We also analysis the error inducing by the boundary penalty method and give a prior rule for tuning the penalty parameter., Comment: arXiv admin note: substantial text overlap with arXiv:2103.13330; text overlap with arXiv:2109.01780
Published: 2021

19. Global Optimization via Schr{\'o}dinger-F{\'o}llmer Diffusion

Author: Dai, Yin, Jiao, Yuling, Kang, Lican, Lu, Xiliang, and Yang, Jerry Zhijian
Subjects: Mathematics - Optimization and Control
Abstract: We study the problem of finding global minimizers of $V(x):\mathbb{R}^d\rightarrow\mathbb{R}$ approximately via sampling from a probability distribution $\mu_{\sigma}$ with density $p_{\sigma}(x)=\dfrac{\exp(-V(x)/\sigma)}{\int_{\mathbb R^d} \exp(-V(y)/\sigma) dy }$ with respect to the Lebesgue measure for $\sigma \in (0,1]$ small enough. We analyze a sampler based on the Euler-Maruyama discretization of the Schr{\"o}dinger-F{\"o}llmer diffusion processes with stochastic approximation under appropriate assumptions on the step size $s$ and the potential $V$. We prove that the output of the proposed sampler is an approximate global minimizer of $V(x)$ with high probability at cost of sampling $\mathcal{O}(d^{3})$ standard normal random variables. Numerical studies illustrate the effectiveness of the proposed method and its superiority to the Langevin method., Comment: arXiv admin note: text overlap with arXiv:2107.04766
Published: 2021

20. A rate of convergence of Physics Informed Neural Networks for the linear second order elliptic PDEs

Author: Jiao, Yuling, Lai, Yanming, Li, Dingwei, Lu, Xiliang, Wang, Fengru, Wang, Yang, and Yang, Jerry Zhijian
Subjects: Mathematics - Numerical Analysis
Abstract: In recent years, physical informed neural networks (PINNs) have been shown to be a powerful tool for solving PDEs empirically. However, numerical analysis of PINNs is still missing. In this paper, we prove the convergence rate to PINNs for the second order elliptic equations with Dirichlet boundary condition, by establishing the upper bounds on the number of training samples, depth and width of the deep neural networks to achieve desired accuracy. The error of PINNs is decomposed into approximation error and statistical error, where the approximation error is given in $C^2$ norm with $\mathrm{ReLU}^{3}$ networks (deep network with activations function $\max\{0,x^3\}$) and the statistical error is estimated by Rademacher complexity. We derive the bound on the Rademacher complexity of the non-Lipschitz composition of gradient norm with $\mathrm{ReLU}^{3}$ network, which is of immense independent interest., Comment: arXiv admin note: text overlap with arXiv:2103.13330
Published: 2021
Full Text: View/download PDF

21. A Structure-Preserving, Upwind-SAV Scheme for the Degenerate Cahn–Hilliard Equation with Applications to Simulating Surface Diffusion

Author: Huang, Qiong-Ao, Jiang, Wei, Yang, Jerry Zhijian, and Yuan, Cheng
Published: 2023
Full Text: View/download PDF

22. A stable and high-order numerical scheme with discrete DtN-type artificial boundary conditions for a 2D peridynamic diffusion model

Author: Wang, Jihong, Di, Yana, Yang, Jerry Zhijian, and Zhang, Jiwei
Published: 2024
Full Text: View/download PDF

23. Deep Neural Networks with ReLU-Sine-Exponential Activations Break Curse of Dimensionality in Approximation on H\'older Class

Author: Jiao, Yuling, Lai, Yanming, Lu, Xiliang, Wang, Fengru, Yang, Jerry Zhijian, and Yang, Yuanyuan
Subjects: Computer Science - Machine Learning, Computer Science - Neural and Evolutionary Computing, Mathematics - Numerical Analysis
Abstract: In this paper, we construct neural networks with ReLU, sine and $2^x$ as activation functions. For general continuous $f$ defined on $[0,1]^d$ with continuity modulus $\omega_f(\cdot)$, we construct ReLU-sine-$2^x$ networks that enjoy an approximation rate $\mathcal{O}(\omega_f(\sqrt{d})\cdot2^{-M}+\omega_{f}\left(\frac{\sqrt{d}}{N}\right))$, where $M,N\in \mathbb{N}^{+}$ denote the hyperparameters related to widths of the networks. As a consequence, we can construct ReLU-sine-$2^x$ network with the depth $5$ and width $\max\left\{\left\lceil2d^{3/2}\left(\frac{3\mu}{\epsilon}\right)^{1/{\alpha}}\right\rceil,2\left\lceil\log_2\frac{3\mu d^{\alpha/2}}{2\epsilon}\right\rceil+2\right\}$ that approximates $f\in \mathcal{H}_{\mu}^{\alpha}([0,1]^d)$ within a given tolerance $\epsilon >0$ measured in $L^p$ norm $p\in[1,\infty)$, where $\mathcal{H}_{\mu}^{\alpha}([0,1]^d)$ denotes the H\"older continuous function class defined on $[0,1]^d$ with order $\alpha \in (0,1]$ and constant $\mu > 0$. Therefore, the ReLU-sine-$2^x$ networks overcome the curse of dimensionality on $\mathcal{H}_{\mu}^{\alpha}([0,1]^d)$. In addition to its supper expressive power, functions implemented by ReLU-sine-$2^x$ networks are (generalized) differentiable, enabling us to apply SGD to train.
Published: 2021

24. Linear multi-step methods and their numerical stability for solving gradient flow equations

Author: Huang, Qiong-Ao, Jiang, Wei, Yang, Jerry Zhijian, and Zhang, Gengen
Published: 2023
Full Text: View/download PDF

25. Deep Conditional Generative Learning: Model and Error Analysis

Author: Chang, Jinyuan, Ding, Zhao, Jiao, Yuling, Li, Ruoxuan, Yang, Jerry Zhijian, Chang, Jinyuan, Ding, Zhao, Jiao, Yuling, Li, Ruoxuan, and Yang, Jerry Zhijian
Abstract: We introduce an Ordinary Differential Equation (ODE) based deep generative method for learning a conditional distribution, named the Conditional Follmer Flow. Starting from a standard Gaussian distribution, the proposed flow could efficiently transform it into the target conditional distribution at time 1. For effective implementation, we discretize the flow with Euler's method where we estimate the velocity field nonparametrically using a deep neural network. Furthermore, we derive a non-asymptotic convergence rate in the Wasserstein distance between the distribution of the learned samples and the target distribution, providing the first comprehensive end-to-end error analysis for conditional distribution learning via ODE flow. Our numerical experiments showcase its effectiveness across a range of scenarios, from standard nonparametric conditional density estimation problems to more intricate challenges involving image data, illustrating its superiority over various existing conditional density estimation methods.
Published: 2024

26. Approximate Policy Iteration With Deep Minimax Average Bellman Error Minimization

Author: Kang, Lican, primary, Liu, Yuhui, additional, Luo, Yuan, additional, Yang, Jerry Zhijian, additional, Yuan, Han, additional, and Zhu, Chang, additional
Published: 2024
Full Text: View/download PDF

27. Calculation of Cauchy stress tensor in molecular dynamics system with a generalized Irving-Kirkwood formulism

Author: Yang, Jerry Zhijian and Du, Shukai
Subjects: Physics - Computational Physics
Abstract: Irving and Kirkwood formulism (IK formulism) provides a way to compute continuum mechanics quantities at certain location in terms of molecular variables. To make the approach more practical in computer simulation, Hardy proposed to use a spacial kernel function that couples continuum quantities with atomistic information. To reduce irrational fluctuations, Murdoch proposed to use a temporal kernel function to smooth the physical quantities obtained in Hardy's approach. In this paper, we generalize the original IK formulism to systematically incorporate both spacial and temporal average. The Cauchy stress tensor is derived in this generalized IK formulism (g-IK formulism). Analysis is given to illuminate the connection and difference between g-IK formulism and traditional temporal post-process approach. The relationship between Cauchy stress and first Piola-Kirchhoff stress is restudied in the framework of g-IK formulism. Numerical experiments using molecular dynamics are conducted to examine the analysis results.
Published: 2014

28. Accurate Evaluations of Strain and Stress in Atomistic Simulations of Crystalline Solids

Author: Yang, Jerry Zhijian, Wu, Xiaojie, and Li, Xiantao
Subjects: Physics - Computational Physics
Abstract: In this paper, we study the accuracy of Irving-Kirkwood type of formulas for the approximation of continuum quantities from atomistic simulations. Such formulas are derived by expressing the displacement, deformation gradient and stress in terms of certain kernel functions. We propose two criteria for choosing the kernel functions to significantly improve the sampling accuracy. We present a simple procedure to construct kernel functions that meet these criteria. Further, numerical tests on homogeneous and non-homogeneous systems provide validations for our analysis., Comment: 16 pages, 8 figures
Published: 2013
Full Text: View/download PDF

29. An unconditionally energy stable scheme for simulating wrinkling phenomena of elastic thin films on a compliant substrate

Author: Huang, Qiong-Ao, Jiang, Wei, and Yang, Jerry Zhijian
Published: 2019
Full Text: View/download PDF

30. Global Optimization via Schrödinger–Föllmer Diffusion

Author: Dai, Yin, primary, Jiao, Yuling, additional, Kang, Lican, additional, Lu, Xiliang, additional, and Yang, Jerry Zhijian, additional
Published: 2023
Full Text: View/download PDF

31. Deep Neural Networks with ReLU-Sine-Exponential Activations Break Curse of Dimensionality in Approximation on Hölder Class

Author: Jiao, Yuling, primary, Lai, Yanming, additional, Lu, Xiliang, additional, Wang, Fengru, additional, Yang, Jerry Zhijian, additional, and Yang, Yuanyuan, additional
Published: 2023
Full Text: View/download PDF

32. Improved Analysis of PINNs: Alleviate the CoD for Compositional Solutions

Author: Jiao, Yuling, primary, Lu, Xiliang, additional, Yang, Jerry Zhijian, additional, null, Cheng Yuan, additional, and Zhang, Pingwen, additional
Published: 2023
Full Text: View/download PDF

33. The Discontinuous Galerkin Method by Patch Reconstruction for Helmholtz Problems

Author: Li, Di, primary, Liu, Min, additional, null, Xiliang Lu, additional, and Yang, Jerry Zhijian, additional
Published: 2023
Full Text: View/download PDF

34. Stability and convergence analysis of high-order numerical schemes with DtN-type absorbing boundary conditions for nonlocal wave equations.

Author: Wang, Jihong, Yang, Jerry Zhijian, and Zhang, Jiwei
Abstract: The stability and convergence analysis of high-order numerical approximations for the one- and two-dimensional nonlocal wave equations on unbounded spatial domains are considered. We first use the quadrature-based finite difference schemes to discretize the spatially nonlocal operator, and apply the explicit difference scheme to approximate the temporal derivative to achieve a fully discrete infinity system. After that, we construct the Dirichlet-to-Neumann (DtN)-type absorbing boundary conditions (ABCs), to reduce the infinite discrete system into a finite discrete system. To do so, we first adopt the idea in Du et al. (2018, Commun. Comput. Phys. , 24 , 1049–1072) and Du et al. (2018, SIAM J. Sci. Comp. , 40 , A1430–A1445) to derive the Dirichlet-to-Dirichlet (DtD)-type mappings for one- and two-dimensional cases, respectively. We then use the discrete nonlocal Green's first identity to achieve the discrete DtN-type mappings from the DtD-type mappings. The resulting DtN-type mappings make it possible to perform the stability and convergence analysis of the reduced problem. Numerical experiments are provided to demonstrate the accuracy and effectiveness of the proposed approach. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

35. Stability and convergence analysis of high-order numerical schemes with DtN-type absorbing boundary conditions for nonlocal wave equations

Author: Wang, Jihong, primary, Yang, Jerry Zhijian, additional, and Zhang, Jiwei, additional
Published: 2023
Full Text: View/download PDF

36. Provable Advantage of Parameterized Quantum Circuit in Function Approximation

Author: Yu, Zhan, Chen, Qiuhao, Jiao, Yuling, Li, Yinan, Lu, Xiliang, Wang, Xin, Yang, Jerry Zhijian, Yu, Zhan, Chen, Qiuhao, Jiao, Yuling, Li, Yinan, Lu, Xiliang, Wang, Xin, and Yang, Jerry Zhijian
Abstract: Understanding the power of parameterized quantum circuits (PQCs) in accomplishing machine learning tasks is one of the most important questions in quantum machine learning. In this paper, we analyze the expressivity of PQCs through the lens of function approximation. Previously established universal approximation theorems for PQCs are mainly nonconstructive, leading us to the following question: How large do the PQCs need to be to approximate the target function up to a given error? We exhibit explicit constructions of data re-uploading PQCs for approximating continuous and smooth functions and establish quantitative approximation error bounds in terms of the width, the depth and the number of trainable parameters of the PQCs. To achieve this, we utilize techniques from quantum signal processing and linear combinations of unitaries to construct PQCs that implement multivariate polynomials. We implement global and local approximation techniques using Bernstein polynomials and local Taylor expansion and analyze their performances in the quantum setting. We also compare our proposed PQCs to nearly optimal deep neural networks in approximating high-dimensional smooth functions, showing that the ratio between model sizes of PQC and deep neural networks is exponentially small with respect to the input dimension. This suggests a potentially novel avenue for showcasing quantum advantages in quantum machine learning., Comment: 31pages, 3 figures
Published: 2023

37. Two-grid variational multiscale method with bubble stabilization for convection diffusion equation

Author: Weng, Zhifeng, Yang, Jerry Zhijian, and Lu, Xiliang
Published: 2016
Full Text: View/download PDF

38. Sparse Membership Affinity Lasso for Fuzzy Clustering

Author: Huang, Junjun, Lu, Xiliang, and Yang, Jerry Zhijian
Abstract: The membership matrix is a key element in fuzzy clustering, enabling novel data representation in multiple clusters. The row vectors of the membership matrix represent each sample's degree of membership to different clusters. Notably, researchers have confirmed the presence of the local affinity among these row vectors, effectively preserving the local structure of the original data distribution. However, in this work, we consider that most sample points have insignificant fuzziness, with fuzziness found primarily in a few clusters, resulting in most membership vectors being sparse. To tackle this issue, we present the sparse, membership-affinity fuzzy clustering model, which leverages the sparsity of the row vectors and its affinity to establish a more appropriate representation, along with an optimization algorithm. Our experimental results on both simulated and real datasets demonstrate that the combination of sparsity and affinity can significantly enhance fuzzy clustering performance over other models.
Published: 2024
Full Text: View/download PDF

39. Sparse Membership Affinity Lasso for Fuzzy Clustering

Author: Huang, Junjun, primary, Lu, Xiliang, additional, and Yang, Jerry Zhijian, additional
Published: 2023
Full Text: View/download PDF

40. A Stable and High-Order Numerical Scheme with Dtn-Type Artificialboundary Conditions for a 2d Peridynamic Diffusion Model

Author: Wang, Jihong, primary, Di, Yana, additional, Yang, Jerry Zhijian, additional, and Zhang, Jiwei, additional
Published: 2023
Full Text: View/download PDF

41. The Discontinuous Galerkin Method by Divergence-Free Patch Reconstruction for Stokes Eigenvalue Problems

Author: Li, Di, primary, Sun, Zhiyuan, additional, Wang, Fengru, additional, and Yang, Jerry Zhijian, additional
Published: 2022
Full Text: View/download PDF

42. Hierarchical Absorbing Interface Conditions for Wave Propagation on Non-Uniform Meshes

Author: Dai, Shuyang, primary, Sun, Zhiyuan, additional, Wang, Fengru, additional, Yang, Jerry Zhijian, additional, and Yuan, Cheng, additional
Published: 2022
Full Text: View/download PDF

43. Deep Ritz Methods for Laplace Equations with Dirichlet Boundary Condition

Author: Duan, Chenguang, primary, Jiao, Yuling, additional, Lai, Yanming, additional, Lu, Xiliang, additional, null, Qimeng Quan, additional, and Yang, Jerry Zhijian, additional
Published: 2022
Full Text: View/download PDF

44. A Rate of Convergence of Physics Informed Neural Networks for the Linear Second Order Elliptic PDEs

Author: Jiao, Yuling, primary, Lai, Yanming, additional, Li, Dingwei, additional, Lu, Xiliang, additional, Wang, Fengru, additional, Wang, Yang, additional, and Yang, Jerry Zhijian, additional
Published: 2022
Full Text: View/download PDF

45. Convergence Rate Analysis for Deep Ritz Method

Author: Duan, Chenguang, primary, Jiao, Yuling, additional, Lai, Yanming, additional, Li, Dingwei, additional, Lu, Xiliang, additional, and Yang, Jerry Zhijian, additional
Published: 2022
Full Text: View/download PDF

46. A Rate of Convergence of Physics Informed Neural Networks for the Linear Second Order Elliptic PDEs

Author: Jiao, Yuling, Lai, Yanming, Li, Dingwei, Lu, Xiliang, Wang, Fengru, Wang, Yang, Yang, Jerry Zhijian, Jiao, Yuling, Lai, Yanming, Li, Dingwei, Lu, Xiliang, Wang, Fengru, Wang, Yang, and Yang, Jerry Zhijian
Abstract: In recent years, physical informed neural networks (PINNs) have been shown to be a powerful tool for solving PDEs empirically. However, numerical analysis of PINNs is still missing. In this paper, we prove the convergence rate to PINNs for the second order elliptic equations with Dirichlet boundary condition, by establishing the upper bounds on the number of training samples, depth and width of the deep neural networks to achieve desired accuracy. The error of PINNs is decomposed into approximation error and statistical error, where the approximation error is given in C2 norm with ReLU3 networks (deep network with activation function max{0,x3}) and the statistical error is estimated by Rademacher complexity. We derive the bound on the Rademacher complexity of the non-Lipschitz composition of gradient norm with ReLU3 network, which is of immense independent interest. ©2022 Global-Science Press
Published: 2022

47. Finite element approximation to the extremal eigenvalue problem for inhomogenous materials

Author: Liang, Kewei, Lu, Xiliang, and Yang, Jerry Zhijian
Published: 2015
Full Text: View/download PDF

48. Boundary condition for dislocation dynamic simulation in BCC crystal

Author: Dai, Shuyang, Wang, Fengru, Xiang, Yang, Yang, Jerry Zhijian, Yuan, Cheng, Dai, Shuyang, Wang, Fengru, Xiang, Yang, Yang, Jerry Zhijian, and Yuan, Cheng
Abstract: The movement of dislocations and the corresponding crystal plastic deformation are highly influenced by the interaction between dislocations and nearby free surfaces. The boundary condition for inclination angle θinc which indicates the relation between a dislocation line and the surface is one of the key ingredients in the dislocation dynamic simulations. In this paper, we first present a systematical study on θinc by molecular static simulations in BCC-irons samples. We also study the inclination angle by using molecular dynamic simulations. A continuum description of inclination angle in both static and dynamic cases is derived based on Onsager's variational principle. We show that the results obtained from continuum description are in good agreement with the molecular simulations. These results can serve as boundary conditions for dislocation dynamics simulations.
Published: 2021

49. Cauchy–Born Rule and Stability of Crystalline Solids at Finite Temperature

Author: Luo, Tao, Xiang, Yang, Yang, Jerry Zhijian, Yuan, Cheng, Luo, Tao, Xiang, Yang, Yang, Jerry Zhijian, and Yuan, Cheng
Abstract: We study the connection between atomistic and continuum models for the elastic deformation of crystalline solids at finite temperature. We prove, under certain sharp stability conditions at zero temperature, that the solid is stable when temperature is low. This gives a criterion for the onset of instabilities of crystalline solids as temperature increases. Based on the stability conditions at both zero and finite temperature, we show that the finite temperature version of Cauchy–Born rule gives a correct nonlinear elasticity model in the sense that elastically deformed states of the atomistic model are closely approximated by solutions of the continuum model with free energy functionals obtained from the Cauchy–Born rule at finite temperature. The convergence is proved for both simple and complex lattices.
Published: 2021

50. Finite temperature Cauchy-Born rule and stability of crystalline solids with point defects

Author: Luo, Tao, Xiang, Yang, Yang, Jerry Zhijian, Luo, Tao, Xiang, Yang, and Yang, Jerry Zhijian
Abstract: We study the convergence of the elastic deformation from an atomistic model to a continuum model based on the Cauchy--Born rule for crystalline solids, where point defects are allowed to exist. We prove, under certain sharp stability conditions at zero temperature of the perfect lattice, that the solids are stable when the temperature and defect concentration are both low. Based on the stability conditions at zero/finite temperatures and with/without defects, we show that the defected version of the Cauchy--Born rule gives a correct nonlinear elasticity model in the sense that elastically deformed states of the atomistic model are closely approximated by solutions of the continuum model with free energy functionals obtained from the Cauchy--Born rule. Both static and dynamic problems are considered. The results are focused on the simple crystals and can be easily extended to complex ones.
Published: 2021

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

117 results on '"Yang, Jerry Zhijian"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources