399 results on '"Wing Hung Wong"'
Search Results
352. Computational Molecular Biology.
- Author
-
Wing Hung Wong
- Subjects
- *
MOLECULAR biology , *LIFE sciences , *GENES , *STATISTICS , *DNA , *BIOMOLECULES - Abstract
The article presents information on a study which reviewed computational molecular biology and the opportunities it presents to statistical researchers. Molecular biology is one of the most important scientific frontiers in the second half of the twentieth century. During this period, the basic principles of how genetic information is encoded in the DNA and how this information is used to direct the function of a cell were worked out at the molecular level, and methods were developed to clone, to sequence, and to amplify DNA. As a result, a large amount of biological sequence information has been generated and deposited into publicly accessible data bases. The phenomenal growth of DNA sequence data underpinned by a fundamental shift in the way such data are produced. Many components of basic cellular processes are highly conserved, although their uses and regulation have diverged greatly among extant organisms. Sequence alignment is the basic tool that allows to detect these conserved components.
- Published
- 2000
- Full Text
- View/download PDF
353. The Multiple-Try Method and Local Optimization in Metropolis Sampling.
- Author
-
Liu, Jun S., Liang, Faming, and Wing Hung Wong
- Subjects
MARKOV processes ,TRANSITION rules ,TEMPORARY tax regulations ,MONTE Carlo method ,STATE-space methods ,SYSTEM analysis - Abstract
This article describes a new Metropolis-like transition rule, the multiple-try Metropolis, for Markov chain Monte Carlo (MCMC) simulations. By using this transition rule together with adaptive direction sampling, we propose a novel method for incorporating local optimization steps into a MCMC sampler in continuous state-space. Numerical studies show that the new method performs significantly better than the traditional Metropolis-Hastings (M-H) sampler. With minor tailoring in using the rule, the multiple-try method can also be exploited to achieve the effect of a griddy Gibbs sampler without having to bear with griddy approximations, and the effect of a hit-and-run algorithm without having to figure out the required conditional distribution in a random direction. [ABSTRACT FROM AUTHOR]
- Published
- 2000
- Full Text
- View/download PDF
354. STOCHASTIC RECONSTRUCTION OF INCOMPLETE DATA SETS USING GIBBS PRIORS IN POSITRON EMISSION TOMOGRAPHY
- Author
-
Chin-Tu Chen, Wing Hung Wong, and Oscar H. Kapp
- Subjects
Spatial correlation ,Computer science ,Image quality ,business.industry ,General Medicine ,Iterative reconstruction ,Bayesian inference ,Transformation (function) ,Prior probability ,Radiology, Nuclear Medicine and imaging ,Projection (set theory) ,Nuclear medicine ,business ,Image resolution ,Algorithm - Abstract
Statistical method for image reconstruction in positron emission tomography (PET) have been utilized with increasing frequency in recent years because of their potential for yielding improved image quality. Stochastic techniques such as the inexact reconstruction technique (IRT) have provided a fruitful approach to the problem of image reconstruction with only a limited number of projection views by applying an iterative approach, with certain constraints, to the treatment of backprojected probability matrices. We have combined the use of the IRT with a new Bayesian model developed recently in our laboratories which employs a Gibbs prior that incorporate some prior information to describe the spatial correlation of neighboring regions and takes into account the effect of the limited spatial resolution as well. This model incorporates continuous values for `line sites' in order to avoid computational difficulties in the determination of point estimate of the image. In addition, we use a square-root transformation for Poisson intensity allowing ready incorporation into the Gibbs formulation. The method of iterative conditional averages was used for computing the point estimates. A preliminary study showed promising results with the use of data from only 8 projection angles.© (1992) COPYRIGHT SPIE--The International Society for Optical Engineering. Downloading of the abstract is permitted for personal use only.
- Published
- 1993
355. ATTENUATION CORRECTION IN SPECT
- Author
-
Wing Hung Wong, J. Liu, Chin-Tu Chen, and X. Pan
- Subjects
Computer science ,Radiology, Nuclear Medicine and imaging ,General Medicine ,Hybrid approach ,Correction for attenuation ,Algorithm - Published
- 1992
356. Convergence Rates of a Class of Multivariate Density Estimation Methods Based on Adaptive Partitioning.
- Author
-
Linxi Liu, Dangna Li, and Wing Hung Wong
- Subjects
- *
FUNCTIONS of bounded variation , *MAXIMUM likelihood statistics , *HOLDER spaces , *DATA compression , *DENSITY , *NONPARAMETRIC estimation - Abstract
Density estimation is a building block for many other statistical methods, such as classification, nonparametric testing, and data compression. In this paper, we focus on a nonparametric approach to multivariate density estimation, and study its asymptotic properties under both frequentist and Bayesian settings. The estimated density function is obtained by considering a sequence of approximating spaces to the space of densities. These spaces consist of piecewise constant density functions supported by binary partitions with increasing complexity. To obtain an estimate, the partition is learned by maximizing either the likelihood of the corresponding histogram on that partition, or the marginal posterior probability of the partition under a suitable prior. We analyze the convergence rate of the maximum likelihood estimator and the posterior concentration rate of the Bayesian estimator, and conclude that for a relatively rich class of density functions the rate does not directly depend on the dimension. We also show that the Bayesian method can adapt to the unknown smoothness of the density function. The method is applied to several specific function classes and explicit rates are obtained. These include spatially sparse functions, functions of bounded variation, and Hölder continuous functions. We also introduce an ensemble approach, obtained by aggregating multiple density estimates fit under carefully designed perturbations, and show that for density functions lying in a Hölder space (H1,β, β ≤ 1), the ensemble method can achieve minimax convergence rate up to a logarithmic term, while the corresponding rate of the density estimator based on a single partition is suboptimal for this function class. [ABSTRACT FROM AUTHOR]
- Published
- 2023
357. An Application of Imputation to an Estimation Problem in Grouped Lifetime Analysis.
- Author
-
Tanner, Martin A. and Wing Hung Wong
- Subjects
- *
ESTIMATION theory , *MULTIPLE imputation (Statistics) - Abstract
Tanner and Wong (in press) have introduced the data augmentation algorithm for the analysis of parametric missing data problems. In this article, this paradigm is used to develop an algorithm for the nonparametric estimation of the hazard function from grouped and censored lifetime data. This algorithm makes use of the notions of cross-validation and multiple imputation to prescribe the appropriate degree of smoothing for the nonparametric hazard estimate. A procedure for estimating the variance of the estimator is also proposed. The nonparametric hazard estimate and corresponding variance formula are shown to perform well in a simulation study. The algorithm is illustrated with a numerical example. [ABSTRACT FROM AUTHOR]
- Published
- 1987
- Full Text
- View/download PDF
358. The Calculation of Posterior Distributions by Data Augmentation.
- Author
-
Tanner, Martin A. and Wing Hung Wong
- Subjects
- *
ALGORITHMS , *THEORY of distributions (Functional analysis) , *APPROXIMATION theory , *FUNCTIONAL analysis , *ESTIMATION theory , *BAYESIAN analysis , *DISTRIBUTION (Probability theory) , *INFERENCE (Logic) , *MULTIPLE imputation (Statistics) - Abstract
The idea of data augmentation arises naturally in missing value problems, as exemplified by the standard ways of filling in missing cells in balanced two-way tables. Thus data augmentation refers to a scheme of augmenting the observed data so as to make it more easy to analyze. This device is used to great advantage by the EM algorithm (Dempster, Laird, and Rubin 1977) in solving maximum likelihood problems. In situations when the likelihood cannot be approximated closely by the normal likelihood, maximum likelihood estimates and the associated standard errors cannot be relied upon to make valid inferential statements. From the Bayesian point of view, one must now calculate the posterior distribution of parameters of interest. If data augmentation can be used in the calculation of the maximum likelihood estimate, then in the same cases one ought to be able to use it in the computation of the posterior distribution. It is the purpose of this article to explain how this can be done. The basic idea is quite simple. The observed data y is augmented by the quantity z, which is referred to as the latent data. It is assumed that if y and z are both known, then the problem is straightforward to analyze, that is, the augmented data posterior p(O I Y, z) can be calculated. But the posterior density that we want is p(O I Y), which may be difficult to calculate directly. If, however, one can generate multiple values of z from the predictive distribution p(z l Y) (i.e., multiple imputations of z), then p(O I Y) can be approximately obtained as the average of p(theta | Y, z) over the imputed z's. However, p(z [ y) depends, in turn, on p(O [ y). Hence if p(theta | y) was known, it could be used to calculate p(z | Y). This mutual dependency between p(O I Y) and p(z I Y) leads to an iterative algorithm to calculate p(O [ y).... [ABSTRACT FROM AUTHOR]
- Published
- 1987
- Full Text
- View/download PDF
359. Comment.
- Author
-
Dyn, Nira, Wahba, Grace, and Wing-Hung Wong
- Subjects
INTERPOLATION ,STATISTICS ,CONTOURS (Cartography) ,EQUATIONS ,APPROXIMATION theory ,ALGORITHMS ,CANCER ,MATHEMATICAL optimization - Abstract
The article comments on the article "Smooth Pycnophylactic Interpolation for Geographical Regions," by Waldo R. Tobler that appeared in the 1979 issue of the "Journal of the American Statistical Association." Authors would like to begin by thanking Tobler for a very interesting contribution to the important problem of obtaining smooth surfaces with the volume-matching property. Although Tobler's main interest is in obtaining smooth surfaces and not in solving optimization problems, authors think it is useful and important to state precisely the optimization problem being solved, to establish the existence and uniqueness of the solution, and to establish that the numerical algorithms involved do converge to a unique solution. The authors' position is that two experimenters employing the same definition of smoothness should obtain comparable contour maps. This requirement could be very important, for example, in analyzing the geographic distribution of various types of cancer incidence and matching the resultant contour maps with contour maps of, say, air-pollution density. If the disease data and air-pollution data were sculpted by different methods, then the two maps are not necessarily comparable. If an algorithm does not converge to a unique solution, then different experimenters can obtain different maps for the same data.
- Published
- 1979
- Full Text
- View/download PDF
360. The Calculation of Posterior Distributions by Data Augmentation
- Author
-
Wing Hung Wong and Martin A. Tanner
- Subjects
Statistics and Probability ,Computation ,Bayesian probability ,Monte Carlo method ,Posterior probability ,Bayesian inference ,Statistics::Computation ,Standard error ,Expectation–maximization algorithm ,Statistics ,Statistics::Methodology ,Imputation (statistics) ,Statistics, Probability and Uncertainty ,Algorithm ,Mathematics - Abstract
The idea of data augmentation arises naturally in missing value problems, as exemplified by the standard ways of filling in missing cells in balanced two-way tables. Thus data augmentation refers to a scheme of augmenting the observed data so as to make it more easy to analyze. This device is used to great advantage by the EM algorithm (Dempster, Laird, and Rubin 1977) in solving maximum likelihood problems. In situations when the likelihood cannot be approximated closely by the normal likelihood, maximum likelihood estimates and the associated standard errors cannot be relied upon to make valid inferential statements. From the Bayesian point of view, one must now calculate the posterior distribution of parameters of interest. If data augmentation can be used in the calculation of the maximum likelihood estimate, then in the same cases one ought to be able to use it in the computation of the posterior distribution. It is the purpose of this article to explain how this can be done. The basic idea ...
- Published
- 1987
361. On constrained multivariate splines and their approximations
- Author
-
Wing Hung Wong
- Subjects
Polyharmonic spline ,Computational Mathematics ,Spline (mathematics) ,Hermite spline ,Smoothing spline ,Applied Mathematics ,Mathematical analysis ,Spline interpolation ,Thin plate spline ,Convexity ,Smoothing ,Mathematics - Abstract
The variational formulation of multivariate spline functions is generalized to include cases where the function has to satisfy inequality constraints such as positivity and convexity. Condition for existence and uniqueness of a solution is given. Approximation to the solution can be obtained by solving the variational problem in a finite dimensional subspace. Conditions for convergence and error estimates of the approximations are presented, both for interpolation problems and smoothing problems. The general theory is illustrated by specific examples including the "volume-matching" problem and the "one-sided thin plate spline".
- Published
- 1984
362. Comment
- Author
-
Nira Dyn and Wing-Hung Wong
- Subjects
Statistics and Probability ,Statistics, Probability and Uncertainty - Published
- 1979
363. An Application of Imputation to an Estimation Problem in Grouped Lifetime Analysis
- Author
-
Martin A. Tanner and Wing Hung Wong
- Subjects
Statistics and Probability ,Applied Mathematics ,Modeling and Simulation - Published
- 1987
364. On the characterization of non-negative volume-matching surface splines
- Author
-
Nira Dyn and Wing Hung Wong
- Subjects
Hermite spline ,Mathematics(all) ,Numerical Analysis ,General Mathematics ,Applied Mathematics ,Perfect spline ,Geometry ,Dirichlet integral ,Smoothing spline ,symbols.namesake ,Spline (mathematics) ,M-spline ,symbols ,Applied mathematics ,Spline interpolation ,Thin plate spline ,Analysis ,Mathematics - Abstract
In this paper we study the surface spline which minimizes the Dirichlet Integral over a two-dimensional bounded domain, among all non-negative functions satisfying a finite number of volume-matching constraints. Existence and uniqueness of this surface spline are proved. A characterization by a variational inequality is given, revealing local and boundary behaviour of the surface spline. This characterization is of importance in the construction of numerical algorithms for the production of non-negative smooth surfaces from aggregated data.
- Published
- 1987
- Full Text
- View/download PDF
365. Assessing the Conservation of Mammalian Gene Expression Using High-Density Exon Arrays.
- Author
-
Yi Xing, Zhengqing Ouyang, Kapur, Karen, Scott, Matthew P., and Wing Hung Wong
- Published
- 2007
- Full Text
- View/download PDF
366. A Note on the Modified Likelihood for Density Estimation
- Author
-
Wing Hung Wong
- Subjects
Statistics and Probability ,Information distance ,Combinatorics ,Window Width ,Statistics ,Density estimation ,Limiting ,Statistics, Probability and Uncertainty ,Jackknife resampling ,Equivalence (measure theory) ,Cross-validation ,Mathematics - Abstract
Let f λ be a kernel estimate (with window width λ) of the density f. Its performance is assessed by the Kullback-Leibler information distance I(f, f λ) = ∫ f log f − ∫ f log f λ. This article establishes conditions for the asymptotic equivalence of the cross-validation estimate and the jackknife estimate of the term ∫ f log f λ, and provides the common limiting value. This gives insight into the “modified likelihood” criterion for choosing λ, introduced by Habbema, Hermans, and Van den Broek (1974) and Duin (1976).
- Published
- 1983
367. Rejoinder
- Author
-
Martin A. Tanner and Wing Hung Wong
- Subjects
Statistics and Probability ,Statistics, Probability and Uncertainty - Published
- 1987
368. On the Consistency of Cross-Validation in Kernel Nonparametric Regression
- Author
-
Wing Hung Wong
- Subjects
Statistics and Probability ,Polynomial regression ,consistency ,Kernel (set theory) ,Mean squared error ,Strong consistency ,Cross-validation ,kernel estimate ,Lambda ,Nonparametric regression ,Combinatorics ,nonparametric regression ,Statistics ,Kernel regression ,Principal component regression ,62G05 ,Statistics, Probability and Uncertainty ,Mathematics - Abstract
For the nonparametric regression model $Y(t_i) = \theta(t_i) + \varepsilon(t_i)$ where $\theta$ is a smooth function to be estimated, $t_i$'s are nonrandom, $\varepsilon(t_i)$'s are i.i.d. errors, this paper studies the behavior of the kernel regression estimate $\hat{\theta}(t) = \big\lbrack \sum^n_{j=1}K \big(\frac{t_j - t}{\lambda}\big) Y(t_j) \big\rbrack / \big\lbrack\ sum^n_{j=1} K \big(\frac{t_j - t}{\lambda}\big) \big\rbrack$ when $\lambda$ is chosen by cross-validation on the average squared error. Strong consistency in terms of the average squared error is established for uniform spacing, compact kernel and finite fourth error moment.
- Published
- 1983
369. Theory of Partial Likelihood
- Author
-
Wing Hung Wong
- Subjects
Statistics and Probability ,62A10 ,generalized autoregression ,Asymptotic distribution ,Consistent estimator ,Statistics ,Nuisance parameter ,Statistics::Methodology ,proportional hazard model ,Mathematics ,nonstationary ,martingale limit theorem ,Estimator ,conditional score ,Missing data ,Minimal Fisher Information ,missing values ,Efficient estimator ,Autoregressive model ,nuisance parameter ,62P10 ,Maximum likelihood estimator ,62M10 ,Statistics, Probability and Uncertainty ,Martingale (probability theory) ,62F12 - Abstract
A general asymptotic theory is developed for the maximum likelihood estimator based on a partial likelihood. Conditions are given for consistency and asymptotic normality, and a method is provided for the calculation of the asymptotic efficiency of the estimator. The implications of the general theory are examined in special cases such as inference in stochastic processes, Cox regression models, and AR processes with missing segments.
- Published
- 1986
370. Improved Bayesian image reconstruction in positron emission tomography
- Author
-
C.-T. Chen, Chien-Min Kao, Wing Hung Wong, and Xiaochuan Pan
- Subjects
Hyperparameter ,medicine.diagnostic_test ,Positron emission tomography ,Computer science ,Bayesian probability ,Diagonal ,medicine ,Statistical analysis ,Iterative reconstruction ,Bayesian inference ,Algorithm ,Image restoration - Abstract
We extend the Bayesian model (Johnson, Chen, et al., 1990, 1991), previously developed for image reconstruction and restoration, by the introduction of diagonal line sites and the use of symmetrical neighborhood configurations. A computer simulation study was performed to examine the effect of the hyperparameters, the diagonal line sites, and the size of the neighbourhood configuration on the performance of the Bayesian method. We show that for optimal performance, distinct hyperparameters should be used for the intensity sites and line sites. The results also suggested the use of a larger configuration. By comparing the near-optimal restored images, we also demonstrate that the use of diagonal line sites, along with the symmetrical configurations thus made possible, can effectively remove the blocky edge artifacts and produce images of better quality. For image reconstruction applications, in addition to the use of diagonal line sites and symmetrical neighborhood configurations, we suggested using the area-weighted projection/backprojection technique to improve the accuracy of the system response function. Quality of reconstructed images were shown to be improved for both computer simulated and real patient PET data.
371. Improved Bayesian approach
- Author
-
Chin-Tu Chen, Chien-Min Kao, Wing Hung Wong, and Xiaochuan Pan
- Subjects
Scheme (programming language) ,Computer science ,business.industry ,Optical engineering ,Bayesian probability ,Boundary (topology) ,Machine learning ,computer.software_genre ,Line (geometry) ,Artificial intelligence ,business ,Algorithm ,computer ,computer.programming_language - Abstract
The Bayesian approach that employs the concepts of cliques and line sites in its Gibbs prior provides the potential of realistic and objective characterization of boundaries between different regions exhibiting intensity variations in the reconstructed images. In this work, we develop an improved Bayesian approach for accurate detection of boundaries by introducing symmetric cliques and new types of line sites as well as a new calculation scheme. This improved Bayesian approach has been applied to positron emission tomography data from both computer simulations and patient studies. The results demonstrate that the new cliques, line sites, and calculation scheme can enhance the boundary detectability of the Bayesian approach and yield realistic boundaries and hence improved reconstructed images.© (1995) COPYRIGHT SPIE--The International Society for Optical Engineering. Downloading of the abstract is permitted for personal use only.
372. Comment
- Author
-
Martin A. Tanner and Wing Hung Wong
- Subjects
Statistics and Probability ,Statistics, Probability and Uncertainty - Published
- 1983
373. A 7.11mJ/Gb/query data-driven machine learning processor (D2MLP) for big data analysis and applications.
- Author
-
Chang-Hung Tsai, Tung-Yu Wu, Shu-Yu Hsu, Chia-Ching Chu, Fang-Ju Ku, Ying-Siou Laio, Chih-Lung Chen, Wing-Hung Wong, Hsie-Chia Chang, and Chen-Yi Lee
- Published
- 2014
- Full Text
- View/download PDF
374. The Calculation of Posterior Distributions by Data Augmentation: Comment
- Author
-
Donald B. Rubin, Arthur P. Dempster, Wing Hung Wong, S. J. Haberman, Martin A. Tanner, and C. N. Morris
- Subjects
Statistics and Probability ,Iterative method ,Statistics ,Applied mathematics ,Statistics, Probability and Uncertainty ,Missing data ,Mathematics - Abstract
On introduit une methode iterative pour le calcul des distributions a posteriori qui s'applique meme lorsque les donnees peuvent etre augmentees de telle sorte qu'il devienne facile d'analyser les donnees augmentees et qu'il soit simple de generer les donnees augmentees etant donne le parametre
- Published
- 1987
375. Smooth Pycnophylactic Interpolation for Geographical Regions: Comment
- Author
-
Grace Wahba, Wing-Hung Wong, and Nira Dyn
- Subjects
Statistics and Probability ,education.field_of_study ,Geography ,Population ,Statistics, Probability and Uncertainty ,education ,Cartography ,Interpolation ,Demography - Published
- 1979
376. Data-Based Nonparametric Estimation of the Hazard Function with Applications to Model Diagnostis and Exploratory Analysis
- Author
-
Wing Hung Wong and Martin A. Tanner
- Subjects
Statistics and Probability ,Data processing ,Mathematical model ,Computer science ,Kernel (statistics) ,Statistics ,Nonparametric statistics ,Estimator ,Statistics, Probability and Uncertainty ,Censoring (statistics) ,Smoothing ,Parametric statistics - Abstract
Two general classes of nonparametric kernel estimators of the hazard function are introduced, which include both a 1-parameter estimator and a more complex 3-parameter estimator. In addition, employing the idea of cross-validation, the authors present a data-based algorithm for smoothing parameter selection. The article compares the data-based 1and 3-parameter estimators in a simulation experiment to the maximum likelihood estimator assuming the correct failure distribution and censoring mechanism. The 3-parameter estimator is found to perform well over a wide range of settings. On the average, the estimator recovers the shape of the underlying failure hazard and is competitive with the parametric estimator over a subset of the positive half line. Two examples illustrate possible uses of the nonparametric estimators.
- Published
- 1984
377. A Reanalysis of the Standford Heart Transplant Data: Comment
- Author
-
Martin A. Tanner and Wing Hung Wong
- Subjects
Statistics and Probability ,Statistics, Probability and Uncertainty ,Mathematics - Published
- 1983
378. Modeling non-uniformity in short-read rates in RNA-Seq data
- Author
-
Wing Hung Wong, Jun-Jun Li, and Hui-Hui Jiang
- Subjects
Sequence analysis ,genetic processes ,Method ,RNA-Seq ,Biology ,Poisson distribution ,Statistics, Nonparametric ,03 medical and health sciences ,symbols.namesake ,Mice ,0302 clinical medicine ,Apolipoproteins E ,Animals ,Humans ,Protein Isoforms ,natural sciences ,Poisson Distribution ,030304 developmental biology ,Genetics ,0303 health sciences ,Sequence ,Base Sequence ,Models, Genetic ,Sequence Analysis, RNA ,Gene Expression Profiling ,Linear model ,Nonparametric statistics ,food and beverages ,Exons ,Embryo, Mammalian ,Variable (computer science) ,Gene Expression Regulation ,symbols ,Linear Models ,RNA ,Constant (mathematics) ,Databases, Nucleic Acid ,Algorithm ,030217 neurology & neurosurgery - Abstract
Methods for modeling read counts from short read RNA-seq data., After mapping, RNA-Seq data can be summarized by a sequence of read counts commonly modeled as Poisson variables with constant rates along each transcript, which actually fit data poorly. We suggest using variable rates for different positions, and propose two models to predict these rates based on local sequences. These models explain more than 50% of the variations and can lead to improved estimates of gene and isoform expressions for both Illumina and Applied Biosystems data.
- Full Text
- View/download PDF
379. A tale of two morphogen gradients: Identifying Gli targets of Hedgehog Signaling
- Author
-
Hongkai Ji, Wing Hung Wong, Andrew P. McMahon, and Steven A. Vokes
- Subjects
Cell Biology ,Biology ,Molecular Biology ,Hedgehog signaling pathway ,Developmental Biology ,Cell biology ,Morphogen - Full Text
- View/download PDF
380. Genomic analysis of endoderm transcription in Xenopus
- Author
-
Julie C. Baker, Wing Hung Wong, Andrew L. Hufton, and Si Wan Kim
- Subjects
medicine.anatomical_structure ,animal structures ,biology ,Transcription (biology) ,embryonic structures ,Xenopus ,medicine ,Cell Biology ,Endoderm ,biology.organism_classification ,Molecular Biology ,Cell biology ,Developmental Biology - Full Text
- View/download PDF
381. CisGenome Browser: a flexible tool for genomic data visualization.
- Author
-
Hui Jiang, Fan Wang, Dyer, Nigel P., and Wing Hung Wong
- Subjects
OPEN source software ,GENOMICS ,COMPUTERS in biology ,WEB browsers ,DATA mining ,DATA visualization ,DATA modeling - Abstract
Summary: We present an open source, platform independent tool, called CisGenome Browser, which can work together with any other data analysis program to serve as a flexible component for genomic data visualization. It can also work by itself as a standalone genome browser. By working as a light-weight web server, CisGenome Browser is a convenient tool for data sharing between labs. It has features that are specifically designed for ultra high-throughput sequencing data visualization. [ABSTRACT FROM PUBLISHER]
- Published
- 2010
- Full Text
- View/download PDF
382. Comment.
- Author
-
Tanner, Martin A. and Wing Hung Wong
- Subjects
- *
HEART transplantation , *TRANSPLANTATION of organs, tissues, etc. , *STATISTICS , *EXPERIMENTAL design - Abstract
The article comments on the paper by Murray Aitkin, Nan Laird and Brian Francis which presents an analyses of survival of patients in the Stanford Heart Transplantation Program, published in the June 1983 issue of the "Journal of the American Statistical Association." Aitkin, Laird, and Francis make the important observation that one should not expect the effect of the transplant to fit the proportional hazards model. Their approach is to model pre-transplant and post-transplant survival separately. The authors, as well as several of the discussants, point out the need for further diagnostics to accompany their parametric and semiparametric analyses. For the post-transplant data the authors consider a number of models, most of which share the proportional hazards assumption. The survival times are then partitioned into several groups such that the scores within each group are approximately the same. If the proportional hazards assumption holds, then, at least approximately, the observations in each group should be realizations of the same failure process from which a nonparametric estimate of the hazard function can be obtained. Under the proportional hazards model we expect the estimates to be proportional. Furthermore, the estimates should conform to the parametric form if the assumed parametric model is valid.
- Published
- 1983
- Full Text
- View/download PDF
383. Comprehensive tissue deconvolution of cell-free DNA by deep learning for disease diagnosis and monitoring.
- Author
-
Shuo Li, Weihua Zeng, Xiaohui Ni, Qiao Liu, Wenyuan Li, Stackpole, Mary L., Yonggang Zhou, Gower, Arjan, Krysan, Kostyantyn, Ahuja, Preeti, Lu, David S., Raman, Steven S., Hsu, William, Aberle, Denise R., Magyar, Clara E., French, Samuel W., Hane, Steven-Huy B., Garon, Edward B., Agopian, Vatche G., and Wing Hung Wong
- Subjects
- *
CELL-free DNA , *DEEP learning , *DIAGNOSIS , *TISSUES , *CELL death , *CANCER education - Abstract
Plasma cell-free DNA (cfDNA) is a noninvasive biomarker for cell death of all organs. Deciphering the tissue origin of cfDNA can reveal abnormal cell death because of dis- eases, which has great clinical potential in disease detection and monitoring. Despite the great promise, the sensitive and accurate quantification of tissue-derived cfDNA remains challenging to existing methods due to the limited characterization of tissue methylation and the reliance on unsupervised methods. To fully exploit the clinical potential of tissue-derived cfDNA, here we present one of the largest comprehensive and high- resolution methylation atlas based on 521 noncancer tissue samples spanning 29 major types of human tissues. We systematically identified fragment-level tissue-specific methylation patterns and extensively validated them in orthogonal datasets. Based on the rich tissue methylation atlas, we develop the first supervised tissue deconvolution approach, a deep-learning-powered model, cfSort, for sensitive and accurate tissue decon- volution in cfDNA. On the benchmarking data, cfSort showed superior sensitivity and accuracy compared to the existing methods. We further demonstrated the clinical util- ities of cfSort with two potential applications: aiding disease diagnosis and monitoring treatment side effects. The tissue-derived cfDNA fraction estimated from cfSort reflected the clinical outcomes of the patients. In summary, the tissue methylation atlas and cfSort enhanced the performance of tissue deconvolution in cfDNA, thus facilitating cfDNA-based disease detection and longitudinal treatment monitoring. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
384. Relaxed simulated tempering for VLSI floorplan designs.
- Author
-
Cong, J., Tianming Kong, Dongmin Xu, Faming Liang, Liu, J.S., and Wing Hung Wong
- Published
- 1999
- Full Text
- View/download PDF
385. Statistical inferences for isoform expression in RNA-Seq.
- Author
-
Hui Jiang and Wing Hung Wong
- Subjects
- *
NUCLEOTIDE sequence , *RNA , *GENE expression , *BIOINFORMATICS , *GENETIC transcription , *QUANTITATIVE research - Abstract
Summary: The development of RNA sequencing (RNA-Seq) makes it possible for us to measure transcription at an unprecedented precision and throughput. However, challenges remain in understanding the source and distribution of the reads, modeling the transcript abundance and developing efficient computational methods. In this article, we develop a method to deal with the isoform expression estimation problem. The count of reads falling into a locus on the genome annotated with multiple isoforms is modeled as a Poisson variable. The expression of each individual isoform is estimated by solving a convex optimization problem and statistical inferences about the parameters are obtained from the posterior distribution by importance sampling. Our results show that isoform expression inference in RNA-Seq is possible by employing appropriate statistical methods. Contact: whwong@stanford.edu Supplementary information: Supplementary data are available at Bioinformatics online. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
386. Cross-hybridization modeling on Affymetrix exon arrays.
- Author
-
Karen Kapur, Hui Jiang, Yi Xing, and Wing Hung Wong
- Subjects
DNA microarrays ,EXONS (Genetics) ,GENETIC polymorphisms ,GENETIC transcription ,MESSENGER RNA ,BIOINFORMATICS - Abstract
Motivation: Microarray designs have become increasingly probe-rich, enabling targeting of specific features, such as individual exons or single nucleotide polymorphisms. These arrays have the potential to achieve quantitative high-throughput estimates of transcript abundances, but currently these estimates are affected by biases due to cross-hybridization, in which probes hybridize to off-target transcripts. Results: To study cross-hybridization, we map Affymetrix exon array probes to a set of annotated mRNA transcripts, allowing a small number of mismatches or insertion/deletions between the two sequences. Based on a systematic study of the degree to which probes with a given match type to a transcript are affected by cross-hybridization, we developed a strategy to correct for cross-hybridization biases of gene-level expression estimates. Comparison with Solexa ultra high-throughput sequencing data demonstrates that correction for cross-hybridization leads to a significant improve-ment of gene expression estimates. Availability: We provide mappings between human and mouse exon array probes and off-target transcripts and provide software extending the GeneBASE program for generating gene-level expression estimates including the cross-hybridization correction http://biogibbs.stanford.edu/~kkapur/GeneBase/. Contact: whwong@stanford.edu Supplementary information: Supplementary data are available at Bioinformatics online. [ABSTRACT FROM AUTHOR]
- Published
- 2008
- Full Text
- View/download PDF
387. SeqMap: mapping massive amount of oligonucleotides to the genome.
- Author
-
Hui Jiang and Wing Hung Wong
- Subjects
- *
MATHEMATICAL mappings , *OLIGONUCLEOTIDES , *GENOMES , *ALGORITHMS - Abstract
Summary: SeqMap is a tool for mapping large amount of short sequences to the genome. It is designed for finding all the places in a reference genome where each sequence may come from. This task is essential to the analysis of data from ultra high-throughput sequencing machines. With a carefully designed index-filtering algorithm and an efficient implementation, SeqMap can map tens of millions of short sequences to a genome of several billions of nucleotides. Multiple substitutions and insertions/deletions of the nucleotide bases in the sequences can be tolerated and therefore detected. SeqMap supports FASTA input format and various output formats, and provides command line options for tuning almost every aspect of the mapping process. A typical mapping can be done in a few hours on a desktop PC. Parallel use of SeqMap on a cluster is also very straightforward. Contact: whwong@stanford.edu [ABSTRACT FROM AUTHOR]
- Published
- 2008
- Full Text
- View/download PDF
388. Heritability enrichment in context-specific regulatory networks improves phenotype-relevant tissue identification.
- Author
-
Zhanying Feng, Zhana Duren, Jingxue Xin, Qiuyue Yuan, Yaoxi He, Bing Su, Wing Hung Wong, and Yong Wang
- Subjects
- *
HERITABILITY , *GENETIC correlations , *GENE regulatory networks , *GENOME-wide association studies , *GENE expression , *GENETIC regulation , *PHENOTYPES , *TISSUES - Abstract
Systems genetics holds the promise to decipher complex traits by interpreting their associated SNPs through gene regulatory networks derived from comprehensive multi-omics data of cell types, tissues, and organs. Here, we propose SpecVar to integrate paired chromatin accessibility and gene expression data into context-specific regulatory network atlas and regulatory categories, conduct heritability enrichment analysis with genome-wide association studies (GWAS) summary statistics, identify relevant tissues, and estimate relevance correlation to depict common genetic factors acting in the shared regulatory networks between traits. Our method improves power upon existing approaches by associating SNPs with context-specific regulatory elements to assess heritability enrichments and by explicitly prioritizing gene regulations underlying relevant tissues. Ablation studies, independent data validation, and comparison experiments with existing methods on GWAS of six phenotypes show that SpecVar can improve heritability enrichment, accurately detect relevant tissues, and reveal causal regulations. Furthermore, SpecVar correlates the relevance patterns for pairs of phenotypes and better reveals shared SNP-associated regulations of phenotypes than existing methods. Studying GWAS of 206 phenotypes in UK Biobank demonstrates that SpecVar leverages the context-specific regulatory network atlas to prioritize phenotypes' relevant tissues and shared heritability for biological and therapeutic insights. SpecVar provides a powerful way to interpret SNPs via context-specific regulatory networks and is available at https://github.com/AMSSwanglab/SpecVar, copy archived at swh:1:rev:cf27438d3f8245c34c357ec5f077528e6befe829. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
389. Heritability enrichment in context- specific regulatory networks improves phenotype-relevant tissue identification.
- Author
-
Zhanying Feng, Zhana Duren, Jingxue Xin, Qiuyue Yuan, Yaoxi He, Bing Su, Wing Hung Wong, and Yong Wang
- Abstract
Systems genetics holds the promise to decipher complex traits by interpreting their associated SNPs through gene regulatory networks derived from comprehensive multi-omics data of cell types, tissues, and organs. Here, we propose SpecVar to integrate paired chromatin accessibility and gene expression data into context-specific regulatory network atlas and regulatory categories, conduct heritability enrichment analysis with genome-wide association studies (GWAS) summary statistics, identify relevant tissues, and estimate relevance correlation to depict common genetic factors acting in the shared regulatory networks between traits. Our method improves power upon existing approaches by associating SNPs with context-specific regulatory elements to assess herita- bility enrichments and by explicitly prioritizing gene regulations underlying relevant tissues. Ablation studies, independent data validation, and comparison experiments with existing methods on GWAS of six phenotypes show that SpecVar can improve heritability enrichment, accurately detect rele- vant tissues, and reveal causal regulations. Furthermore, SpecVar correlates the relevance patterns for pairs of phenotypes and better reveals shared SNP-associated regulations of phenotypes than existing methods. Studying GWAS of 206 phenotypes in UK Biobank demonstrates that SpecVar leverages the context-specific regulatory network atlas to prioritize phenotypes’ relevant tissues and shared heritability for biological and therapeutic insights. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
390. Nested epistasis enhancer networks for robust genome regulation.
- Author
-
Xueqiu Lin, Yanxia Liu, Shuai Liu, Xiang Zhu, Lingling Wu, Yanyu Zhu, Dehua Zhao, Xiaoshu Xu, Chemparathy, Augustine, Haifeng Wang, Yaqiang Cao, Muneaki Nakamura, Noordermeer, Jasprina N., Russa, Marie La, Wing Hung Wong, Keji Zhao, and Qi, Lei S.
- Subjects
- *
EPISTASIS (Genetics) , *GENOMES , *GENE expression , *GENE enhancers , *NUCLEIC acid regulatory sequences - Abstract
Mammalian genomes have multiple enhancers spanning an ultralong distance (>megabases) to modulate important genes, but it is unclear how these enhancers coordinate to achieve this task. We combine multiplexed CRISPRi screening with machine learning to define quantitative enhancer-enhancer interactions. We find that the ultralong distance enhancer network has a nested multilayer architecture that confers functional robustness of gene expression. Experimental characterization reveals that enhancer epistasis is maintained by three-dimensional chromosomal interactions and BRD4 condensation. Machine learning prediction of synergistic enhancers provides an effective strategy to identify noncoding variant pairs associated with pathogenic genes in diseases beyond genome-wide association studies analysis. Our work unveils nested epistasis enhancer networks, which can better explain enhancer functions within cells and in diseases. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
391. Leveraging cell-type-specific regulatory networks to interpret genetic variants in abdominal aortic aneurysm.
- Author
-
Shining Ma, Xi Chen, Xiang Zhu, Tsao, Philip S., and Wing Hung Wong
- Subjects
- *
ABDOMINAL aortic aneurysms , *GENETIC variation , *GENE regulatory networks , *GENETIC regulation , *NUCLEOTIDE sequencing - Abstract
Abdominal aortic aneurysm (AAA) is a common degenerative cardiovascular disease whose pathobiology is not clearly understood. The cellular heterogeneity and cell-type-specific gene regulation of vascular cells in human AAA have not been well-characterized. Here, we performed analysis of whole-genome sequencing data in AAA patients versus controls with the aim of detecting diseaseassociated variants that may affect gene regulation in human aortic smooth muscle cells (AoSMC) and human aortic endothelial cells (HAEC), two cell types of high relevance to AAA disease. To support this analysis, we generated H3K27ac HiChIP data for these cell types and inferred cell-type-specific gene regulatory networks. We observed that AAA-associated variants were most enriched in regulatory regions in AoSMC, compared with HAEC and CD4+ cells. The cell-type-specific regulation defined by this HiChIP data supported the importance of ERG and the KLF family of transcription factors in AAA disease. The analysis of regulatory elements that contain noncoding variants and also are differentially open between AAA patients and controls revealed the significance of the interleukin-6-mediated signaling pathway. This finding was further validated by including information from the deleteriousness effect of nonsynonymous single-nucleotide variants in AAA patients and additional control data from the Medical Genome Reference Bank dataset. These results shed important insights into AAA pathogenesis and provide a model for cell-type-specific analysis of disease-associated variants. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
392. Density estimation using deep generative neural networks.
- Author
-
Qiao Liu, Jiaze Xu, Rui Jiang, and Wing Hung Wong
- Subjects
- *
GENERATIVE adversarial networks , *DENSITY , *DEEP learning , *GAUSSIAN distribution , *MACHINE learning - Abstract
Density estimation is one of the fundamental problems in both statistics and machine learning. In this study, we propose Roundtrip, a computational framework for general-purpose density estimation based on deep generative neural networks. Roundtrip retains the generative power of deep generative models, such as generative adversarial networks (GANs) while it also provides estimates of density values, thus supporting both data generation and density estimation. Unlike previous neural density estimators that put stringent conditions on the transformation from the latent space to the data space, Roundtrip enables the use of much more general mappings where target density is modeled by learning a manifold induced from a base density (e.g., Gaussian distribution). Roundtrip provides a statistical framework for GAN models where an explicit evaluation of density values is feasible. In numerical experiments, Roundtrip exceeds state-of-the-art performance in a diverse range of density estimation tasks. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
393. A method for scoring the cell type-specific impacts of noncoding variants in personal genomes.
- Author
-
Wenran Li, Zhana Duren, Rui Jiang, and Wing Hung Wong
- Subjects
- *
HUMAN genetics , *HUMAN genome , *GENOMES , *PHENOTYPES - Abstract
A person's genome typically contains millions of variants which represent the differences between this personal genome and the reference human genome. The interpretation of these variants, i.e., the assessment of their potential impact on a person's phenotype, is currently of great interest in human genetics and medicine. We have developed a prioritization tool called OpenCausal which takes as inputs 1) a personal genome and 2) a reference contextspecific TF expression profile and returns a list of noncoding variants prioritized according to their impact on chromatin accessibility for any given genomic region of interest. We applied OpenCausal to 6,430 samples across 18 tissues derived from the GTEx project and found that the variants prioritized by OpenCausal are highly enriched for eQTLs and caQTLs. We further propose a strategy to integrate the predicted open scores with genome-wide association studies (GWAS) data to prioritize putative causal variants and regulatory elements for a given risk locus (i.e., fine-mapping analysis). As an initial example, we applied this method to a GWAS dataset of human height and found that the prioritized putative variants and elements are correlated with the phenotype (i.e., heights of individuals) better than others. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
394. Integrated functional genomic analyses of Klinefelter and Turner syndromes reveal global network effects of altered X chromosome dosage.
- Author
-
Xianglong Zhang, Hong, David, Shining Ma, Ward, Thomas, Ho, Marcus, Pattni, Reenal, Zhana Duren, Stankov, Atanas, Shrestha, Sharon Bade, Hallmayer, Joachim, Wing Hung Wong, Reiss, Allan L., and Urban, Alexander E.
- Subjects
- *
X chromosome , *TURNER'S syndrome , *KLINEFELTER'S syndrome , *FUNCTIONAL analysis , *NETWORK effect - Abstract
In both Turner syndrome (TS) and Klinefelter syndrome (KS) copy number aberrations of the X chromosome lead to various developmental symptoms. We report a comparative analysis of TS vs. KS regarding differences at the genomic network level measured in primary samples by analyzing gene expression, DNA methylation, and chromatin conformation. X-chromosome inactivation (XCI) silences transcription from one X chromosome in female mammals, on which most genes are inactive, and some genes escape from XCI. In TS, almost all differentially expressed escape genes are downregulated but most differentially expressed inactive genes are upregulated. In KS, differentially expressed escape genes are upregulated while the majority of inactive genes appear unchanged. Interestingly, 94 differentially expressed genes (DEGs) overlapped between TS and female and KS and male comparisons; and these almost uniformly display expression changes into opposite directions. DEGs on the X chromosome and the autosomes are coexpressed in both syndromes, indicating that there are molecular ripple effects of the changes in X chromosome dosage. Six potential candidate genes (RPS4X, SEPT6, NKRF, CX0rf57, NAA10, and FLNA) for KS are identified on Xq, as well as candidate central genes on Xp for TS. Only promoters of inactive genes are differentially methylated in both syndromes while escape gene promoters remain unchanged. The intrachromosomal contact map of the X chromosome in TS exhibits the structure of an active X chromosome. The discovery of shared DEGs indicates the existence of common molecular mechanisms for gene regulation in TS and KS that transmit the gene dosage changes to the transcriptome. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
395. Integrative analysis of single-cell genomics data by coupled nonnegative matrix factorizations.
- Author
-
Zhana Duren, Xi Chen, Zamanighomi, Mahdi, Wanwen Zeng, Satpathy, Ansuman T., Chang, Howard Y., Yong Wang, and Wing Hung Wong
- Subjects
- *
GENOMICS , *CLUSTERING of particles , *SINGLE cell proteins , *RNA sequencing , *NONNEGATIVE matrices - Abstract
When different types of functional genomics data are generated on single cells from different samples of cells from the same heterogeneous population, the clustering of cells in the different samples should be coupled. We formulate this "coupled clustering" problem as an optimization problem and propose the method of coupled nonnegative matrix factorizations (coupled NMF) for its solution. The method is illustrated by the integrative analysis of single-cell RNA-sequencing (RNA-seq) and single-cell ATAC-sequencing (ATAC-seq) data. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
396. General Education as a Gateway for Establishing Self-Directedness.
- Author
-
Chow, Ian C., Hodgson, Paula, Sze-Wing Tang, Wing-Hung Wong, and Yang Yeung
- Subjects
- *
GENERAL education , *LEARNING , *STUDENTS , *UNIVERSITIES & colleges - Abstract
The article discusses principles and guidelines associated with General Education Maps and Markers (GEMs), a project of the Association of American Colleges and Universities. It mentions that Chinese University of Hong Kong (CUHK) adopted a strategic plan that emphasized student-centered learning and learning outcomes. It states that general education program is offered university-wide and in individual colleges, complementing the formal curricula by delivering wholeperson education and care.
- Published
- 2018
397. Applied Linear Regression (Book).
- Author
-
Wing Hung Wong
- Subjects
- *
REGRESSION analysis , *NONFICTION - Abstract
Reviews the book `Applied Linear Regression,' 2nd edition, by Sanford Weisberg.
- Published
- 1987
398. Human tRNA synthetase catalytic nulls with diverse functions.
- Author
-
Wing-Sze Lo, Gardiner, Elisabeth, Zhiwen Xu, Ching-Fun Lau, Feng Wang, Jie J. Zhou, Mendlein, John D., Nangle, Leslie A., Chiang, Kyle P., Xiang-Lei Yang, Kin-Fai Au, Wing Hung Wong, Min Guo, Mingjie Zhang, and Schimmel, Paul
- Subjects
- *
AMINOACYL-tRNA synthetases , *ENZYMES , *PROTEOLYSIS , *PROTEIN splicing , *CELLULAR control mechanisms , *GENETIC regulation - Abstract
Genetic efficiency in higher organisms depends on mechanisms to create multiple functions from single genes. To investigate this question for an enzyme family, we chose aminoacyl tRNA synthetases (AARSs).They are exceptional in their progressive and accretive proliferation of noncatalytic domains as the Tree of Life is ascended. Here we report discovery of a large number of natural catalytic nulls (CNs) for each human AARS. Splicing events retain noncatalytic domains while ablating the catalytic domain to create CNs with diverse functions. Each synthetase is converted into several new signaling proteins with biological activities "orthogonal" to that of the catalytic parent. We suggest that splice variants with nonenzymatic functions may be more general, as evidenced by recent findings of other catalytically inactive splice-variant enzymes. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
399. RNA sequencing reveals a diverse and dynamic repertoire of the Xenopus tropicalis transcriptome over development.
- Author
-
Meng How Tan, Kin Fai Au, Yablonovitch, Arielle L., Wills, Andrea E., Chuang, Jason, Baker, Julie C., Wing Hung Wong, and Jin Billy Li
- Subjects
- *
XENOPUS , *HUMAN embryos , *RNA , *GENES , *GENOMES , *EMBRYOLOGY - Abstract
The Xenopus embryo has provided key insights into fate specification, the cell cycle, and other fundamental developmental and cellular processes, yet a comprehensive understanding of its transcriptome is lacking. Here, we used paired end RNA sequencing (RNA-seq) to explore the transcriptome of Xenopus tropicalis in 23 distinct developmental stages. We determined expression levels of all genes annotated in RefSeq and Ensembl and showed for the first time on a genome- wide scale that, despite a general state of transcriptional silence in the earliest stages of development, approximately 150 genes are transcribed prior to the midblastula transition. In addition, our splicing analysis uncovered more than 10,000 novel splice junctions at each stage and revealed that many known genes have additional unannotated isoforms. Furthermore, we used Cufflinks to reconstruct transcripts from our RNA-seq data and found that ~13.5% of the final contigs are derived from novel transcribed regions, both within introns and in intergenic regions. We then developed a filtering pipeline to separate protein-coding transcripts from noncoding RNAs and identified a confident set of 6686 noncoding transcripts in 3859 genomic loci. Since the current reference genome, XenTro3, consists of hundreds of scaffolds instead of full chromosomes, we also performed de novo reconstruction of the transcriptome using Trinity and uncovered hundreds of transcripts that are missing from the genome. Collectively, our data will not only aid in completing the assembly of the Xenopus tropicalis genome but will also serve as a valuable resource for gene discovery and for unraveling the fundamental mechanisms of vertebrate embryogenesis. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.