22 results on '"Clifford H. Spiegelman"'
Search Results
2. A nonparametric approach based on a Markov like property for classification
- Author
-
Jeongyoun Ahn, Clifford H. Spiegelman, and Eun Sug Park
- Subjects
education.field_of_study ,Markov chain ,business.industry ,Process Chemistry and Technology ,Population ,Nonparametric statistics ,Pattern recognition ,Density estimation ,Linear discriminant analysis ,Computer Science Applications ,Analytical Chemistry ,ComputingMethodologies_PATTERNRECOGNITION ,Statistics::Methodology ,Markov property ,Artificial intelligence ,Marginal distribution ,education ,business ,Spectroscopy ,Software ,Mathematics ,Curse of dimensionality - Abstract
We suggest a new approach for classification based on nonparametricly estimated likelihoods. Due to the scarcity of data in high dimensions, full nonparametric estimation of the likelihood functions for each population is impractical. Instead, we propose to build a class of estimated nonparametric candidate likelihood models based on a Markov property and to provide multiple likelihood estimates that are useful for guiding a classification algorithm. Our density estimates require only estimates of one and two-dimensional marginal distributions, which can effectively get around the curse of dimensionality problem. A classification algorithm based on those estimated likelihoods is presented. A modification to it utilizing variable selection of differences in log of estimated marginal densities is also suggested to specifically handle high dimensional data.
- Published
- 2011
- Full Text
- View/download PDF
3. A computation saving Jackknife approach to receptor model uncertainty statements for serially correlated data
- Author
-
Clifford H. Spiegelman and Eun Sug Park
- Subjects
Process Chemistry and Technology ,Computation ,Bilinear interpolation ,Confidence interval ,Computer Science Applications ,Analytical Chemistry ,Standard error ,Statistics ,Econometrics ,Receptor model ,Jackknife resampling ,Spectroscopy ,Software ,Independence (probability theory) ,Mathematics - Abstract
The use of receptor modeling is now a widely accepted approach to model air pollution data. The resulting estimates of pollution source profiles have error and frequently the uncertainties are obtained under an assumption of independence. In addition traditional Bootstrap approaches are very computationally intensive. We present an intuitive Jackknife alternative that is much less computationally intensive and in simulation examples and actual data seems to demonstrate that it provides wider confidence intervals and larger standard errors for receptor model profile estimates than does the Bootstrap done under the assumption of independence.
- Published
- 2007
- Full Text
- View/download PDF
4. Some aspects of multivariate calibration with incomplete designs
- Author
-
Frits H. Ruymgaart, Clifford H. Spiegelman, Sang Joon Lee, and Joseph M. Conny
- Subjects
Multivariate statistics ,Calibration (statistics) ,Process Chemistry and Technology ,Carry (arithmetic) ,Univariate model ,Inverse ,Multivariate calibration ,Computer Science Applications ,Analytical Chemistry ,Statistics ,Econometrics ,Spectroscopy ,Software ,Mathematics - Abstract
There has been some debate whether inverse or classical calibration methods are superior when there are multivariate predictors and some of them are missing. In this paper, we compare these two methods in the case where the design is not completely known. We develop some general results in the multivariate case and carry out extensive simulations in a univariate model with partly known regressors and several error distributions. These simulations reveal that the methods perform differently, depending on the specifics of the model. Neither method, however, turns out to be consistently superior to the other.
- Published
- 2005
- Full Text
- View/download PDF
5. Locating nearby sources of air pollution by nonparametric regression of atmospheric concentrations on wind direction
- Author
-
Yu-Shuo Chang, Clifford H. Spiegelman, and Ronald C. Henry
- Subjects
Atmospheric Science ,Meteorology ,Air pollution ,Triangulation (social science) ,Regression analysis ,Wind direction ,medicine.disease_cause ,Confidence interval ,Nonparametric regression ,symbols.namesake ,Gaussian function ,symbols ,medicine ,Environmental science ,Air quality index ,General Environmental Science - Abstract
The relationship of the concentration of air pollutants to wind direction has been determined by nonparametric regression using a Gaussian kernel. The results are smooth curves with error bars that allow for the accurate determination of the wind direction where the concentration peaks, and thus, the location of nearby sources. Equations for this method and associated confidence intervals are given. A nonsubjective method is given to estimate the only adjustable parameter. A test of the method was carried out using cyclohexane data from 1997 at two sites near a heavy industrial region in Houston, Texas, USA. According to published emissions inventories, 70% of the cyclohexane emissions are from one source. Nonparametric regression correctly identified the direction of this source from each site. The location of the source determined by triangulation of these directions was
- Published
- 2002
- Full Text
- View/download PDF
6. Comparing a new algorithm with the classic methods for estimating the number of factors
- Author
-
Eun Sug Park, Ronald C. Henry, and Clifford H. Spiegelman
- Subjects
Trace (linear algebra) ,Covariance matrix ,Process Chemistry and Technology ,Analytic model ,Bartlett's test ,Cross-validation ,Computer Science Applications ,Analytical Chemistry ,Set (abstract data type) ,Indicator function ,Algorithm ,Spectroscopy ,Software ,Eigenvalues and eigenvectors ,Mathematics - Abstract
This paper presents and compares a new algorithm for finding the number of factors in a data analytic model. After we describe the new method, called NUMFACT, we compare it with standard methods for finding the number of factors to use in a model. The standard methods that we compare NUMFACT with are Malinowski's indicator function, Wold's cross-validation approach, Bartlett's test, scree plots, the rule-of-one, and using the number of factors (eigenvectors) needed to explain 90% of the trace of a correlation matrix. Using a diverse set of real applications, NUMFACT is shown to be the clear method of choice.
- Published
- 1999
- Full Text
- View/download PDF
7. A novel peak-hopping stepwise feature selection method with application to Raman spectroscopy1This paper is dedicated to the memory of Jean Thomas Clerc: scientist, editor, luminary, and dog breeder.1
- Author
-
Michael J. McShane, Clifford H. Spiegelman, Brent D. Cameron, Massoud Motamedi, and Gerard L. Coté
- Subjects
Chemistry ,Sample (statistics) ,Feature selection ,Biochemistry ,Analytical Chemistry ,Ranking (information retrieval) ,Chemometrics ,Partial least squares regression ,Statistics ,Genetic algorithm ,Environmental Chemistry ,Point (geometry) ,Algorithm ,Spectroscopy ,Selection (genetic algorithm) - Abstract
A new stepwise approach to variable selection for spectroscopy that includes chemical information and attempts to test several spectral regions producing high ranking coefficients has been developed to improve on currently available methods. Existing selection techniques can, in general, be placed into two groups: the first, time-consuming optimization approaches that ignore available information about sample chemistry and require considerable expertise to arrive at appropriate solutions (e.g. genetic algorithms), and the second, stepwise procedures that tend to select many variables in the same area containing redundant information. The algorithm described here is a fast stepwise procedure that uses multiple ranking chains to identify several spectral regions correlated with known sample properties. The multiple-chain approach allows the generation of a final ranking vector that moves quickly away from the initial selection point, testing several areas exhibiting correlation between spectra and composition early in the stepping procedure. Quantitative evidence of the success of this approach as applied to Raman spectroscopy is given in terms of processing speed, number of selected variables, and prediction error in comparison with other selection methods. In this respect, the procedure described here may be considered as a significant evolutionary step in variable selection algorithms.
- Published
- 1999
- Full Text
- View/download PDF
8. Asymptotic minimax calibration estimates
- Author
-
Suojin Wang, Clifford H. Spiegelman, and Michael C. Denham
- Subjects
Mathematical optimization ,Calibration (statistics) ,Process Chemistry and Technology ,Multivariate calibration ,Feature selection ,Minimax ,Computer Science Applications ,Analytical Chemistry ,Nonlinear system ,Simple (abstract algebra) ,Partial least squares regression ,Applied mathematics ,Spectroscopy ,Software ,Mathematics - Abstract
This paper gives methods that use measurements from calibrated instruments in an effective and understandable manner. While some chemometric methods such as partial least squares might be considered, the procedures that we use are more transparent. In this paper two simple methods are proposed that use standard and saddlepoint approximations to combine nonlinear estimates from different regions of the instrument response. The asymptotic accuracy of the approximations is discussed. A worked example is given. A simulation study is also reported that supports our recommendations.
- Published
- 1996
- Full Text
- View/download PDF
9. Detecting interactions using low dimensional searches in high dimensional data
- Author
-
Clifford H. Spiegelman and C.Y. Wang
- Subjects
Clustering high-dimensional data ,Chemometrics ,Computer science ,Process Chemistry and Technology ,Monte Carlo method ,Data mining ,computer.software_genre ,computer ,Spectroscopy ,Software ,Computer Science Applications ,Analytical Chemistry - Abstract
One important issue in chemometrics is to detect interactions among several factors. In this paper, we propose methods that detect interactions using low dimensional smoothers. Two methods are investigated and compared with usual least squared methods via Monte Carlo simulations. In addition, we show, using real data, how the methods affect our decisions.
- Published
- 1994
- Full Text
- View/download PDF
10. Applying and developing receptor models to the 1990 El Paso air data: a look at receptor modeling with uncharacterized sources and graphical diagnostics
- Author
-
Clifford H. Spiegelman and Stuart Dattner
- Subjects
Chemistry ,Completeness (order theory) ,Environmental Chemistry ,Sampling (statistics) ,Data mining ,computer.software_genre ,Biochemistry ,computer ,Spectroscopy ,Analytical Chemistry - Abstract
This paper represents an ongoing receptor modeling research of airborne species in El Paso, Texas. It represents a six month collaboration between the authors. It extends the case study reported by Spiegelman and Dattner in 1992. For completeness the background material is reviewed.
- Published
- 1993
- Full Text
- View/download PDF
11. Plotting aids for multivariate calibration and chemostatistics
- Author
-
Clifford H. Spiegelman
- Subjects
Chemometrics ,Multivariate analysis ,Calibration (statistics) ,Process Chemistry and Technology ,Statistics ,Multivariate calibration ,Spectroscopy ,Software ,Computer Science Applications ,Analytical Chemistry ,Mathematics - Abstract
Spiegelman, C.H., 1992. Plotting aids for multivariate calibration and chemostatistics. Chemometrics and Intelligent Laboratory Systems , 15: 29–38. There are few published procedures for plotting multivariate calibration data. In this paper I give some new plotting techniques that have been useful in my research and consulting. There are plots for detecting matrix effects, and other violations of Beer's law, plots for checking the selectivity of channels and frequencies, and plots that help diagnose the importance of frequencies within a peak.
- Published
- 1992
- Full Text
- View/download PDF
12. Bias correcting confidence intervals for a nearly common property
- Author
-
Daren B. H. Cline and Clifford H. Spiegelman
- Subjects
Systematic error ,Process Chemistry and Technology ,Confidence interval ,Robust confidence intervals ,Computer Science Applications ,Analytical Chemistry ,Random error ,Statistics ,Credible interval ,Confidence distribution ,Common property ,Algorithm ,Spectroscopy ,Software ,CDF-based nonparametric confidence interval ,Mathematics - Abstract
Cline, D.B.H. and Spiegelman, C.H., 1991. Bias correcting confidence intervals for a nearly common property. Chemometrics and Intelligent Laboratory System, 11: 131–136. Confidence intervals are an important tool. Realistic confidence intervals account for both random errors and systematic errors (bias). We improve the usual method for combining random and systematic errors. The new methods are simple and often result in increased accuracy for confidence interval levels.
- Published
- 1991
- Full Text
- View/download PDF
13. A statistical method for calibrating flame emission spectrometry which takes account of errors in the calibration standards
- Author
-
Linda Hungwu, Clifford H. Spiegelman, and Robert L. Watters
- Subjects
Observational error ,Calibration curve ,Process Chemistry and Technology ,Data transformation (statistics) ,Studentized residual ,Computer Science Applications ,Analytical Chemistry ,Linear regression ,Outlier ,Statistics ,Calibration ,Influential observation ,Spectroscopy ,Software ,Mathematics - Abstract
Spiegelman, C.H., Watters, R.L. and Hungwu, L., 1991. A statistical method for calibrating flame emission spectrometry which takes account of errors in the calibration standards. Chemometrics and Intelligent Laboratory Systems , 11: 121–130. The determination of potassium in sample solutions using flame emission spectrometry (FES) requires that the calibration function be estimated. Calibration standard solutions of potassium are made in the laboratory and nebulized into the FES instrument. A total of 240 data points were collected from chemical analyses. However the data come from only eight different values of the standards and are highly correlated within each standard. In the final analysis a sample size of eight was chosen to estimate the calibration curve. The main goal of this paper is to estimate the calibration function based on these data and then measure the amount of potassium in samples using this calibration function. The secondary goal is to show some of the important exploratory data analysis that should be done in any calibration. Since both the underlying theory of emission spectrometry and the scatter plot of data points suggest a linear relationship between the emission intensity and potassium concentration, a linear regression model is applied to fit these data and the residuals are examined based on regression assumptions. Data transformation is then attempted to stabilize the nonconstant variance of the residuals due to the fact that residuals fail to meet the assumptions. However, because the suggested transformation of taking the logarithm or 1/4 power of both x and y is hard to interpret, and because the log transformation would require an addition of an arbitrary constant to the standard values, we proceeded with the untransformed data. Outlier detection was used to find possible outliers. Ten consecutive observations (obs. 201–210) in the data set are potential outliers for they have absolute studentized residuals bigger than 2.7. However, influential observation techniques indicate that their effect on the estimation of the calibration curve is not great. In order to help compensate for the error in the calibration standards, we expand the calibration interval estimates. This compensation is important and helps to avoid the rather ad-hoc deletion of unusually influential data from the analysis. We think that a plausible explanation for the outliers is error in the calibration standards. In recognition of the heterogeneity of variance indicated by our data, we perform a weighted least squares type of confidence interval estimation for our calibration curve. The coefficient and standard error estimates are quite close for all weighted cases; in contrast, the unweighted case yields different values for the standard errors. If heteroscedasticity (nonconstant variance of the observations) is ignored, confidence intervals will be too wide at the low end and too narrow at the high end of the calibration curve. At both ends of the calibration curve, each resulting multiple-use calibration confidence interval is somewhat wider than the corresponding single-use calibration confidence interval. Finally, since the measurement errors that have a known finite bound in working standards have been taken into account, the increase in confidence intervals relative to presumed exact standards is about 0.1%.
- Published
- 1991
- Full Text
- View/download PDF
14. Celebrating 30 years of publishing chemometrics for chemometricians and others
- Author
-
Philip K. Hopke and Clifford H. Spiegelman
- Subjects
Chemometrics ,Engineering ,Management science ,Publishing ,business.industry ,Process Chemistry and Technology ,business ,Combinatorial chemistry ,Spectroscopy ,Software ,Computer Science Applications ,Analytical Chemistry - Published
- 2015
- Full Text
- View/download PDF
15. Statistical software packages for the macintosh
- Author
-
Clifford H. Spiegelman
- Subjects
business.industry ,Computer science ,Process Chemistry and Technology ,Software engineering ,business ,Spectroscopy ,Software ,Statistical software ,Computer Science Applications ,Analytical Chemistry - Published
- 1990
- Full Text
- View/download PDF
16. BBN/Catalyst version 1.4
- Author
-
Clifford H. Spiegelman
- Subjects
Chemical engineering ,Process Chemistry and Technology ,Spectroscopy ,Software ,Computer Science Applications ,Analytical Chemistry ,Catalysis - Published
- 1992
- Full Text
- View/download PDF
17. StatView II and superANOVA
- Author
-
Clifford H. Spiegelman
- Subjects
Process Chemistry and Technology ,Statistics ,StatView ,Spectroscopy ,Software ,Computer Science Applications ,Analytical Chemistry ,Mathematics - Published
- 1991
- Full Text
- View/download PDF
18. SYSTAT 5.0 and MINITAB release 8
- Author
-
Clifford H. Spiegelman
- Subjects
Process Chemistry and Technology ,Spectroscopy ,Software ,Computer Science Applications ,Analytical Chemistry - Published
- 1991
- Full Text
- View/download PDF
19. JMP®, JMP IN® , and JMP Serve ™
- Author
-
Clifford H. Spiegelman
- Subjects
Process Chemistry and Technology ,Spectroscopy ,Software ,Computer Science Applications ,Analytical Chemistry - Published
- 1991
- Full Text
- View/download PDF
20. A computational examination of orthogonal distance regression
- Author
-
Clifford H. Spiegelman, Janet R. Donaldson, Paul T. Boggs, and Robert B. Schnabel
- Subjects
Economics and Econometrics ,Variables ,Applied Mathematics ,media_common.quotation_subject ,Regression analysis ,Least squares ,Regression ,Orthogonality ,Ordinary least squares ,Statistics ,Curve fitting ,Total least squares ,media_common ,Mathematics - Abstract
Ordinary least squares (OLS) is one of the most commonly used criteria for fitting data to models and for estimating parameters. Orthogonal distance regression (ODR) extends least squares data fitting to problems with independent variables that are not known exactly. In this paper, we present the results of an empirical study designed to compare OLS to ODR for fitting both linear and non-linear models when there are errors in the independent variables. The results indicate that, for the data and performance criteria considered, ODR never performs appreciably worse than OLS and often performs considerably better.
- Published
- 1988
- Full Text
- View/download PDF
21. Sensitivity of trends in geometric mean blood levels to random measurement errors
- Author
-
I.H. Billick, D.R. Shier, and Clifford H. Spiegelman
- Subjects
Environmental Engineering ,Observational error ,Urban Population ,Series (mathematics) ,Computers ,Statistics as Topic ,Statistical model ,Pollution ,Lead ,Child, Preschool ,Additive function ,Statistics ,Humans ,Environmental Chemistry ,New York City ,Sensitivity (control systems) ,Geometric mean ,Waste Management and Disposal ,Mathematics ,Non-sampling error ,Probability - Abstract
A statistical model is investigated that expresses observations, such as blood lead levels, as an additive function of true levels and random measurement errors. Both empirical results (obtained from a series of computer simulation experiments) and theoretical results indicate how certain summary statistics for the observations vary in response to random measurement errors. Such results are applied to a very large data base of pediatric blood lead levels collected in New York City during 1970–1976, and they indicate that the observed trends in geometric mean blood lead levels are not significantly altered by the possible presence of measurement errors.
- Published
- 1982
- Full Text
- View/download PDF
22. A minimax approach to combining means, with practical examples
- Author
-
Clifford H. Spiegelman, Charles P. Reeve, and Keith R. Eberhardt
- Subjects
Mathematical optimization ,Mean squared error ,Computer science ,Process Chemistry and Technology ,Sample (statistics) ,Minimax ,Confidence interval ,Computer Science Applications ,Analytical Chemistry ,Statistics ,Applied mathematics ,Statistical theory ,Minimax estimator ,Weighted arithmetic mean ,Spectroscopy ,Software ,Reciprocal ,Mathematics - Abstract
Eberhardt, K.R., Reeve, C.P. and Spiegelman, C.H., 1989. A minimax approach to combining means, with practical examples. Chemometrics and Intelligent Laboratory Systems , 5: 129–148. This paper describes a method for combining sample means that accounts for bias in those means. It compares the unweighted mean, the weighted mean using reciprocal estimated variances for weights, and a minimax weighted mean. When the individual means are subject to nontrivial biases we show that the minimax estimator can lead to important decreases in mean squared error and confidence interval width. Our recommendations are based on statistical theory and on simulations based on three Standard Reference Material data sets.
- Published
- 1989
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.