334 results
Search Results
2. CONTINUATION OF DR JONES'S PAPER
- Author
-
Maurice G. Kendall
- Subjects
Statistics and Probability ,Continuation ,Psychoanalysis ,Applied Mathematics ,General Mathematics ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Mathematics - Published
- 1948
3. NOTE ON SOME POINTS IN 'STUDENT'S' PAPER ON 'COMPARISON BETWEEN BALANCED AND RANDOM ARRANGEMENTS OF FIELD PLOTS'
- Author
-
J. Neyman and E. S. Pearson
- Subjects
Statistics and Probability ,Field plot ,Applied Mathematics ,General Mathematics ,Statistics ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Mathematics - Published
- 1938
4. (ii) The Game of Heads and Tails--Some Notes on M. Paul Levy's Paper: 'Nuove Formule Relative al Giuoco di Testa e Croce.'
- Author
-
E. C. Fieller
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Humanities ,Mathematics - Published
- 1931
5. THE OUTCOME OF A STOCHASTIC EPIDEMIC—A NOTE ON BAILEY'S PAPER
- Author
-
P. Whittle
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Outcome (game theory) ,Mathematical economics ,Mathematics - Published
- 1955
6. ON THE DISTRIBUTION OF THE CORRELATION COEFFICIENT IN SMALL SAMPLES. APPENDIX II TO THE PAPERS OF 'STUDENT' AND R. A. FISHER. A COOPERATIVE STUDY
- Author
-
H. E. Soper, A. W. Young, A. Lee, Karl Pearson, and B. M. Cave
- Subjects
Statistics and Probability ,Distribution (number theory) ,Correlation coefficient ,Intraclass correlation ,Applied Mathematics ,General Mathematics ,Fisher transformation ,Correlation ratio ,Agricultural and Biological Sciences (miscellaneous) ,Spearman's rank correlation coefficient ,Pearson product-moment correlation coefficient ,symbols.namesake ,Statistics ,symbols ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Mathematics - Published
- 1917
7. THOUGHTS SUGGESTED BY THE PAPERS OF MESSRS WELCH AND KOŁODZIEJCZYK
- Author
-
Karl Pearson
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Classics ,Mathematics - Published
- 1935
8. (i) Note on Dr Burnside's Paper on Errors of Observation
- Author
-
Ethel M. Newbold
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Calculus ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Mathematics - Published
- 1923
9. FIRST RESULTS FROM THE OXFORD ANTHROPOMETRIC LABORATORY: (A paper read before the British Association (Section D) at Sheffield, 1910.)
- Author
-
E. Schuster
- Subjects
Statistics and Probability ,Anthropology ,Applied Mathematics ,General Mathematics ,Association (object-oriented programming) ,Section (typography) ,Statistics, Probability and Uncertainty ,Anthropometry ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Mathematics - Abstract
n/a
- Published
- 1911
10. NOTE ON PROFESSOR HALDANE'S PAPER REGARDING THE TREATMENT OF RARE EVENTS
- Author
-
E. S. Pearson
- Subjects
Statistics and Probability ,Operations research ,Applied Mathematics ,General Mathematics ,Rare events ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Epistemology ,Mathematics - Published
- 1948
11. (v) Note on Professor Narumi's Paper. (See Vol. xv. p. 253.)
- Author
-
B. H. Camp
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Humanities ,Mathematics - Published
- 1923
12. APPENDIX TO A PAPER BY PROFESSOR TOKISHIGE HOJO
- Author
-
Karl Pearson
- Subjects
Statistics and Probability ,medicine.anatomical_structure ,Applied Mathematics ,General Mathematics ,medicine ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Appendix ,Classics ,Mathematics - Published
- 1931
13. Note on Paper published in Biometrika, Vol. XIX
- Author
-
J. O. Irwin
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Classics ,Mathematics - Published
- 1929
14. (ii) Note on Karl Pearson's Paper: 'On a method of ascertaining limits to the actual number of marked members in a population of given size from a sample.'
- Author
-
K. Raghavan Nair
- Subjects
Statistics and Probability ,education.field_of_study ,Applied Mathematics ,General Mathematics ,Statistics ,Population ,Sample (statistics) ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,education ,Agricultural and Biological Sciences (miscellaneous) ,Karl pearson ,Mathematics - Published
- 1936
15. 2×2 TABLES. A NOTE ON E. S. PEARSON'S PAPER
- Author
-
G. A. Barnard
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Humanities ,Mathematics - Published
- 1947
16. A note on Craig's paper on the minimum of binomial variates
- Author
-
B. K. Shah
- Subjects
Statistics and Probability ,Binomial approximation ,Applied Mathematics ,General Mathematics ,Negative binomial distribution ,Continuity correction ,Agricultural and Biological Sciences (miscellaneous) ,Negative multinomial distribution ,Binomial distribution ,Beta-binomial distribution ,Statistics ,Multinomial distribution ,Statistics, Probability and Uncertainty ,Binomial proportion confidence interval ,General Agricultural and Biological Sciences ,Mathematics - Abstract
Craig (1962) described an application of order statistics from a binomial distribution and gave an approximate expression for the mean value of the minimum of two binomial variables having the same probability of success and the same number of trials. In this note we suggest another approximation for the mean and also for the standard deviation of the minimum, using normal order statistics tabulated by Teichroew (1956).
- Published
- 1966
17. NOTE ON MR SRIVASTAVA'S PAPER ON THE POWER FUNCTION OF STUDENT'S TEST
- Author
-
Egon S. Pearson
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Calculus ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Power function ,Agricultural and Biological Sciences (miscellaneous) ,Mathematics ,Test (assessment) - Published
- 1958
18. Two early papers on the relation between extreme values and tensile strength
- Author
-
Julius Lieblein
- Subjects
Statistics and Probability ,medicine.medical_specialty ,Injury control ,Applied Mathematics ,General Mathematics ,Poison control ,Human factors and ergonomics ,Agricultural and Biological Sciences (miscellaneous) ,Suicide prevention ,Occupational safety and health ,Ultimate tensile strength ,Injury prevention ,Physical therapy ,medicine ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Extreme value theory ,Mathematics - Published
- 1954
19. Note on J. B. S. Haldane's Paper: 'The Exact Value of the Moments of the Distribution of χ 2 .'
- Author
-
W. G. Cochran
- Subjects
Statistics and Probability ,Contingency table ,Distribution (number theory) ,Applied Mathematics ,General Mathematics ,Degrees of freedom ,Variance (accounting) ,Term (logic) ,Agricultural and Biological Sciences (miscellaneous) ,Part iii ,Combinatorics ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Value (mathematics) ,Independence (probability theory) ,Mathematics - Abstract
IN this paper Haldane points out (p. 142) a difference between his results for the mean and variance of x2 in a 2 x n-fold contingency table when the expectation p is fixed and the results obtained by me in my paper (Annals of Eugenics, Vol. VII, part III, p. 211). The difference is that I have (n 1) throughout where Haldane has n. Haldane writes, "my own results would appear to be slightly more accurate than Cochran's", which might, I think, give the impression that both Haldane's results and mine are only approximations. In fact, both results are mathematically exact, the difference between them being one of definition of x2. My paper is almost entirely concerned with the distribution of x2 when the expectation p is not known. In the results which I gave for the distribution of x2 when p is known, I retained the term S (x -x-)2 in the numerator of x2 instead of S (x np)2, to facilitate comparison between this and my other results. Thus my x2 has (n 1) degrees of freedom, whereas Haldane's x2 has n degrees of freedom. Unfortunately I did not emphasize this point in the passage concerned, and as it may have appeared misleading to others besides Haldane, I welcome this opportunity of drawing attention to it. Haldane's x2 is, of course, the one which is normally appropriate in testing the departure from independence when the expectation is known.
- Published
- 1938
20. Note on Dr Burnside's Paper on Errors of Observation
- Author
-
Ethel M. Newbold
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Calculus ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Mathematics - Published
- 1923
21. Note on Professor Narumi's Paper
- Author
-
B. H. Camp
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Art history ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Mathematics - Published
- 1923
22. Thoughts Suggested by the Papers of Messrs Welch and Kolodziejcyk ( Biometrika, Vol. XXVII. pp. 145-190)
- Author
-
Karl Pearson
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Classics ,Mathematics - Published
- 1935
23. Note on Dr Usher's Paper on Epicanthus (pp. 5--25 of this volume)
- Author
-
Karl Pearson
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Statistics, Probability and Uncertainty ,Epicanthus ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Humanities ,Mathematics ,Volume (compression) - Published
- 1935
24. On the Distribution of the Correlation Coefficient in Small Samples. Appendix II to the Papers of 'Student' and R. A. Fisher
- Author
-
Karl Pearson, A. W. Young, A. Lee, H. E. Soper, and B. M. Cave
- Subjects
Statistics and Probability ,Correlation coefficient ,Distribution (number theory) ,Intraclass correlation ,Applied Mathematics ,General Mathematics ,Fisher transformation ,Correlation ratio ,Agricultural and Biological Sciences (miscellaneous) ,Pearson product-moment correlation coefficient ,symbols.namesake ,Statistics ,symbols ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Mathematics - Published
- 1917
25. I. Note on the Paper by Dr J. Wishart in the present Volume (XXA)
- Author
-
J. Wishart
- Subjects
Statistics and Probability ,Wishart distribution ,Applied Mathematics ,General Mathematics ,Calculus ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Mathematics ,Volume (compression) - Published
- 1928
26. On the Bias of Various Estimators of the Logit and Its Variance with Application to Quantal Bioassay
- Author
-
John J. Gart and James R. Zweifel
- Subjects
Contingency table ,Statistics and Probability ,Applied Mathematics ,General Mathematics ,Logit ,Estimator ,Variance (accounting) ,Agricultural and Biological Sciences (miscellaneous) ,Unpublished paper ,Statistics ,Econometrics ,Bioassay ,Logistic function ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Mathematics - Abstract
SUMMARY The bias of several logit estimators and their corresponding variance estimators is investigated in small samples. Their use in quantal bioassay is similarly explored. The logit transformation has been suggested in the analysis of higher dimensional contingency tables, by Woolf (1954) and many others more recently, and also in estimating the parameters of the logistic function in the quantal bioassay problem (Berkson, 1944, 1953). Various modifications of the logit have been suggested by Berkson (1953), Haldane (1955), Anscombe (1956), Tukey, mentioned by Anscombe (1956), and Hitchcock (1962). Modifications of its usual variance estimator have been proposed by Haldane, Goodman (1964) and Gart (1966). More recently Goodman, in an unpublished paper, has derived several further modifications of his estimator. In this paper we present a numerical comparison of the bias of these estimators and give conditions under which one or the other may be preferred. The use of logits in the quantal bioassay problem is briefly explored with particular reference to the asymptotic results of Hitchcock regarding the bias of the estimators.
- Published
- 1967
27. Exploratory latent structure analysis using both identifiable and unidentifiable models
- Author
-
Leo A. Goodman
- Subjects
Statistics and Probability ,Contingency table ,Applied Mathematics ,General Mathematics ,Polytomous Rasch model ,Latent variable ,Agricultural and Biological Sciences (miscellaneous) ,Latent class model ,Structural equation modeling ,Statistics ,Econometrics ,Statistics, Probability and Uncertainty ,Local independence ,General Agricultural and Biological Sciences ,Latent variable model ,Mathematics ,Variable (mathematics) - Abstract
SUMMARY This paper considers a wide class of latent structure models. These models can serve as possible explanations of the observed relationships among a set of m manifest polytomous variables. The class of models considered here includes both models in which the parameters are identifiable and also models in which the parameters are not. For each of the models considered here, a relatively simple method is presented for calculating the maximum likeli- hood estimate of the frequencies in the m-way contingency table expected under the model, and for determining whether the parameters in the estimated model are identifiable. In addition, methods are presented for testing whether the model fits the observed data, and for replacing unidentifiable models that fit by identifiable models that fit. Some illus- trative applications to data are also included. This paper deals with the relationships among m polytomous variables, i.e. with the analysis of an m-way contingency table. These m variables are manifest variables in that, for each observed individual in a sample, his class with respect to each of the m variables is observed. We also consider here polytomous variables that are latent in that an individ- ual's class with respect to these variables is not observed. The classes of a latent variable will be called latent classes. Consider first a 4-way contingency table which cross-classifies a sample of n individuals with respect to four manifest polytomous variables A, B, C and D. If there is, say, some latent dichotomous variable X, so that each of the n individuals is in one of the two latent classes with respect to this variable, and within the tth latent class the manifest variables (A, B, C, D) are mutually independent, then this two-class latent structure would serve as a simple explanation of the observed relationships among the variables in the 4-way con- tingency table for the n individuals. There is a direct generalization when the latent variable has T classes. We shall present some relatively simple methods for determining whether the observed relationships among the variables in the m-way contingency table can be explained by a T-class structure, or by various modifications and extensions of this latent structure. To illustrate the methods we analyze Table 1, a 24 contingency table presented earlier by Stouffer & Toby (1951, 1962, 1963), which cross-classifies 216 respondents with respect to whether they tend towards universalistic values ( + ) or particularistic values (-) when confronted by each of four different situations of role conflict. The letters A, B, C and D in
- Published
- 1974
28. Bayesian analysis of the two-sample problem under the Lehmann alternatives
- Author
-
R. J. Brooks
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Bayesian probability ,Agricultural and Biological Sciences (miscellaneous) ,Bayesian statistics ,Combinatorics ,Distribution function ,Distribution (mathematics) ,Bayesian experimental design ,Rank (graph theory) ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Statistic ,Mathematics ,Weibull distribution - Abstract
Papers that have been written on the use of the rank order statistic for the two-sample problem under the Lehmann alternatives have focused on the use of rank tests to determine whether there is a difference in the two populations. In particular, the test proposed by Savage (1956) is the rank test that is usually suggested to be used in this situation. In this paper a Bayesian analysis of the problem based on the rank order statistic is presented. Suppose that a random sample of m observations is obtained from distribution 1 and an independent random sample of n observations is obtained from distribution 2, and N = m+n. The rank order statistic will be denoted by z = (z1, ...,ZN), where zi is 0 if the ith smallest observation is from distribution 1 and zi is 1 otherwise (i -1, ..., N). Let F(x) and G(x) be the distribution functions of distributions 1 and 2, respectively, both assumed to be continuous. If G(x) is the Lehmann alternative 1{1 F(x)}k(k > 0), which holds, for example, if the population distributions are both exponential or both Weibull, then the rank order probability is
- Published
- 1974
29. ON A COMPREHENSIVE TEST FOR THE HOMOGENEITY OF VARIANCES AND COVARIANCES IN MULTIVARIATE PROBLEMS
- Author
-
D. J. Bishop
- Subjects
Statistics and Probability ,education.field_of_study ,Studentized range ,Applied Mathematics ,General Mathematics ,Population ,Univariate ,Bartlett's test ,Covariance ,Agricultural and Biological Sciences (miscellaneous) ,Sampling distribution ,Statistics ,F-test of equality of variances ,Multiple correlation ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,education ,Mathematics - Abstract
Now that satisfactory and probably final solutions have been obtained for a wide variety of statistical problems concerned with a single normally distributed variable, more and more attention has recently been given to the solution of multivariate problems. The multiple correlation methods of the old large sample theory have been replaced in many instances by others for which " studentized " test criteria are available, often having sampling distributions that are already familiar in univariate problems. In a recent paper on " The statistical utilization of multiple measurements", R. A. Fisher (1938a) has shown the connexion between certain of these methods: the D2-statistic work of Mahalanobis, the discriminant function methods of the Galton Laboratory and the generalized " Student's " ratio of Hotelling. A similar very general problem was dealt with some time ago by S. S. Wilks (1932), while mention may also be made of two papers by D. G. Lawley (1938 a, b) and a paper by P. L. Hsu (1938). The purpose of the methods put forward is to obtain information regarding the mean values of a number, say q, of correlated variables in one or more, say k, populations from which random samples have been drawn. If we denote by x8 a value of the sth variable (s = 1, 2, ..., q), then in all this work it has been assumed not only that x8 is normally distributed, but that it has the same variance o2 in every population sampled. Further, it is assumed that if x. is a second variable the correlation coefficients psu between x8 and x. is the same in all populations. The estimates of variance and covariance required in order to "studentize" the function of the sample means are therefore obtained by pooling together the sums of squares and sums of products from all samples. While it is true that even if oa, and psu are not the same in all populations the error involved may not be very large, it is however important to have available some means of testing the basic hypothesis which assumes homogeneity throughout the populations. Such a test has been derived by S. S. Wilks (1932) by an extension of Neyman & Pearson's likelihood ratio method of approach. Hitherto the somewhat lengthy computations required to obtain the moments of the sampling distribution of the test criterion have probably discouraged its use. The objects of the present paper are as follows: (a) In the simple but commonly met case, where the k samples are of the same
- Published
- 1939
30. An analysis of variance test for normality (complete samples)
- Author
-
S. S. Shapiro and M. B. Wilk
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Omnibus test ,Lilliefors test ,Agricultural and Biological Sciences (miscellaneous) ,Normality test ,Goodness of fit ,Statistics ,Econometrics ,Test statistic ,Z-test ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,ANOVA on ranks ,Rankit ,Mathematics - Abstract
The main intent of this paper is to introduce a new statistical procedure for testing a complete sample for normality. The test statistic is obtained by dividing the square of an appropriate linear combination of the sample order statistics by the usual symmetric estimate of variance. This ratio is both scale and origin invariant and hence the statistic is appropriate for a test of the composite hypothesis of normality. Testing for distributional assumptions in general and for normality in particular has been a major area of continuing statistical research-both theoretically and practically. A possible cause of such sustained interest is that many statistical procedures have been derived based on particular distributional assumptions-especially that of normality. Although in many cases the techniques are more robust than the assumptions underlying them, still a knowledge that the underlying assumption is incorrect may temper the use and application of the methods. Moreover, the study of a body of data with the stimulus of a distributional test may encourage consideration of, for example, normalizing transformations and the use of alternate methods such as distribution-free techniques, as well as detection of gross peculiarities such as outliers or errors. The test procedure developed in this paper is defined and some of its analytical properties described in ? 2. Operational information and tables useful in employing the test are detailed in ? 3 (which may be read independently of the rest of the paper). Some examples are given in ? 4. Section 5 consists of an extract from an empirical sampling study of the comparison of the effectiveness of various alternative tests. Discussion and concluding remarks are given in ?6. 2. THE W TEST FOR NORMALITY (COMPLETE SAMPLES) 2 1. Motivation and early work This study was initiated, in part, in an attempt to summarize formally certain indications of probability plots. In particular, could one condense departures from statistical linearity of probability plots into one or a few 'degrees of freedom' in the manner of the application of analysis of variance in regression analysis? In a probability plot, one can consider the regression of the ordered observations on the expected values of the order statistics from a standardized version of the hypothesized distribution-the plot tending to be linear if the hypothesis is true. Hence a possible method of testing the distributional assumptionis by means of an analysis of variance type procedure. Using generalized least squares (the ordered variates are correlated) linear and higher-order
- Published
- 1965
31. Sequential estimation of the size of a population
- Author
-
P. R. Freeman
- Subjects
Statistics and Probability ,Sequential estimation ,Lever ,education.field_of_study ,business.product_category ,Applied Mathematics ,General Mathematics ,Maximum likelihood ,Population ,Agricultural and Biological Sciences (miscellaneous) ,Quadratic equation ,Sample size determination ,Statistics ,Ball (bearing) ,Bibliography ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,business ,education ,Mathematics - Abstract
When estimating the size of a finite population, it is possible to consider, as an alternative to the capture-recapture method, a sequential scheme. Suppose an urn contains an unknown number, N, balls, initially all white. A single ball is drawn at random and if it is white it is painted black and returned to the urn, while if it is black, indicating that it has already been sampled at least once before, it is returned unchanged. Thus initially nearly all drawings will be of white balls, but after a long time mostly black balls will appear. Somewhere in between we wish to stop sampling and produce an estimate of N. This scheme was first proposed by Goodman (1949, 1953). The most recent treatment of this problem is by Samuel (1968, 1969), who considered asymptotic distributions of sample sizes and maximum likelihood estimates for each of four stopping rules. The crucial question of the choice between these rules remained unanswered, however, since the balance between cost of sampling and risk due to inaccurate estimation was not considered. It is precisely this which is the aim of the present paper. A bibliography of previous, non-Bayesian, work is given by Samuel (1968). Since preparing the first draft of this paper, the author has seen an unpublished report by D. G. Hoel and W. E. Lever in which this problem is formulated in a Bayesian framework, and a small numerical solution given for a quadratic loss function, and with an upper limit on the value of N.
- Published
- 1972
32. ON THE UTILIZATION OF MARKED SPECIMENS IN ESTIMATING POPULATIONS OF FLYING INSECTS
- Author
-
C. C. Craig
- Subjects
Statistics and Probability ,Estimation ,education.field_of_study ,biology ,Applied Mathematics ,General Mathematics ,Population size ,Population ,Boundary (topology) ,biology.organism_classification ,Agricultural and Biological Sciences (miscellaneous) ,Moment (mathematics) ,Sample size determination ,Statistics ,Butterfly ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Colias eurytheme ,education ,Mathematics - Abstract
Professor William Hovanitz called my attention to the following problem: An observer catches butterflies, marks them, and immediately releases them. It is assumed that a butterfly, no matter how many times it has been caught before, has the same susceptibility to capture as any other butterfly in the population which is supposed stable while the captures are being made. Records are kept of f, the frequency of cases in which the same butterfly is caught x times, x = 1, 2, ..., until a total of s captures of r different butterflies have been made. (Efx = r; Zxfx = s.) The number, fo, of butterflies which escape is not observed; the problem is to estimate from the values offx the total population n of butterflies on the area assumed well defined. The estimation of biological populations by means of capture-recapture data is by no means a new problem, though papers dealing with it from the mathematical-statistical point of view are largely quite recent. (In particular, see the papers by Leslie & Chitty (1951), Bailey (1951), Moran (1951, 1952), and the bibliographies quoted by them.) However, the experimental conditions and the mathematical models for the present study arpear to differ in essential ways from those previously considered. The important point of departure is that each butterfly on being netted is immediately marked (with a spot of nail polish) and released. The butterflies (Colias eurytheme) were caught in one of two isolated alfalfa fields, which they inhabit, in southern California. Each catch was made during the same day at times when the butterflies were freely flying. Thus, it seemed reasonable to assume that the population was stable during a catch. The experimenter, Prof. Hovanitz, endeavoured to give each butterfly an equal chance of capture, walking in straight lines across the field and deviating in direction before reaching a boundary only when he noticed that a butterfly just caught tended to fly down his path. One check of the suitability of a mathematical model is to test the agreement of the experimental results with respect to the number of butterflies caught once, twice, etc., with those predicted from the model. I will return to this point at the end of the paper. Two mathematical models seem appropriate to serve as a basis for discussion of this estimation problem. It is of some interest to see that both lead to approximately the same estimates with little difference in their precision for large samples. It may be of more interest that for both models in which the population size is regarded as a parameter, though maximum likelihood estimates exist and agree substantially with moment estimates in all sixteen of the actual field experiments for which I have data, nevertheless with increasing sample size meaningful solutions of the likelihood equation do not exist.
- Published
- 1953
33. Bayesian estimation of latent roots and vectors with special reference to the bivariate normal distribution
- Author
-
George C. Tiao and Stephen Fienberg
- Subjects
Statistics and Probability ,Wishart distribution ,Inverse-chi-squared distribution ,Applied Mathematics ,General Mathematics ,Inverse-Wishart distribution ,Asymptotic distribution ,Multivariate normal distribution ,Agricultural and Biological Sciences (miscellaneous) ,Ratio distribution ,Statistics ,Applied mathematics ,Dirichlet-multinomial distribution ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Bayesian linear regression ,Mathematics - Abstract
SUMMARY This paper discusses from a Bayesian viewpoint some aspects of the estimation of latent roots and vectors of the covariance matrix of the bivariate normal distribution. The joint distribution of (i) the angle of the canonical transformation, and (ii) the ratio of the larger root to the total variance is considered in detail and illustrated by an example. Also discussed is the problem of making inferences about the larger roots, and several simple approximations to the distribution are considered. Finally, a generalization is given of one of the approximation methods to latent roots of higher dimensional covariance matrices. In this paper some aspects of the estimation of latent roots and vectors of the covariance matrix of the bivariate normal distribution are considered from a Bayesian viewpoint. In ? 2 the joint posterior distribution is obtained for (i) the angle 0 associated with the canonical transformation, and (ii) the ratio V of the larger root to the total variance, i.e. the sum of the two roots. Some properties of the joint distribution and the associated marginal distributions are discussed and a numerical example is given. Attention is then turned, in ? 3, to the problem of making inferences about the larger root and several simple approximations to its distribution are considered. In particular, it is demonstrated that the distribution can be closely approximated by a scaled inverted x2 distribution upon equating moments.
- Published
- 1969
34. TESTS OF SIGNIFICANCE FOR THE LATENT ROOTS OF COVARIANCE AND CORRELATION MATRICES
- Author
-
D. N. Lawley
- Subjects
Statistics and Probability ,Correlation ,Applied Mathematics ,General Mathematics ,Principal component analysis ,Statistics ,Covariance and correlation ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Object (computer science) ,Agricultural and Biological Sciences (miscellaneous) ,Sample mean and sample covariance ,Mathematics - Abstract
In a recent note Bartlett (1954) has summarized various approximate x2 tests (see also other papers referred to therein). Our object in this paper is to extend certain of these, listed as III a, b, c, which may be regarded as tests involving the latent roots of sample covariance and correlation matrices. We shall be interested in those cases where the effects of the k largest latent roots have been removed and where a hypothesis of equality of the remaining roots is made. Such hypotheses are made in principal components and factor analyses.
- Published
- 1956
35. ON THE MOMENTS OF ORDER STATISTICS IN SAMPLES FROM NORMAL POPULATIONS
- Author
-
H. Ruben
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Order statistic ,Bivariate analysis ,Function (mathematics) ,Agricultural and Biological Sciences (miscellaneous) ,Square (algebra) ,Quartile ,Sample size determination ,Statistics ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Statistic ,Mathematics ,L-moment - Abstract
The purpose of this paper is to show the geometrical significance of the moments of order statistics derived from normal populations. It appears that these, as well as the moment-generating function of the square of any order statistic, are intimately related to the contents of the members of a class of hyperspherical simplices. Further geometrical interpretations, together with the extension of the present results to bivariate moments and other properties, will be provided in subsequent publications. The problem of order statistics in normal populations has been extensively considered in the literature. For example,t Tippett (1925) gives the second, third and fourth moments of the extreme order statistics for a few sample sizes. Hojo (1931) examines the sampling variation of the median, quartiles and interquartile distance in samples from normal populations and computes a large number of integrals for this purpose. Cole (1951) produces a very simple recurrence relationship between the 'normalized' moments which enables the normalized moments for all samples of size no greater than n to be obtained, by successive differencing, from the 'normalized' moments of the extreme order statistics in samples of size m < n.4 Hastings, Mosteller, Tukey & Winsor (1947) give, among other results, the means, variances, covariances and correlations of order statistics in samples of ten or less from a normal population, some of the results being given to only two decimal places because of the extreme labour of the computation. Jones (1948) and Godwin (1949a,b) have obtained exact values for some of the lower moments. In particular, Godwin (1949a, p. 283) proceeds in a more systematic manner, and approaches most closely the essential idea upon which this paper is based. In general, it may be said that many statisticians have attacked the problem in a rather disjointed and fragmentary manner, usually from the small sample end, but have failed to develop a systematic attack which shall at the same time throw light on the interconnexion between the moments and enable the computation of the moments to become an economic proposition. It is believed that these conditions are met by this paper.
- Published
- 1954
36. Two expansions for the quadrivariate normal integral
- Author
-
J. A. Mcfadden
- Subjects
Statistics and Probability ,Series (mathematics) ,Applied Mathematics ,General Mathematics ,Zero (complex analysis) ,Agricultural and Biological Sciences (miscellaneous) ,Main diagonal ,Numerical integration ,Combinatorics ,Distribution (mathematics) ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Series expansion ,Random variable ,Mathematics ,Sign (mathematics) - Abstract
Let Xl, X2, X3 and X4 obey a quadrivariate niormal distribution with zero means, aind let P4 be the value of the quadrivariate normal integral, i.e. the probability that Xl, X2, X3 and X4 are simultaneously positive. The generalized tetrachoric series for P4, as given by Aitken (unpublished), Kendall (1941, 1945) (see also Kendall & Stuart, 1958, pp. 350-4), and Moran (1948), are not well suited for computation. For the case in which all six correlation coefficients are equal, the series has been sumlmed approximately by McFadden (1956). For special numerical values of the correlation matrix, exact results for P4 have been given by Schlafli (1858, 1860), Anis & Lloyd (1953), and Plackett (1954). Methods for numerical integration in more general cases have been given by Plackett (1954), Ihm (1959), and John (1959). Numerical methods for integration when all the correlation coefficients are equal have been provided by Ruben (1954) and Moran (1956). In this paper we present two series expansions for P4 which are well suited for computation. The first case occurs when X1, X2, X3 and X4 are successive measurements from a stationary Gaussian Markov process (with zero mean), or, equivalently, when the inverse of the correlation matrix has zero elements except on the main diagonal and immediately adjacent to it. The second case occurs when the correlation matrix itself has zero elements except on the main diagonal and adjacent to it. We shall then show that these two cases are related by a simple transformation. In a previous paper, McFadden (1955, eq. (39)) has expressed P4 in terms of the various product moments of the set of random variables Yi which assume the sign of the normal variables Xi. Let Y.=1 when X. O' ---1 when Xi< Oj (i= 1,2,3,4). (1)
- Published
- 1960
37. Inference about the change-point from cumulative sum tests
- Author
-
D. V. Hinkley
- Subjects
Statistics and Probability ,Sequence ,Applied Mathematics ,General Mathematics ,Inference ,Asymptotic distribution ,CUSUM ,Variance (accounting) ,Agricultural and Biological Sciences (miscellaneous) ,Statistics ,Point (geometry) ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Random variable ,Statistical hypothesis testing ,Mathematics - Abstract
SUMMARY The point of change in mean in a sequence of normal random variables can be estimated from a cumulative sum test scheme. The asymptotic distribution of this estimate and associated test statistics are derived and numerical results given. The relation to likelihood inference is emphasized. Asymptotic results are compared with empirical sequential results, and some practical implications are discussed. The cumulative sum scheme for detecting distributional change in a sequence of random variables is a well-known technique in quality control, dating from the paper of Page (1954) to the recent expository account by van Dobben de Bruyn (1968). Throughout the literature on cumulative sum schemes the emphasis is placed on tests of departure from initial conditions. The purpose of this paper is to examine a secondary aspect: estimation of the index T in a sequence {xt}, where the departure from initial conditions has taken place. The work is closely related to an earlier paper by Hinkley (1970), in which maximum likelihood estimation and inference were discussed. We consider specifically sequences of normal random variables x1, ..., xT, say, where initially the mean 00 and the variance o2 are known. A cumulative sum, cusum, scheme is used to detect possible change in mean from 00, and for simplicity suppose that it is a one-sided scheme for detecting decrease in mean. Then the procedure is to compute the cumulative sums t
- Published
- 1971
38. Closed sequential tests of an exponential parameter
- Author
-
David G. Hoel
- Subjects
Statistics and Probability ,Class (set theory) ,Exponential distribution ,Applied Mathematics ,General Mathematics ,Sequential test ,Type (model theory) ,Random walk ,Agricultural and Biological Sciences (miscellaneous) ,Exponential function ,Test (assessment) ,Sequential probability ratio test ,Applied mathematics ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Algorithm ,Mathematics - Abstract
SUMMARY This paper is concerned with the application of several closed sequential procedures to the testing of an exponential parameter. In particular, the procedures of Armitage (1957), Schneiderman & Armitage (1962a,b) and Anderson (1960) are considered. In certain of these cases either the properties or bounds on the properties of the sequential test can be obtained without approximations. There are certain situations in which the risk of a long sequential experiment is unacceptable. Some common examples are to be found in the area of medical trials and in various types of industrial experimentation. Such situations have brought about the study of alternatives to the sequential probability ratio test (SPRT) in which a procedure's boundaries are closed. One class of such tests is the 'restricted' procedures of Armitage (1957) and the 'wedge' plans of Schneiderman & Armitage (1962a, b). Another test which also belongs to this category is a modification of the SPRT as described by Anderson (1960) and by Donnelly in his unpublished Ph.D. thesis. These tests, which are concerned with normal means, require an approximation of the random walk by a diffusion process for an evaluation of their properties. In this paper we shall apply the above type regions to the testing of the parameter from an exponential distribution and show that in certain cases the properties of the test can be obtained without approximations.
- Published
- 1968
39. THE EXACT VALUE OF THE MOMENTS OF THE DISTRIBUTION OF x2 USED AS A TEST OF GOODNESS OF FIT, WHEN EXPECTATIONS ARE SMALL
- Author
-
J. B. S. Haldane
- Subjects
Statistics and Probability ,education.field_of_study ,Distribution (number theory) ,Applied Mathematics ,General Mathematics ,Sample (material) ,Population ,Degrees of freedom (statistics) ,Agricultural and Biological Sciences (miscellaneous) ,Combinatorics ,Minimum distance estimation ,Goodness of fit ,Cramér–von Mises criterion ,Statistics ,Multinomial theorem ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,education ,Mathematics - Abstract
IN genetical practice we are constantly presented with large numbers of small samples from populations consisting of several well-defined classes. For example in the mouse we can readily obtain hundreds of litters containing anything from one up to about twelve members. Their totals may agree satisfactorily with expectation on a Mendelian basis, for example J coloured, i white, or 9 grey, H black, i white. But we desire to know whether the individual litters can be regarded as random samples from such a population. In addition the problem of homogeneity may arise. That is to say the population as a whole may not conform to any particular expectation. But we may desire to know whether the litters can be regarded as random samples of the population given by the totals. It has long been known that when the numbers expected in any observation are small, the distribution of x2 departs from that given by Pearson (1900). The mean appears sometimes, but not always, to be equal to the number of degrees of freedom. But the variance is no longer exactly equal to twice that number. Exact expressions for it in certain cases have been given by Pearson (1932) and Cochran (1936). These are based on an ingenious application of the theory of multiple contingency by Pearson. It will be shown in this paper that the first few moments can often be calculated by entirely elementary methods involving nothing more advanced than the multinomial theorem. In an accompanying paper (Griineberg and Haldane, 1937) they will be applied to actual data on mice. We first study the distribution of x2 in a n-fold table with n 1 degrees of freedom, then in a (m x n)-fold table with m (n-i) degrees of freedom. For genetical work we are particularly interested in the (n x 2)-fold table with n degrees of freedom. As a limiting case of the 2-fold table with 1 degree of freedom we derive the moments of the variance of samples from a Poisson series, and thence the distribution of x2 in a n-fold table with n degrees of freedom. The important case of the (m x n)-fold table with (m 1) (n 1) degrees of freedom remains to be investigated. Consider a sample of 8 individuals falling into n classes. Let the expected and observed numbers in these classes be
- Published
- 1937
40. ON THE USE OF STUDENT'S t-TEST IN AN ASYMMETRICAL POPULATION
- Author
-
Ghurye Sg
- Subjects
Statistics and Probability ,education.field_of_study ,Applied Mathematics ,General Mathematics ,media_common.quotation_subject ,Population ,Agricultural and Biological Sciences (miscellaneous) ,Normal distribution ,Standard normal deviate ,Sample size determination ,Statistics ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,education ,Cumulant ,Normality ,Student's t-test ,media_common ,Mathematics ,Statistical hypothesis testing - Abstract
On account of the unique property of samples from a normal population that the ratio n+1 (x-Fb)/V(n + 1)/s (where jt is the population mean, x = x1/(n + 1) and ns2 =Z(Xi= 1 is the ratio of a normal deviate to a stochastically independent estimate of its variance, Student's t-test is a suitable test of significance for the mean of a normal population. However, in a variety of cases, it is necessary to test for the mean of a population which does not follow the Gaussian law. Efforts have, therefore, been made to see how far Student's distribution may be used for the purpose in non-normal populations. Due, mainly, to the analytical difficulties of the problem, no extensive theoretical discussion has yet been given. Thus, Pearson & Adyanthaya (1929), Rietz (1939) and Nair (1941) have given experimental treatments, while the theoretical discussions of some others (Rider, 1929; Perlo, 1933; Laderman, 1939) have dealt only with trivially small sample sizes. The papers by Bartlett (1935) and Geary (1936, 1947) give results true for any sample size, though they are based on certain assumptions and approximations. The present paper deals with the population considered by Geary in his 1936 paper, subject to the same approximations. The second contribution by Geary (in which is derived the t-distribution in samples from a population which departs more from normality than that considered in the 1936 paper) came to my notice too late to be made use of in the present work; but it is proposed to consider it later on. Geary (1936) has obtained the distribution of the ratio (x -It) (n + 1)/s in the case of an asymmetrical population, whose fourth and higher cumulants are zero, by neglecting squares and higher powers of the third cumulant. We know from this how far the probability of an error of the first kind (i.e. the probability of rejecting the null hypothesis when it is true) in such a population differs from that for a normal distribution, provided we may neglect the square of the standardized third cumulant yl. Here again, on account of analytical difficulties, it is not possible, except for very small sample sizes, to consider the effect of terms containing higher powers of yl. However, we can assume the result derived by Geary to be correct for very small values of yl, as also for large sample sizes-but in such cases the deviation from values of the normal theory is practically negligible. Even then, it is of interest to know whether, in using the usual tables of the t-test (based on the normal distribution), we are committing the greater error in the probability of an error of the first kind or in that of an error of the second kind. In the present paper are derived the values of the probability of an error of the second kind (and hence, of the power of the test) when the usual t-tables are used to define the critical region. It may be mentioned here that this problem is only a special case of a general investigation, on which the writer is engaged, into the effect, on statistical tests, of differences between the actual and the assumed distribution laws of the universe sampled. The solution of these problems is hampered by analytical difficulties in the derivation of the probability laws (and particularly of power functions), and the present case is one of the few in which a mathematical, though only approximate, solution has been found possible.
- Published
- 1949
41. On the distribution of range of samples from nonnormal populations
- Author
-
C. Singh
- Subjects
Statistics and Probability ,education.field_of_study ,Applied Mathematics ,General Mathematics ,Population ,Pearson distribution ,Sample (statistics) ,Edgeworth series ,Nonparametric skew ,Agricultural and Biological Sciences (miscellaneous) ,symbols.namesake ,Skewness ,Statistics ,Econometrics ,symbols ,Range (statistics) ,Kurtosis ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,education ,Mathematics - Abstract
SUMMARY The probability integral for the distribution of range of a sample from a population whose distribution can be represented by the first few terms of an Edgeworth series has been obtained in this paper. The numerical values of the corrective functions arising due to nonnormality are tabulated. The new theoretical results are compared with the earlier results, where available. The distribution of range of samples from nonnormal populations was first studied empirically by Pearson & Adyanthdya (1928) and later, among others, by Pearson (1950), Cox (1954) and David (1954). These studies have been limited mainly to the mean range and to the probability integral in some simple nonnormal cases, and from these likely effects of nonnormality on the distribution of range have been conjectured. Singh (1967) obtained some theoretical results regarding the expectation and the variance of range of samples from a population whose distribution can be represented by the first few terms of an Edgeworth series. These results provided some additional information regarding the effects of parental excess and skewness on the mean and variance of the range. In the present paper the probability integral for the distribution of range of samples from the same type of population has been obtained and evaluated for small samples to examine the effects of parental excess and skewness.
- Published
- 1970
42. ON CORRECTIONS FOR THE MOMENT-COEFFICIENTS OF FREQUENCY DISTRIBUTIONS WHEN THERE ARE INFINITE ORDINATES AT ONE OR BOTH OF THE TERMINALS OF THE RANGE
- Author
-
Gertrude E. Pearse
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Mathematical analysis ,Agricultural and Biological Sciences (miscellaneous) ,Moment (mathematics) ,Ordinate ,Statistics ,Range (statistics) ,Statistics, Probability and Uncertainty ,Frequency distribution ,General Agricultural and Biological Sciences ,Karl pearson ,Mathematics - Abstract
(1) A PAPER* by Eleanor Pairmian anld Karl Pearson, F.R.S., was published in 1918 on "Corrections for the Moment-Coefficients of Limited Range Frequency Distributions when there are Finite or Iinfinite Ordinates and any Slopes at the Terminals of the Range." In the second part of that paper, dealing with cases of asymptotic frequency, the auxiliarv curve selected to give the first five frequencies was
- Published
- 1928
43. PAIRED COMPARISON DESIGNS FOR TESTING CONCORDANCE BETWEEN JUDGES
- Author
-
Raj Chandra Bose
- Subjects
Statistics and Probability ,Applied Mathematics ,General Mathematics ,Concordance ,Paired comparison ,Method of analysis ,Agricultural and Biological Sciences (miscellaneous) ,Degree (music) ,Object (philosophy) ,Technical report ,Statistics, Probability and Uncertainty ,Arithmetic ,General Agricultural and Biological Sciences ,Mathematics - Abstract
In a recent paper, 'Further contributions to the theory of paired comparisons', M. G. Kendall (1955) considers paired comparison designs, in which each pair of judges have certain comparisons in common. Such designs should prove useful for testing concordance between judges. He notes that designs of an optimum kind which balance by numbers of comparisons, objects compared, numbers of observers on given comparisons and so forth are rather rare. It is the object of this paper to obtain some paired comparison designs which have a high degree of symmetry. These designs have been defined in ?2, and certain inequalities between the parameters are obtained in ? 3. In ?? 4 and 5, two special classes of these designs have been investigated, and explicit designs for small values of n (the number of objects to be compared) have been given in Tables 1 and 2. The method of analysis would, to a certain extent, depend on what use the experimenter wants to make of the design. This question will be considered in a future communication. My thanks are due to Professor Kendall for suggesting the problem, and for helpful discussion during the preparation of the paper.
- Published
- 1956
44. EFFECT OF NON-NORMALITY ON THE POWER FUNCTION OF t-TEST
- Author
-
A. B. L. Srivastava
- Subjects
Statistics and Probability ,education.field_of_study ,Applied Mathematics ,General Mathematics ,Population ,Edgeworth series ,Agricultural and Biological Sciences (miscellaneous) ,Sample size determination ,Skewness ,Joint probability distribution ,Statistics ,Kurtosis ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,education ,Cumulant ,Mathematics ,Type I and type II errors - Abstract
Student's t-statistic provides a suitable test of significance for the mean when the sample comes from a normal population. The power of the normal theory test has been studied by Neyman (1935); Neyman & Tokarska (1936) and Johnson & Welch (1939). As in many cases the samples appear to belong to populations other than the Gaussian, it is necessary to see how far the normal theory test can be assumed to be valid in controlling the Type I and Type II errors of inference on non-normal samples. The effect of non-normality on the Type I error of Student's t-test was studied experimentally by Pearson & Adyanthaya (1929) and theoretically by Bartlett (1935), Geary (1936, 1947) and Gayen (1949). Assuming the parent population to be specified by the first two terms of the Edgeworth series, Geary (1936) obtained the approximate distribution of t for any sample size, and later this work was extended by Gayen (1949) by including in the distribution the effects of parental kurtosis A4 = fl2-3 and A2 = IA. Apart from the pioneer empirical study by Pearson & Adyanthaya (1929, pp. 276-80), the effect of non-normality on the Type II error (and hence on the power) of the t-test was first studied by Ghurye (1949). He has, however, considered only the effect of skewness of the population, for he started with the joint distribution of the mean and the variance for populations specified by the first two terms of the Edgeworth series (Geary, 1936). In this paper, it has been possible to study the effects of kurtosis and skewness of the parent population which may be assumed to cover a larger range of nonnormality. Gayen's (1949) formulae for the joint distribution of the sample mean and variance for the first four terms of the Edgeworth population have been utilized for the derivation of the corrective terms of the power function. The mnethod followed by Ghurye for the evaluation of the corrective term of the power function due to A3 appears to be satisfactory for derivation of the effects due to higher odd-order cumulants. But for those of the even-order cumulants his method does not appear to be useful, as Ghurye himself encountered some 'analytic difficulties'. In this paper, by a different approach it has been possible to evaluate integrals involved in the power function due to A4 and A3. The non-normal population considered here is supposed to be characterized by non-zero values of the standardized third and fourth cumulants. Since the effects of the higher-order terms depending on A5,6, A3A4, A2, ... are assumed to be negligible, the population covered is only moderately non-normal. Too high values of A3 and A4 can also not be permitted as they will make f(x) negative at one or both tails, and will give rise to subsidiary modes. To ensure a positive definite, unimodal frequency function, A4 should lie roughly between 0 and 2-4 and A3 02 (Barton & Dennis, 1952).t Also it is found possible in this paper to calculate in the non-normal case the critical region
- Published
- 1958
45. TESTING FOR SERIAL CORRELATION IN LEAST SQUARES REGRESSION. II
- Author
-
G. S. Watson and James Durbin
- Subjects
Statistics and Probability ,Polynomial regression ,Durbin–Watson statistic ,Applied Mathematics ,General Mathematics ,Autocorrelation ,Regression analysis ,Agricultural and Biological Sciences (miscellaneous) ,Breusch–Godfrey test ,Statistics ,Econometrics ,Statistics, Probability and Uncertainty ,Time series ,General Agricultural and Biological Sciences ,Independence (probability theory) ,Statistical hypothesis testing ,Mathematics - Abstract
A great deal of use has undoubtedly been made of least squares regression methods in circumstances in which they are known to be inapplicable. In particular, they have often been employed for the analysis of time series and similar data in which successive observations are serially correlated. The resulting complications are well known and have recently been studied from the standpoint of the econometrician by Cochrane & Orcutt (1949). A basic assumption underlying the application of the least squares method is that the error terms in the regression model are independent. When this assumption—among others—is satisfied the procedure is valid whether or not the observations themselves are serially correlated. The problem of testing the errors for independence forms the subject of this paper and its successor. The present paper deals mainly with the theory on which the test is based, while the second paper describes the test procedures in detail and gives tables of bounds to the significance points of the test criterion adopted. We shall not be concerned in either paper with the question of what should be done if the test gives an unfavourable result.
- Published
- 1951
46. Exact first- and second-order moments of estimates of components of covariance
- Author
-
G. M. Tallis and Charles A. Rohde
- Subjects
Statistics and Probability ,Analysis of covariance ,Covariance function ,Covariance matrix ,Applied Mathematics ,General Mathematics ,Estimator ,Covariance ,Random effects model ,Agricultural and Biological Sciences (miscellaneous) ,Matrix multiplication ,Estimation of covariance matrices ,Statistics ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Mathematics - Abstract
SUMMARY This paper develops formulae for the first two moments of estimates of covariance for the general multivariate 'one-way' and 'two-way' models. The results are used to obtain the large sample dispersion matrix for estimated coefficients of two types of genetic selection indexes. These dispersion matrices provide the necessary extension of known results in balanced models to the unbalanced case. The problems of estimation associated with variance and covariance component analysis with unbalanced data have been of major concern to the statistical geneticist. This is primarily because most of the selection procedures applied to livestock require a knowledge of genetic variances and covariances which usually have to be estimated from the analyses of hierarchical models. Invariably, there is a marked lack of balance in the data thus rendering standard formulae for the variances of the estimates inapplicable. Serious consideration to these problems has been given by Henderson (1953), Searle (1956) and Hartley & Rao (1967). Searle gave particular attention to the one-way analysis of variance and covariance and used matrix methods to calculate the moments of the various estimators. Other work in this area concerns analysis of variance models of varying complexity; see Searle (1958, 1961), Mahamunulu (1963) and Blischke (1966). It is the purpose of the present paper to extend and complement existing results. With the advent of high speed computers, matrix operations can be handled with great speed and hence formulae for expectations and covariances of sums of squares and products can be left in a general computable form. Thus, explicit algebraic evaluation of each case is, in most cases, not only time-consuming but unnecessary. We consider here the general one-way and two-way analysis of covariance model with fixed and random effects. The number of variables included in the analysis is assumed to be arbitrary and this seems to lead to somewhat involved notation and algebra. However, general results are required in order to solve a number of practical problems. We give two examples from statistical genetics. In the theory of animal breeding interest centres around certain phenotypic and genetic parameters. Suppose that k characters of a particular breed of animal are relevant from the point of view of a selection programme. Then we let P and G be the phenotypic and additive genetic covariance matrices for the k characters and we consider two types of selection index which are based on these matrices.
- Published
- 1969
47. AN ANALYSIS OF PAIRED COMPARISON DESIGNS WITH INCOMPLETE REPETITIONS
- Author
-
John W. Wilkinson
- Subjects
Statistics and Probability ,Test procedures ,Applied Mathematics ,General Mathematics ,Paired comparison ,Observer (special relativity) ,Agricultural and Biological Sciences (miscellaneous) ,Handwriting ,Technical report ,Pairwise comparison ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Algorithm ,Mathematics - Abstract
Experiments involving paired comparisons have mainly concerned the situation where each judge compares all possible pairs of treatments or objects. In certain types of experiments, this may require an excessive number of comparisons to be made by any observer. To overcome this handicap, Bose (1956) and Kendall (1955) constructed certain designs, symmetrical with respect to objects and judges, which do not require each judge to compare all possible pairs of objects. However, neither Bose nor Kendall proposed any procedure of analysis in their respective papers. The purpose of this paper is to consider the problem of analysis of the Bose-Kendall paired comparison designs. This analysis is carried out in the tradition of the fundamental Bradley-Terry (1952) paper concerning the situation where all possible pairs are compared by each judge. Using the Bradley-Terry mathematical model, likelihood ratio tests are constructed in detail for certain classes of hypotheses, and are stated for some additional situations of interest. To exemplify the test procedures, the proposed analysis is applied to an experiment involving pairwise comparison of handwriting specimens.
- Published
- 1957
48. A multivariate analogue of the one-sided test
- Author
-
Akio Kudo
- Subjects
Statistics and Probability ,Multivariate statistics ,education.field_of_study ,Covariance matrix ,Applied Mathematics ,General Mathematics ,Population ,Univariate ,Multivariate normal distribution ,Agricultural and Biological Sciences (miscellaneous) ,Test (assessment) ,Multivariate analysis of variance ,One sided ,Statistics ,Econometrics ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,education ,Mathematics - Abstract
In this paper we consider the following problem. Given a multivariate normal population with known variance matrix, what test is appropriate to determine whether the means are slipped to the right? In the case when the population is univariate normal, this problem can be solved by the ordinary one-sided test using either the normal or the t-distribution functions. It is the purpose of this paper to develop what may be termed a multivariate analogue of the one-sided test of significance.
- Published
- 1963
49. Studies in the History of Probability and Statistics. XXVII
- Author
-
Carl Erik Särndal
- Subjects
Statistics and Probability ,Observational error ,Laplace transform ,Applied Mathematics ,General Mathematics ,Gauss ,Probability and statistics ,Agricultural and Biological Sciences (miscellaneous) ,Least squares ,Statistical inference ,Calculus ,Statistics, Probability and Uncertainty ,Statistical theory ,General Agricultural and Biological Sciences ,Mathematics ,Central limit theorem - Abstract
SUMMARY This paper follows the development of an aspect of the Theory of Errors known as the Hypothesis of Elementary Errors. This development produced a scientific tradition in statistical inference theory concerned with the causes and effects of nonnormality of statistical data, a tradition sometimes called the Scandinavian school. Its founder was the Swedish astronomer C. V. L. Charlier. Being concerned mainly with the period of Charlier and his followers, this paper first outlines briefly some of the nineteenth-century thinking that formed the essential background for the Scandinavian school. The principal contributions of Charlier and his followers are discussed. Their ideas are compared with simultaneous developments in England, starting from the time of Karl Pearson. 1. SOME NINETEENTH-CENTURY DEVELOPMENTS ON THE CONTINENT The contribution by Gauss that attached his name to the normal law of error was his famous 'first proof' (1809) of the principle of least squares. Thus the normal law of error was called the Gaussian law of error. Being concerned with astronomical measurements, Gauss's problem was that of the combination and adjustment of observations. The essence of his 'first proof' was the derivation of the normal law of errors through the Principle of the Arithmetic Mean, from which the Principle of Least Squares immediately followed. On the other hand, Laplace's (1810, 1812) slightly later derivation of the normal law of errors, quite different from Gauss's, essentially consisted in the first demonstration of the Central Limit Theorem. The analytical technique of derivation of the normal law of error used by Laplace, which included extensive use of characteristic functions, became an important tool in Charlier's work on nonnormality. Hence Laplace's work on the normal law, and its subsequent clarification by Poisson (1837), constitutes the point of departure for the trend of thought to be followed in this paper. While the analytical techniques stemmed from Laplace, the leading theoretical concept in Charlier's work consisted in proliferations of Hagen's (1837) original formulation of the Hypothesis of Elementary Error. Hagen's simple form of the hypothesis stated that the observational error is the algebraic sum of an infinite number of elementary errors, all having the same absolute value and the same probability of being positive as negative. The normal law of error resulted from considering the general term in the expansion of
- Published
- 1971
50. THE MEAN AND COEFFICIENT OF VARIATION OF RANGE IN SMALL SAMPLES FROM NON-NORMAL POPULATIONS
- Author
-
David Cox
- Subjects
Statistics and Probability ,education.field_of_study ,Applied Mathematics ,General Mathematics ,Coefficient of variation ,Population ,Sampling (statistics) ,Agricultural and Biological Sciences (miscellaneous) ,Upper and lower bounds ,Standard deviation ,Statistics ,Kurtosis ,Range (statistics) ,Statistical dispersion ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,education ,Mathematics - Abstract
Since Tippett (1925) tabulated the mean range of random samples from a normal population, the range has been extensively used for the rapid estimation of dispersion. Moreover, much work has been done recently on quick significance tests in which the root-mean-square estimate of dispersion in the t-test, analysis of variance, etc., is replaced by an estimate derived from the range. All these uses of the range rest on the assumption of normality,, and so it is of interest to examine the distribution of range from non-normal populations. This was first done by E. S. Pearson & Adyanthaya (1929), and their work, and later work by Shone (1949), has been summarized, discussed and extended by Pearson (1950). The conclusion was that in small samples the ratio of mean range to populIation standard deviation is not much affected by the form of the population, but that the coefficient of variation of range depends fairly critically on the population. In large samnples the distribution of range is, of course, determined bv the tails of the population and so is very sensitive to nonnorlnality. The main object of the present paper is to predict the mean and coefficient of variation of range in small random samples of n (n < 5) fromn a population of given skewness and kurtosis and then to show how these results can be uised to assess the effect of non-normality on the comiimon applications of the range. Such a prediction can only be approximate because there is no functional relation between the distribution of range and population skewness and kurtosis. There are four ways of proceeding: (i) by evaltuating numerically the single and double integrals for the mean and mean square of range for a representative selection of non-normal populations; (ii) by sampling experiments; (iii) by a derivation of upper and lower limnits for the miiean andl coefficient of variation of range of populations with given properties. This was first done by Plackett (1947), who obtained an upper bound to the ratio of mean range to population standard deviation in samples of given size from an arbitrary population. An imiportant extension of Plackett's work (Hartley & David, 1954) appeared as the present paper was being completed
- Published
- 1954
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.