378 results on '"Network sampling"'
Search Results
202. Network Sampling Algorithms and Applications
- Author
-
Michael Drew LaMar and Rex K. Kincaid
- Subjects
Network sampling ,Computer science ,Data mining ,computer.software_genre ,computer - Published
- 2014
- Full Text
- View/download PDF
203. Probability sampling methods for hard-to-sample populations
- Author
-
Graham Kalton
- Subjects
Panel survey ,Nonprobability sampling ,Snowball sampling ,Network sampling ,Elderly persons ,Statistics ,Sampling error ,Psychology ,Probability sampling ,Horvitz–Thompson estimator ,Demography - Published
- 2014
- Full Text
- View/download PDF
204. Representing the populations: what general social surveys can learn from surveys among specific groups
- Author
-
Ineke Stoop
- Subjects
Gerontology ,Network sampling ,business.industry ,media_common.quotation_subject ,Immigration ,Ingroups and outgroups ,European Social Survey ,General Social Survey ,Elderly persons ,Medicine ,Non-response bias ,Population Register ,business ,Demography ,media_common - Published
- 2014
- Full Text
- View/download PDF
205. Link-tracing and respondent-driven sampling
- Author
-
Steve Thompson
- Subjects
Nonprobability sampling ,Snowball sampling ,Network sampling ,Respondent ,Statistics ,Sampling (statistics) ,Tracing ,Link (knot theory) ,Horvitz–Thompson estimator ,Mathematics - Published
- 2014
- Full Text
- View/download PDF
206. Indirect sampling for hard-to-reach populations
- Author
-
Pierre Lavallée
- Subjects
Nonprobability sampling ,Snowball sampling ,Network sampling ,Statistics ,Sampling (statistics) ,Target population ,Mathematics - Published
- 2014
- Full Text
- View/download PDF
207. Plea for Network Sampling
- Author
-
Arijit Chaudhuri
- Subjects
Plea ,Network sampling ,Operations research ,Computer science - Published
- 2014
- Full Text
- View/download PDF
208. Life stories of young women who experience rejection from their mothers
- Author
-
Marie Poggenpoel, Chris Myburgh, and Selina C. Mosman
- Subjects
lcsh:RT1-120 ,Adult ,Daughter ,Network sampling ,lcsh:Nursing ,Adolescent ,media_common.quotation_subject ,Mental Disorders ,General Medicine ,Child, Abandoned ,Mother-Child Relations ,Developmental psychology ,Interviews as Topic ,South Africa ,Women's Health Services ,Young Adult ,Humans ,Female ,Young adult ,Psychology ,Everyday life ,Social psychology ,media_common ,Original Research - Abstract
Background: When a daughter perceives rejection from her mother, she is bound to be sensitive to rejection for most if not all of her life. Such an experience influences almost all future relationships. Objectives: The purpose of this research was to explore and describe the life stories of young women who perceived rejection from their mothers and to formulate guidelines to assist them. Method: A phenomenological interpretive method that is explorative, descriptive, and contextual was used to explore everyday life experiences. Network sampling was used. In-depth phenomenological interviews were conducted with the young women so that they could define the most important dimensions of their life stories and elaborate on what is relevant to them. They were asked: ‘Tell me your life story.’ One of the authors also had a life story of perceived maternal rejection; hence an auto-ethnography was critical and was included in the study. Thematic data analysis was applied. Results: Themes that emerged from the data were that the young women: (1) perceive ongoing challenges in forming and sustaining relationships in their lives; (2) experience their lives as conflicted because their relationship with the central core of their existence, their mother, is perceived as tumultuous; and (3) experience fundamental links to be missing in their ‘motherdaughter relationship’. Conclusion: Only a few women were interviewed regarding perceived rejection from their mothers. Further research in this regard is imperative.
- Published
- 2014
209. Analyse en identification partielle de la décision d'émigrer des étudiants africains
- Author
-
Méango, Natoua Romuald and Henry, Marc
- Subjects
Modèles structurels incomplets ,Student Mobility ,Incomplete structural models ,Technique d’échantillonnage de réseau ,Network sampling ,Mobilité étudiante - Abstract
La migration internationale d’étudiants est un investissement couteux pour les familles dans beaucoup de pays en voie de développement. Cependant, cet investissement est susceptible de générer des bénéfices financiers et sociaux relativement importants aux investisseurs, tout autant que des externalités pour d’autres membres de la famille. Cette thèse s’intéresse à deux aspects importants de la migration des étudiants internationaux : (i) Qui part? Quels sont les déterminants de la probabilité de migration? (ii) Qui paie? Comment la famille s’organise-t-elle pour couvrir les frais de la migration? (iii) Qui y gagne? Ce flux migratoire est-il au bénéfice du pays d’origine? Entreprendre une telle étude met le chercheur en face de défis importants, notamment, l’absence de données complètes et fiables; la dispersion géographique des étudiants migrants en étant la cause première. La première contribution importante de ce travail est le développement d’une méthode de sondage en « boule de neige » pour des populations difficiles à atteindre, ainsi que d’estimateurs corrigeant les possibles biais de sélection. A partir de cette méthodologie, j’ai collecté des données incluant simultanément des étudiants migrants et non-migrants du Cameroun en utilisant une plateforme internet. Un second défi relativement bien documenté est la présence d’endogénéité du choix d’éducation. Nous tirons avantage des récents développements théoriques dans le traitement des problèmes d’identification dans les modèles de choix discrets pour résoudre cette difficulté, tout en conservant la simplicité des hypothèses nécessaires. Ce travail constitue l’une des premières applications de cette méthodologie à des questions de développement. Le premier chapitre de la thèse étudie la décision prise par la famille d’investir dans la migration étudiante. Il propose un modèle structurel empirique de choix discret qui reflète à la fois le rendement brut de la migration et la contrainte budgétaire liée au problème de choix des agents. Nos résultats démontrent que le choix du niveau final d’éducation, les résultats académiques et l’aide de la famille sont des déterminants importants de la probabilité d’émigrer, au contraire du genre qui ne semble pas affecter très significativement la décision familiale. Le second chapitre s’efforce de comprendre comment les agents décident de leur participation à la décision de migration et comment la famille partage les profits et décourage le phénomène de « passagers clandestins ». D’autres résultats dans la littérature sur l’identification partielle nous permettent de considérer des comportements stratégiques au sein de l’unité familiale. Les premières estimations suggèrent que le modèle « unitaire », où un agent représentatif maximise l’utilité familiale ne convient qu’aux familles composées des parents et de l’enfant. Les aidants extérieurs subissent un cout strictement positif pour leur participation, ce qui décourage leur implication. Les obligations familiales et sociales semblent expliquer les cas de participation d’un aidant, mieux qu’un possible altruisme de ces derniers. Finalement, le troisième chapitre présente le cadre théorique plus général dans lequel s’imbriquent les modèles développés dans les précédents chapitres. Les méthodes d’identification et d’inférence présentées sont spécialisées aux jeux finis avec information complète. Avec mes co-auteurs, nous proposons notamment une procédure combinatoire pour une implémentation efficace du bootstrap aux fins d’inférences dans les modèles cités ci-dessus. Nous en faisons une application sur les déterminants du choix familial de soins à long terme pour des parents âgés., International migration of students is a costly investment for family units in many developing countries. However, it might yield substantial financial and social return for the investors, as well as externalities for other family members. Furthermore, when these family decisions aggregate at the country-level, they affect the stock of human capital available to the origin country. This thesis addresses primarily two aspects of international student migration: (i) Who goes? What are the determinants of the probability of migration? (ii) Who pays? How does the family organize to bear the cost of the migration? Engaging in this study, one faces the challenge of data limitation, a direct consequence of the geographical dispersion of the population of interest. The first important contribution of this work is to provide a new snowball sampling methodology for hard-to-reach population, along with estimators to correct selection-biases. I collected data which include both migrant and non-migrant students from Cameroon, using an online-platform. A second challenge is the well-documented problem of endogeneity of the educational attainment. I take advantage of recent advances in the treatment of identification problems in discrete choice models to solve this issue while keeping assumptions at a low level. In particular, validity of the partial identification methodology does not rest on the existence of an instrument. To the best of my knowledge, this is the first empirical application of this methodology to development related issues. The first chapter studies the decision made by a family to invest in student. I propose an empirical structural decision model which reflects the importance of both the return of the investment and the budgetary constraint in agent choices. Our results show that the choice of level of education, the help of the family and academic results in secondary school are significant determinant of the probability to migrate, unlike the gender which does not seem to play any role in the family decision. The objective of the second chapter is to understand how agents decide to be part of the migration project and how the family organizes itself to share profits and discourage free riding-behavior. Further results on partial identification for games of incomplete information allow us to consider strategic behavior of family. My estimation suggests that models with a representative individual suit only families which consist of parent and child, but are rejected when a significant extended family member is introduced. Helpers incur a non-zero cost of participation that discourages involvement in the migration process. Kinship obligations and not altruism appears as the main reason of participation. Finally, the third chapter presents the more general theoretical framework in which my models are imbedded. The method presented is specialized to infinite games of complete information, but is of interest for application to the empirical analysis of instrumental variable models of discrete choice (Chapter 1), cooperative and non-cooperative games (Chapter 2), as well as revealed preference analysis. With my co-authors, we propose an efficient combinatorial bootstrap procedure for inference in games of complete information that runs in linear computing time and an application to the determinants of long term elderly care choices.
- Published
- 2014
210. Empirical investigation about statistical properties of abundance estimates based on line-intercept and network sampling of tracks
- Author
-
Fattorini, Lorenzo and Marchesselli, Marzia
- Published
- 2002
- Full Text
- View/download PDF
211. Population Size Estimation Using Multiple Respondent-Driven Sampling Surveys.
- Author
-
Kim BJ and Handcock MS
- Abstract
Respondent-driven sampling (RDS) is commonly used to study hard-to-reach populations since traditional methods are unable to efficiently survey members due to the typically highly stigmatized nature of the population. The number of people in these populations is of primary global health and demographic interest and is usually hard to estimate. However, due to the nature of RDS, current methods of population size estimation are insufficient. We introduce a new method of estimating population size that uses concepts from capture-recapture methods while modeling RDS as a successive sampling process. We assess its statistical validity using information from the CDC's National HIV Behavioral Surveillance system in 2009 and 2012., (© The Author(s) 2019. Published by Oxford University Press on behalf of the American Association for Public Opinion Research. All rights reserved. For permissions, please email: journals.permissions@oup.com.)
- Published
- 2019
- Full Text
- View/download PDF
212. Triad count estimation in digraphs
- Author
-
Martin Karlberg
- Subjects
Algebra and Number Theory ,Sociology and Political Science ,Network sampling ,Statistics ,Estimator ,Sampling (statistics) ,Observable ,Statistical analysis ,Social Sciences (miscellaneous) ,Graph ,Stratified sampling ,Mathematics ,Vertex (geometry) - Abstract
One may use information about a random sample of network members to estimate quantities related to the triad census of a network. Various kinds of information about the graph may be observable from the sample; four observation schemes involving the local networks of the sampled vertices are considered here. Unbiased triad count estimators are defined, and their variances (and unbiased estimators of these variances) are derived. A main result is that under one of the observation schemes, the estimator can be written as a sum of vertex attributes; standard estimation formulas for various sampling designs, such as stratified sampling, are therefore effortlessly applied. The estimator properties are compared in a simulation study.
- Published
- 1998
- Full Text
- View/download PDF
213. Sampling in the twenty-first century
- Author
-
Sudman, Seymour and Blair, Edward
- Published
- 1999
- Full Text
- View/download PDF
214. Social relations in a chronic care hospital: a whole network study of patients, family and employees
- Author
-
John P. Hirdes and Kathy A Scott
- Subjects
Chronic care ,medicine.medical_specialty ,Sociology and Political Science ,Network sampling ,Isolation (health care) ,Institutionalisation ,Hospital setting ,General Social Sciences ,Social relation ,Interpersonal ties ,Anthropology ,Family medicine ,medicine ,Hospital patients ,Psychiatry ,Psychology ,General Psychology - Abstract
Institutionalization is associated with the loss of informal social ties because isolation in the community increases the risk of admission and/or admission to a facility increases the risk that ties will be severed. The literature on social relationships with institutions has been limited by the use of biased samples, simplistic measures and exclusion of patients unable to respond to verbal or written questionnaires. The methodological aspects of a whole network study of chronic care hospital patients are discussed with a particular emphasis on network sampling and non-response bias. The substantive aspects of the present analyses focus in the different patterns of relationships of patients, family and employees in a hospital setting.
- Published
- 1998
- Full Text
- View/download PDF
215. International Student Migration: A Partial Identification Analysis
- Author
-
Méango, Romuald
- Subjects
incomplete structural models ,ddc:330 ,I25 ,C13 ,J61 ,C25 ,network sampling ,D85 ,student migration ,partial identification - Abstract
This paper studies the decision made by a family to invest in student migration. We propose an empirical structural decision model which reflects the importance of both the return to the investment and the budgetary constraint in the choice of the family. We circumvent the problem of endogeneity of the educational attainment by deriving sharp bounds and conduct inference for the parameters of interest. The data are collected on students from Cameroon, using a new snowball sampling procedure, which allow the inclusion of both migrants and non-migrants in the sample. We propose bias corrected estimators for this procedure. We study the characteristics of potential candidates to migration that increase or decrease their probability to migrate, accounting for a potential helper in the diaspora. Among the interesting results we find that a choice to complete a Master's degree doubles the odds of migration, there is little evidence of gender preference, students migrants are positively selected on their previous academic results.
- Published
- 2014
216. Impact of sampling design in estimation of graph characteristics
- Author
-
Emrah Cem, Mehmet Engin Tozal, and Kamil Sarac
- Subjects
Network sampling ,Computer science ,business.industry ,Sampling (statistics) ,Graph theory ,Machine learning ,computer.software_genre ,Graph ,Application domain ,Graph sampling ,Sampling design ,Sampling process ,Artificial intelligence ,business ,computer - Abstract
Studying structural and functional characteristics of large scale graphs (or networks) has been a challenging task due to the related computational overhead. Hence, most studies consult to sampling to gather necessary information to estimate various features of these big networks. On the other hand, using a best effort approach to graph sampling within the constraints of an application domain may not always produce accurate estimates. In fact, the mismatch between the characteristics of interest and the utilized network sampling methodology may result in incorrect inferences about the studied characteristics of the underlying system. In this study we empirically investigate the sources of information loss in a sampling process; identify the fundamental factors that need to be carefully considered in a sampling design; and use several synthetic and real world graphs to elaborately demonstrate the mismatch between the sampling design and graph characteristics of interest.
- Published
- 2013
- Full Text
- View/download PDF
217. Estimating network degree distributions under sampling: An inverse problem, with applications to monitoring social media networks
- Author
-
Eric D. Kolaczyk, Yaonan Zhang, and Bruce D. Spencer
- Subjects
FOS: Computer and information sciences ,Statistics and Probability ,constrained penalized weighted least squares ,Degree (graph theory) ,Computer science ,Monte Carlo method ,Sampling (statistics) ,Estimator ,Network ,Inverse problem ,degree distribution ,Degree distribution ,Methodology (stat.ME) ,Matrix (mathematics) ,Modeling and Simulation ,inverse problem ,network sampling ,Statistics, Probability and Uncertainty ,Algorithm ,Statistics - Methodology ,Network model - Abstract
Networks are a popular tool for representing elements in a system and their interconnectedness. Many observed networks can be viewed as only samples of some true underlying network. Such is frequently the case, for example, in the monitoring and study of massive, online social networks. We study the problem of how to estimate the degree distribution - an object of fundamental interest - of a true underlying network from its sampled network. In particular, we show that this problem can be formulated as an inverse problem. Playing a key role in this formulation is a matrix relating the expectation of our sampled degree distribution to the true underlying degree distribution. Under many network sampling designs, this matrix can be defined entirely in terms of the design and is found to be ill-conditioned. As a result, our inverse problem frequently is ill-posed. Accordingly, we offer a constrained, penalized weighted least-squares approach to solving this problem. A Monte Carlo variant of Stein's unbiased risk estimation (SURE) is used to select the penalization parameter. We explore the behavior of our resulting estimator of network degree distribution in simulation, using a variety of combinations of network models and sampling regimes. In addition, we demonstrate the ability of our method to accurately reconstruct the degree distributions of various sub-communities within online social networks corresponding to Friendster, Orkut and LiveJournal. Overall, our results show that the true degree distributions from both homogeneous and inhomogeneous networks can be recovered with substantially greater accuracy than reflected in the empirical degree distribution resulting from the original sampling., Published at http://dx.doi.org/10.1214/14-AOAS800 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)
- Published
- 2013
218. Field tests for a regional mercury deposition network—sampling design and preliminary test results
- Author
-
Nicolas S. Bloom, Stephen Vermette, and Steven E. Lindberg
- Subjects
MERCURE ,Atmospheric Science ,Network sampling ,Air pollution ,chemistry.chemical_element ,Mineralogy ,Field tests ,Atmospheric sciences ,medicine.disease_cause ,Mercury (element) ,Deposition (aerosol physics) ,chemistry ,medicine ,Environmental science ,Cold vapour atomic fluorescence spectroscopy ,General Environmental Science ,Mercury deposition - Abstract
A number of actions have been undertaken within the National Atmospheric Deposition Program (NADP) to implement a regional mercury deposition network. This paper describes a field test designed to evaluate a collector design and protocol for implementation within a new Hg network. The collector chosen for evaluation is a “dual-orifice” collector, designed to sample precipitation for mercury and other metals simultaneously. The method chosen for Hg analysis was cold vapour atomic fluorescence spectroscopy (CVAFS). The weekly precipitation Hg concentrations range between 4.29 and 17.88 ng l−1, with a volume-weighted mean of 10 ng l−1 comparable to those reported in other ongoing studies in North America and Europe. Calculated deposition flux ranges from 43 to 358 ng m−2 week−1, with a mean of 186 ng m−2 week−1.
- Published
- 1995
- Full Text
- View/download PDF
219. Network Sampling and Link-Tracing Designs
- Author
-
Steven K. Thompson
- Subjects
Network sampling ,Computer science ,Real-time computing ,Tracing ,Link (knot theory) - Published
- 2012
- Full Text
- View/download PDF
220. Estimating network structure via random sampling: Cognitive social structures and the adaptive threshold method
- Author
-
Michael D. Siciliano, C. Deniz Yenigün, and Gunes Ertan
- Subjects
Structure (mathematical logic) ,Dynamic network analysis ,Data collection ,Sociology and Political Science ,Social network ,business.industry ,Computer science ,Cross-network research ,General Social Sciences ,Sample (statistics) ,Cognition ,Organizational network analysis ,computer.software_genre ,Machine learning ,Social networks ,Anthropology ,Cognitive social structures ,Data mining ,Artificial intelligence ,business ,computer ,Network sampling ,General Psychology ,Social structure - Abstract
This paper introduces and tests a novel methodology for measuring networks. Rather than collecting data to observe a network or several networks in full, which is typically costly or impossible, we randomly sample a portion of individuals in the network and estimate the network based on the sampled individuals' perceptions on all possible ties. We find the methodology produces accurate estimates of social structure and network level indices in five different datasets. In order to illustrate the performance of our approach we compare its results with the traditional roster and ego network methods of data collection. Across all five datasets, our methodology outperforms these standard social network data collection methods. We offer ideas on applications of our methodology, and find it especially promising in cross-network settings. © 2012 Elsevier B.V.
- Published
- 2012
221. Assessing network scale-up estimates for groups most at risk of HIV/AIDS: evidence from a multiple-method study of heavy drug users in Curitiba, Brazil
- Author
-
Matthew J. Salganik, Alexandre Hannud Abdo, Neilane Bertoni, Maeve Brito de Mello, Francisco Inácio Bastos, and Dimitri Fazito
- Subjects
Gerontology ,medicine.medical_specialty ,social networks ,Epidemiology ,Practice of Epidemiology ,epidemiologic methods ,MEDLINE ,Developing country ,HIV Infections ,Disease ,Risk Assessment ,03 medical and health sciences ,0302 clinical medicine ,Acquired immunodeficiency syndrome (AIDS) ,population size estimation ,Statistics ,medicine ,Prevalence ,Humans ,030212 general & internal medicine ,Substance Abuse, Intravenous ,Estimation ,Acquired Immunodeficiency Syndrome ,030505 public health ,biology ,business.industry ,Curitiba ,HIV ,medicine.disease ,biology.organism_classification ,3. Good health ,Epidemiologic Research Design ,network sampling ,0305 other medical science ,Risk assessment ,business ,Brazil - Abstract
One of the many challenges hindering the global response to the human immunodeficiency virus (HIV)/acquired immunodeficiency syndrome (AIDS) epidemic is the difficulty of collecting reliable information about the populations most at risk for the disease. Thus, the authors empirically assessed a promising new method for estimating the sizes of most at-risk populations: the network scale-up method. Using 4 different data sources, 2 of which were from other researchers, the authors produced 5 estimates of the number of heavy drug users in Curitiba, Brazil. The authors found that the network scale-up and generalized network scale-up estimators produced estimates 5-10 times higher than estimates made using standard methods (the multiplier method and the direct estimation method using data from 2004 and 2010). Given that equally plausible methods produced such a wide range of results, the authors recommend that additional studies be undertaken to compare estimates based on the scale-up method with those made using other methods. If scale-up-based methods routinely produce higher estimates, this would suggest that scale-up-based methods are inappropriate for populations most at risk of HIV/AIDS or that standard methods may tend to underestimate the sizes of these populations.
- Published
- 2011
222. Business Surveys as a Network Sample
- Author
-
Ayah E. Johnson
- Subjects
Network sampling ,Computer science ,Sampling design ,Econometrics ,Sampling (statistics) ,Sampling error ,Cluster sampling ,Sample (statistics) ,Lot quality assurance sampling ,Marketing ,Stratified sampling - Published
- 2011
- Full Text
- View/download PDF
223. Benefits of Bias: Towards Better Characterization of Network Sampling
- Author
-
Arun S. Maiya and Tanya Y. Berger-Wolf
- Subjects
Social and Information Networks (cs.SI) ,FOS: Computer and information sciences ,Physics - Physics and Society ,Network sampling ,Computer science ,business.industry ,FOS: Physical sciences ,Sampling (statistics) ,Sample (statistics) ,Computer Science - Social and Information Networks ,Physics and Society (physics.soc-ph) ,Complex network ,Machine learning ,computer.software_genre ,Representativeness heuristic ,Characterization (materials science) ,Market research ,H.2.8 ,Artificial intelligence ,business ,Social network analysis ,computer - Abstract
From social networks to P2P systems, network sampling arises in many settings. We present a detailed study on the nature of biases in network sampling strategies to shed light on how best to sample from networks. We investigate connections between specific biases and various measures of structural representativeness. We show that certain biases are, in fact, beneficial for many applications, as they "push" the sampling process towards inclusion of desired properties. Finally, we describe how these sampling biases can be exploited in several, real-world applications including disease outbreak detection and market research., 9 pages; KDD 2011: 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
- Published
- 2011
224. Consistency under sampling of exponential random graph models
- Author
-
Alessandro Rinaldo and Cosma Rohilla Shalizi
- Subjects
Statistics and Probability ,Class (set theory) ,Theoretical computer science ,network models ,projective family ,91D30 ,Mathematics - Statistics Theory ,Exponential family ,Statistics Theory (math.ST) ,01 natural sciences ,Constructive ,Article ,62M09 ,010104 statistics & probability ,Development (topology) ,0504 sociology ,Consistency (statistics) ,sufficient statistics ,Exponential random graph models ,FOS: Mathematics ,0101 mathematics ,exponential random graph model ,62B05 ,Mathematics ,Probability ,Stochastic process ,05 social sciences ,Statistics ,050401 social sciences methods ,Statistical model ,independent increments ,62M99 ,network sampling ,Statistics, Probability and Uncertainty ,60G51 - Abstract
The growing availability of network data and of scientific interest in distributed systems has led to the rapid development of statistical models of network structure. Typically, however, these are models for the entire network, while the data consists only of a sampled sub-network. Parameters for the whole network, which is what is of interest, are estimated by applying the model to the sub-network. This assumes that the model is consistent under sampling, or, in terms of the theory of stochastic processes, that it defines a projective family. Focusing on the popular class of exponential random graph models (ERGMs), we show that this apparently trivial condition is in fact violated by many popular and scientifically appealing models, and that satisfying it drastically limits ERGM's expressive power. These results are actually special cases of more general results about exponential families of dependent random variables, which we also prove. Using such results, we offer easily checked conditions for the consistency of maximum likelihood estimation in ERGMs, and discuss some possible constructive responses., Comment: Published in at http://dx.doi.org/10.1214/12-AOS1044 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)
- Published
- 2011
- Full Text
- View/download PDF
225. Estimation with Inadequate Frames
- Author
-
Arijit Chaudhuri
- Subjects
Estimation ,Adaptive sampling ,Network sampling ,Computer science ,Data mining ,computer.software_genre ,computer - Published
- 2010
- Full Text
- View/download PDF
226. Estimating point centrality using different network sampling techniques
- Author
-
Joseph Galaskiewicz
- Subjects
Estimation ,Sociology and Political Science ,Point (typography) ,Network sampling ,Computer science ,General Social Sciences ,Sampling (statistics) ,Network size ,Nonprobability sampling ,Anthropology ,Statistics ,Econometrics ,Centrality ,General Psychology - Abstract
The paper outlines the methodological choices that analysts must make when sampling social networks and assesses the impact of different sampling techniques on the estimation of network parameters. Using data from Galaskiewicz's (1919) study of Towertown and River City, the results show the extent to which sampling percentage, the number of trials/estimates, sampling procedure, and network size and density affect the ability of researchers to estimate the point centrality of organizations in networks of information and money transactions.
- Published
- 1991
- Full Text
- View/download PDF
227. Assessing Respondent Driven Sampling for Network Studies in Ethnographic Contexts
- Author
-
Dombrowski, Kirk, Khan, Bilal, Moses, Joshua, Channell, Emily, Misshula, Evan, Dombrowski, Kirk, Khan, Bilal, Moses, Joshua, Channell, Emily, and Misshula, Evan
- Abstract
Respondent Driven Sampling (RDS) is generally considered a methodology for recruiting “hard-to-reach” populations for social science research. More recently, Wejnert has argued that RDS analysis can be used for general social network analysis as well (where he labels it, RDS-SN). In this article, we assess the value of Wejnert’s RDS-SN for use in more traditional ethnographic contexts. We employed RDS as part of a larger social network research project to recruit n = 330 community residents (over 17 years of age) in Nain, a predominantly (92%) aboriginal community in northern Labrador, Canada, for social network interviews about food sharing, housing, public health, and community traditions. The peer referral chains resulted in a sample that was then analyzed for its representativeness by two means—a comparison with the Statistics Canada 2006 Census of the same community, and with house-by-house demographic sur- veys carried out in the community as part of our research. The results show a close fit with available community statistics and our own survey. As such, we argue that the RDS sampling used in Nain was able to provide a useful and near-representative sample of the community. To demonstrate the usefulness of the results, the referral chains are also analyzed here for patterns in intragroup and intergroup relation- ships that were apparent only in the aggregate.
- Published
- 2013
228. Structural Effects of Network Sampling Coverage I: Nodes Missing at Random
- Author
-
Smith, Jeffrey A, Moody, James, Smith, Jeffrey A, and Moody, James
- Abstract
Network measures assume a census of a well-bounded population. This level of coverage is rarely achieved in practice, however, and we have only limited information on the robustness of network measures to incomplete coverage. This paper examines the effect of node-level missingness on 4 classes of network measures: centrality, centralization, topology and homophily across a diverse sample of 12 empirical networks. We use a Monte Carlo simulation process to generate data with known levels of missingness and compare the resulting network scores to their known starting values. As with past studies (Borgatti et al., 2006; Kossinets, 2006), we find that measurement bias generally increases with more missing data. The exact rate and nature of this increase, however, varies systematically across network measures. For example, betweenness and Bonacich centralization are quite sensitive to missing data while closeness and in-degree are robust. Similarly, while the tau statistic and distance are difficult to capture with missing data, transitivity shows little bias even with very high levels of missingness. The results are also clearly dependent on the features of the network. Larger, more centralized networks are generally more robust to missing data, but this is especially true for centrality and centralization measures. More cohesive networks are robust to missing data when measuring topological features but not when measuring centralization. Overall, the results suggest that missing data may have quite large or quite small effects on network measurement, depending on the type of network and the question being posed.
- Published
- 2013
229. Macrostructure from Microstructure: Generating Whole Systems from Ego Networks
- Author
-
Smith, Jeffrey A and Smith, Jeffrey A
- Abstract
This paper presents a new simulation method to make global network inference from sampled data. The proposed simulation method takes sampled ego network data and uses Exponential Random Graph Models (ERGM) to reconstruct the features of the true, unknown network. After describing the method, the paper presents two validity checks of the approach: the first uses the 20 largest Add Health networks while the second uses the Sociology Coauthorship network in the 1990’s. For each test, I take random ego network samples from the known networks and use my method to make global network inference. I find that my method successfully reproduces the properties of the networks, such as distance and main component size. The results also suggest that simpler, baseline models provide considerably worse estimates for most network properties. I end the paper by discussing the bounds/limitations of ego network sampling. I also discuss possible extensions to the proposed approach.
- Published
- 2012
230. Using network sampling in crime victimization surveys
- Author
-
Ronald Czaja and Johnny Blair
- Subjects
education.field_of_study ,Network sampling ,Victimology ,Population ,Sampling (statistics) ,Pathology and Forensic Medicine ,Test (assessment) ,Telephone survey ,Statistics ,Rare events ,Statistical analysis ,education ,Psychology ,Law - Abstract
Since crime victimizations are statistically rare events, surveys to estimate rates of victimization are difficult and expensive. In this paper, we examine the advantages of network sampling over traditional methods for conducting crime victimization surveys. Network sampling links population households in specified ways, for reporting purposes, in order to increase the probabilities of locating households with particular characteristics. We conducted a reverse record check field experiment to test whether a telephone survey using network sampling is feasible to collect crime victimization data. Three types of crimes-burglary, robbery, and assault-were tested along with two types of networks-relatives and co-workers/close friends. This paper examines the extent to which victims report their victimization experiences in a general crime and victimization interview and the extent to which a randomly selected relative or close friend will report the same victimization incident in an identical interview. A number of multiplicity counting rules are compared in terms of reporting errors and a mean square error analysis.
- Published
- 1990
- Full Text
- View/download PDF
231. Sample scale-free gene regulatory network using gene ontology
- Author
-
Peter E. Larsen, Eyad Almasri, Yang Dai, and Guanrao Chen
- Subjects
Network sampling ,Property (programming) ,Genes, Fungal ,Gene regulatory network ,Sample (statistics) ,Saccharomyces cerevisiae ,Biology ,computer.software_genre ,Set (abstract data type) ,Gene Expression Regulation, Fungal ,Humans ,Computer Simulation ,Gene Regulatory Networks ,Gene ,Oligonucleotide Array Sequence Analysis ,Models, Statistical ,Models, Genetic ,Gene ontology ,Scale (chemistry) ,Gene Expression Profiling ,Computational Biology ,Reproducibility of Results ,Data mining ,computer ,Algorithms ,Software - Abstract
� Abstract— Currently there are various approaches to the reconstruction of gene regulatory networks from different sources of data. However, none of these methods incorporates explicitly scale-free property, one of the most important features of the targeted network, into their algorithms. In this paper, several network sampling strategies are explored on a set assembled from previous published gene interactions in yeast, expecting to reconstruct regulatory networks that are scale-free.
- Published
- 2007
232. A new learning automata-based sampling algorithm for social networks
- Author
-
Alireza Rezvanian and Mohammad Reza Meybodi
- Subjects
Social network ,Learning automata ,Network sampling ,Computer Networks and Communications ,Computer science ,business.industry ,Sampling (statistics) ,020206 networking & telecommunications ,02 engineering and technology ,Degree distribution ,Machine learning ,computer.software_genre ,symbols.namesake ,Approximation error ,0202 electrical engineering, electronic engineering, information engineering ,symbols ,020201 artificial intelligence & image processing ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,Social network analysis ,Algorithm ,computer ,Gibbs sampling - Abstract
Summary Recently, studying social networks plays a significant role in many applications of social network analysis, from the studying the characterization of network to that of financial applications. Due to the large data and privacy issues of social network services, there is only a limited local access to the whole network data in a reasonable amount of time. Therefore, network sampling arises to studying the characterization of real networks such as communication, technological, information, and social networks. In this paper, a sampling algorithm for complex social networks that is based on a new version of distributed learning automata (DLA) reported recently called extended DLA (eDLA) is proposed. For evaluation purpose, the eDLA-based sampling algorithm has been tested on several test networks and the obtained experimental results are compared with the results obtained for a number of well-known sampling algorithms in terms of relative error and Kolmogorov–Smirnov test. It is shown that eDLA-based sampling algorithm outperforms the existing sampling algorithms. Experimental results further show that the eDLA-based sampling algorithm in comparison with the DLA-based sampling algorithm has a 26.93% improvement for the average of Kolmogorov–Smirnov value for degree distribution taken over all test networks. Copyright © 2015 John Wiley & Sons, Ltd.
- Published
- 2015
- Full Text
- View/download PDF
233. Methodological Problems with Transformation and Size Reduction of Data Sets in Network Analysis
- Author
-
Marschall, Nicolas
- Subjects
Methodologie [gnd] ,Methodenkritik ,Soziales Netzwerk [gnd] ,Methodology ,ddc:300 ,Politisches Netzwerk [gnd] ,Network Sampling ,+[gnd]%22">Netzwerkanalyse ,[gnd] Network Analysis ,Social Network Analysis ,Validität [gnd] ,Validity - Abstract
This thesis is a methodological study in the field of social network analysis. It seeks to investigate how certain factors can interfere with the processes of data collection and data analysis, and therefore lead to invalid or unreliable results for network-analytical measures. The discussion is focused on one-mode whole-network designs with data collection by the way of questionnaires. It begins with a short introduction into the methods of network analysis and then discusses the literature on the field of the validity of network analysis in general. Afterwards the possible influencing factors investigated in this study are discussed and the analysis is described.In particular, nonresponse (biased and unbiased), forgetting, attempts of sampling, the omission of unimportant actors, symmetrization, dichotomization, and collapsing actors are investigated. These processes and methods are simulated by comparing the results of network-analytical measures calculated with an unchanged data set to the results of measures calculated with other variants of the same data set in which these processes have been simulated. The network-analytical measures tested are density, degree centralization, eigenvector centralization, the determination of cliques and k-plexes, degree centrality, closeness centrality, eigenvector centrality, and betweenness centrality. Density, centralization and the extent of size reduction are expected to be the main influencing factors for validity and reliability. All combinations of size reduction or transformation processes and network-analytical measures are simulated using a total of seven matrices representing different densities and centralizations. In some cases, different extents of size reduction and different strategies of dealing with the problem are investigated as well.The study comes to the conclusion that size reduction and transformation processes can significantly change the results of an analysis. In most cases, the error introduced into the network-analytical measures is biased in one direction, most often negative. The deviations of the estimates from the real values depend on the extent of size reduction. Density and centralization are also influencing factors in many cases; however, the direction of this influence can change. Certain network-analytical measures like closeness centrality and the determination of subgroups are especially vulnerable. Certain size-reduction and transformation processes are more dangerous than others. These results are presented in detail at the end of the thesis.
- Published
- 2006
234. Redes y conjuntos de acción : para aplicaciones estratégicas en los tiempos de la complejidad social
- Author
-
Tomás R. Villasante and Pedro Martín Gutiérrez
- Subjects
Conjunts d'acció ,media_common.quotation_subject ,Social Sciences ,Affinity relationships ,lcsh:Social Sciences ,Set (abstract data type) ,Sociograms – Action set – Affinity relationships – Sampling ,Reflexivity ,Sociology ,lcsh:Social sciences (General) ,Network Sampling ,Sociograms ,Sampling ,media_common ,H1-99 ,Sociogram ,Relacions d'afinitat ,Participative Planning ,Communication ,Sampling (statistics) ,Mostratge ,Muestreo ,Epistemology ,Social sciences (General) ,lcsh:H ,Conjuntos de acción ,Action (philosophy) ,Relaciones de afinidad ,Action set ,Participative Methodologies ,Sociogrames ,lcsh:H1-99 ,Ideology ,Sociogramas ,Social Sciences (miscellaneous) ,Network Analysis - Abstract
In this paper we propose a participative and reflexive approach to social maps. Several social and ideological categories may be applied in the design of sociograms. The concept of “action set” is presented, as well as its consequences in theoretical terms and for sampling purposes., En este trabajo se propone una metodología participativa y reflexiva de desarrollo de mapas sociales. Se apuesta por incorporar en el sociograma, de modo flexible, el grado de afinidad de las relaciones junto a otros ejes sociales e ideológicos. Se presenta el concepto de conjuntos de acción, y se discuten las implicaciones de este enfoque, tanto en lo que hace referencia al muestreo como a aspectos teóricos más generales.
- Published
- 2006
235. Inaccessible and Sensitive Data
- Author
-
Vic Barnett
- Subjects
Network sampling ,Ranked set sampling ,Computer science ,Data mining ,computer.software_genre ,computer ,Remote sensing - Published
- 2005
- Full Text
- View/download PDF
236. Establishment Surveys With Population Survey–Generated Sampling Frames
- Author
-
Iris M. Shimizu and Monroe G. Sirken
- Subjects
Network sampling ,Population sample ,Computer science ,Frame (networking) ,Statistics ,Volume (computing) ,Econometrics ,Estimator ,Sampling (statistics) ,Sampling frame ,Mathematics ,Population survey - Abstract
When stand-alone sampling frames list all establishments and size measures, the Hansen–Hurwitz (HH) pps estimator is generally used to estimate the volume of transactions between establishments and households. The network sampling (NS) version of the HH estimator depends on a population-survey-generated establishment frame, which lists establishments that have transactions with households in a population sample survey and the number of transactions that establishments have with survey households. The NS estimator is a competitor of the HH estimator whenever stand-alone frames of good quality are unavailable and, as this article indicates, the NS estimator may be competitive with the HH estimator when stand-alone and population-survey-generated frames are flawless. Keywords: establishment surveys; sampling frames; network sampling; Hansen–Hurwitz estimator
- Published
- 2005
- Full Text
- View/download PDF
237. Effectively Visualizing Large Networks Through Sampling
- Author
-
Curial, Stephen and Rafiei, Davood
- Subjects
business.industry ,Computer science ,Relational database ,Sampling (statistics) ,Graph theory ,Sample (statistics) ,Searching a network ,computer.software_genre ,Data visualization ,Proof of concept ,Scalability ,Visualizing the Web ,The Internet ,Large network visualization ,Data mining ,Communication Networks ,business ,Network sampling ,computer - Abstract
Technical report TR05-08. We study the problem of visualizing large networks and develop techniques for effectively abstracting a network and reducing the size to a level that can be clearly viewed. Our size reduction techniques are based on sampling, where only a sample instead of the full network is visualized. We propose a randomized notion of ``focus'' that specifies a part of the network and the degree to which it needs to be magnified. Visualizing a sample allows our method overcome the scalability issues inherent in traditional visualization methods. We report some characteristics that frequently occur in large networks and the conditions under which they are preserved when sampling from a network. This can be useful in selecting a proper sampling scheme that yields a sample with similar characteristics as the original network. Our method is built on top of a relational database, thus it can be easily and efficiently implemented using any off-the-shelf database software. As a proof of concept, we implement our methods within a system called ALVIN and report some of our experiments over the movie database and the connectivity graph of the Web with 178 million nodes and over 800 million edges. | TRID-ID TR05-08
- Published
- 2005
- Full Text
- View/download PDF
238. A SIMULATION-BASED FRAMEWORK FOR ASSESSING THE FEASIBILITY OF RESPONDENT-DRIVEN SAMPLING FOR ESTIMATING CHARACTERISTICS IN POPULATIONS OF LESBIAN, GAY AND BISEXUAL OLDER ADULTS.
- Author
-
Griffin M, Gile KJ, Fredricksen-Goldsen KI, Handcock MS, and Erosheva EA
- Abstract
Respondent-driven sampling (RDS) is a method for sampling from a target population by leveraging social connections. RDS is invaluable to the study of hard-to-reach populations. However, RDS is costly and can be infeasible. RDS is infeasible when RDS point estimators have small effective sample sizes (large design effects) or when RDS interval estimators have large confidence intervals relative to estimates obtained in previous studies or poor coverage. As a result, researchers need tools to assess whether or not estimation of certain characteristics of interest for specific populations is feasible in advance. In this paper, we develop a simulation-based framework for using pilot data-in the form of a convenience sample of aggregated, egocentric data and estimates of subpopulation sizes within the target population-to assess whether or not RDS is feasible for estimating characteristics of a target population. in doing so, we assume that more is known about egos than alters in the pilot data, which is often the case with aggregated, egocentric data in practice. We build on existing methods for estimating the structure of social networks from aggregated, egocentric sample data and estimates of subpopulation sizes within the target population. We apply this framework to assess the feasibility of estimating the proportion male, proportion bisexual, proportion depressed and proportion infected with HIV/AIDS within three spatially distinct target populations of older lesbian, gay and bisexual adults using pilot data from the caring and Aging with Pride Study and the Gallup Daily Tracking Survey. We conclude that using an RDS sample of 300 subjects is infeasible for estimating the proportion male, but feasible for estimating the proportion bisexual, proportion depressed and proportion infected with HIV/AIDS in all three target populations.
- Published
- 2018
- Full Text
- View/download PDF
239. Redes y conjuntos de acción: para aplicaciones estratégicas en los tiempos de la complejidad social
- Author
-
Villasante, Tomás R., Martín Gutiérrez, Pedro, Villasante, Tomás R., and Martín Gutiérrez, Pedro
- Abstract
In this article social network analysis (SNA) and some of its techniques are used within the context of a different theoretical proposal from those that are normally applied. We use SNA in participative methodologies, and we try to go beyond the limits of the “how the research is” –merely cognitive– in order to study in a transforming way, “for whom” and “for what” of knowledge. Therefore we do not have a debate about which technique is more accurate in terms of measure, but if it is strategically operative in the transformation of social networks in self-organised and synergic processes. Speech complexity, sociograms, or action set analysis, are less formalized than those in other studies, more precise and descriptive. However, its usefulness –within the processes mentioned above– has to do with the social agents involved, who take part and own the technique, and reflect about their reality, in terms of systems of relationships. Thus, they are able to have conscious strategies en order to transform their own networks. In sum, they show themselves as auto-organized systems.
- Published
- 2007
240. Ebb and flow when navigating adolescence: predictors of daily hassles among African-American adolescent girls
- Author
-
Amy Young, Eileen K Kintner, Barbara J. Guthrie, and Carol J. Boyd
- Subjects
Adult ,Network sampling ,Adolescent ,Psychology, Adolescent ,Ethnic group ,Pediatrics ,Developmental psychology ,Education ,Health problems ,Risk Factors ,Humans ,Child ,Socioeconomic status ,African american ,Analysis of Variance ,United States ,Pediatric Nursing ,Black or African American ,Search terms ,Cross-Sectional Studies ,Socioeconomic Factors ,Regression Analysis ,Female ,Family Relations ,Psychology ,Prejudice ,Stress, Psychological ,Clinical psychology - Abstract
ISSUES AND PURPOSE. To examine the nature of daily hassles as perceived by African-American adolescent females. DESIGN AND METHODS. As part of a larger, cross-sectional study, nonrandom network sampling technique was usedto survey 178 adolescent girls between the ages of 11 and 19. RESULTS. This study found that the most common hassles were school and academic, followed by family and economic hassles, peer and social hassles, and personal safety hassles. Socioeconomic factors were strongly associated with the level of hassles reported. PRACTICE IMPLICATIONS. Assess African-American girls' perception of daily hassles, specifically school- and family- related hassles, and also examine the interrelationship between the type of hassles and health problems. Search terms: Adolescence, blacks, daily hassles, ethnic groups, females.
- Published
- 2003
241. Career paths beyond nursing and the contribution of nursing experience and skills in attaining these positions
- Author
-
Helen Franks and Christine Duffield
- Subjects
Network sampling ,business.industry ,Nursing research ,MEDLINE ,Exploratory research ,Nursing ,Interviews as Topic ,Market research ,Career Mobility ,Team nursing ,Workforce ,Medicine ,Nurse education ,Clinical Competence ,New South Wales ,business ,General Nursing - Abstract
This paper reports on an exploratory study undertaken in New South Wales, Australia which sought to identify the positions nurses go on to when they leave nursing and the skills and experience they gained from nursing which they believe enabled them to obtain employment outside the profession. In addition, the reasons why they left nursing were also ascertained. A network sampling technique was used to recruit 17 participants. A tape-recorded semi-structured interview of approximately 1 h was conducted with each participant. Interviews were conducted until no new information emerged (14) and the remaining three interviews were used for validation. While many participants were employed in health-related fields, others were in diverse areas such as business, landscape coordination and market research, to name a few. All participants reported positively on the range of skills they had acquired as a nurse. Reasons provided for leaving the nursing workforce included reaching a ceiling in nursing or wishing to develop themselves in another direction. Respondents had also undertaken a wide range of additional qualifications. © 2002 Elsevier Science Ltd. All rights reserved.
- Published
- 2002
242. Muestreo de conglomerados con multiplicidad: estimación del total en poblaciones raras
- Author
-
Torres Saavedra, Pedro Alejo and Ospina Botero, David
- Subjects
Eficiencia ,Multiplicity ,Multiplicity estimator ,Multiplicidad ,Efficiency ,Household survey ,Muestreo de conglomerados ,Scarce population ,Network sampling ,Población escasa ,Cluster sampling ,Encuesta de hogares - Abstract
El muestreo de conglomerados con multiplicidad utiliza el concepto de regla de conteo de multiplicidad para incluir en la encuesta individuos no pertenecientes a la muestra. Se derivan las fórmulas para la ganancia de eficiencia y para los componentes de varianza de dos estimadores de multiplicidad del total poblacional. Mediante simulación se ilustra la magnitud de la ganancia de eficiencia con el muestreo con multiplicidad en función del tamaño de muestra y la amplitud de la regla de conteo. Cluster sampling with multiplicity uses the concept of counting rule of multiplicity in order to include in the survey those individuals not belonging to the sample. Additionally, some results to improve the efficiency and for the variance components from multiplicity estimators of the population total are derived. Finally, a simulation exercise to illustrate the magnitude of the gain in the efficiency with the sampling with multiplicity in function of the sample size and the extent of the counting rule is presented.
- Published
- 2001
243. Inference on network statistics by restricting to the network space: applications to sexual history data.
- Author
-
Goyal R and De Gruttola V
- Subjects
- Bayes Theorem, Bias, Botswana epidemiology, Chicago, Computer Simulation, Educational Status, Female, HIV Infections epidemiology, HIV Infections prevention & control, HIV Infections transmission, Humans, Likelihood Functions, Male, Models, Statistical, Surveys and Questionnaires, Biostatistics methods, Sexual Behavior, Sexual Partners
- Abstract
Analysis of sexual history data intended to describe sexual networks presents many challenges arising from the fact that most surveys collect information on only a very small fraction of the population of interest. In addition, partners are rarely identified and responses are subject to reporting biases. Typically, each network statistic of interest, such as mean number of sexual partners for men or women, is estimated independently of other network statistics. There is, however, a complex relationship among networks statistics; and knowledge of these relationships can aid in addressing concerns mentioned earlier. We develop a novel method that constrains a posterior predictive distribution of a collection of network statistics in order to leverage the relationships among network statistics in making inference about network properties of interest. The method ensures that inference on network properties is compatible with an actual network. Through extensive simulation studies, we also demonstrate that use of this method can improve estimates in settings where there is uncertainty that arises both from sampling and from systematic reporting bias compared with currently available approaches to estimation. To illustrate the method, we apply it to estimate network statistics using data from the Chicago Health and Social Life Survey. Copyright © 2017 John Wiley & Sons, Ltd., (Copyright © 2017 John Wiley & Sons, Ltd.)
- Published
- 2018
- Full Text
- View/download PDF
244. Identification of Homophily and Preferential Recruitment in Respondent-Driven Sampling.
- Author
-
Crawford FW, Aronow PM, Zeng L, and Li J
- Subjects
- Bias, Connecticut, Drug Users statistics & numerical data, Female, Humans, Male, Sampling Studies, Patient Selection, Social Networking, Surveys and Questionnaires standards
- Abstract
Respondent-driven sampling (RDS) is a link-tracing procedure used in epidemiologic research on hidden or hard-to-reach populations in which subjects recruit others via their social networks. Estimates from RDS studies may have poor statistical properties due to statistical dependence in sampled subjects' traits. Two distinct mechanisms account for dependence in an RDS study: homophily, the tendency for individuals to share social ties with others exhibiting similar characteristics, and preferential recruitment, in which recruiters do not recruit uniformly at random from their network alters. The different effects of network homophily and preferential recruitment in RDS studies have been a source of confusion and controversy in methodological and empirical research in epidemiology. In this work, we gave formal definitions of homophily and preferential recruitment and showed that neither is identified in typical RDS studies. We derived nonparametric identification regions for homophily and preferential recruitment and showed that these parameters were not identified unless the network took a degenerate form. The results indicated that claims of homophily or recruitment bias measured from empirical RDS studies may not be credible. We applied our identification results to a study involving both a network census and RDS on a population of injection drug users in Hartford, Connecticut (2012-2013)., (© The Author(s) 2017. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.)
- Published
- 2018
- Full Text
- View/download PDF
245. A comparison of network sampling designs for a hidden population of drug users: Random walk vs. respondent-driven sampling.
- Author
-
Bell DC, Erbaugh EB, Serrano T, Dayton-Shotts CA, and Montoya ID
- Abstract
Both random walk and respondent-driven sampling (RDS) exploit social networks and may reduce biases introduced by earlier methods for sampling from hidden populations. Although RDS has become much more widely used by social researchers than random walk (RW), there has been little discussion of the tradeoffs in choosing RDS over RW. This paper compares experiences of implementing RW and RDS to recruit drug users to a network-based study in Houston, Texas. Both recruitment methods were implemented over comparable periods of time, with the same population, by the same research staff. RDS methods recruited more participants with less strain on staff. However, participants recruited through RW were more forthcoming than RDS participants in helping to recruit members of their social networks. Findings indicate that, dependent upon study goals, researchers' choice of design may influence participant recruitment, participant commitment, and impact on staff, factors that may in turn affect overall study success., (Copyright © 2016 Elsevier Inc. All rights reserved.)
- Published
- 2017
- Full Text
- View/download PDF
246. The Survey Population: a Profile
- Author
-
Sharon J. Daye
- Subjects
Geography ,Network sampling ,Position (finance) ,Survey sampling ,Demographic economics ,Educational qualification - Abstract
The men and women who comprise the survey population around which this study is based were obtained through network sampling. Using original material obtained from in-depth interviews this chapter aims to build up a factual profile of this London-based survey population, firstly by looking at the background history of their migration to Britain and secondly, provide an outline of their present position by looking at their occupational position and educational qualifications.
- Published
- 1994
- Full Text
- View/download PDF
247. Modeling Preferential Recruitment for Respondent-Driven Sampling
- Author
-
McLaughlin, Katherine Rumjahn
- Subjects
- Statistics, Markov chain Monte Carlo, matching model, network sampling, peer referral, respondent-driven sampling, social network analysis
- Abstract
Respondent-driven sampling (RDS) is a network sampling methodology used worldwide to sample key populations at high risk for HIV/AIDS who often practice stigmatized/illegal behaviors and are not typically reachable by conventional sampling techniques. In RDS, study participants recruit their peers to enroll, resulting in a sampling mechanism that is unknown to researchers. Current estimators for RDS data require many assumptions about the sampling process, including that recruiters choose people from their network uniformly at random to participate in the study. However, this is likely not true in practice. We believe that people recruit based on observable covariates, such as age, frequency of interaction, geography, socioeconomic status, or social capital.To model preferential recruitment, I develop a sequential two-sided rational-choice framework, referred to as the RCPR model. At each wave of recruitment, each recruiter has a utility for selecting each peer, and symmetrically each peer has a utility for being recruited by each recruiter. Each person also has utilities for selecting themself (not recruiting or not participating). People in the network behave in a way that maximizes their utility given the constraints of the network and the restrictions on recruitment. Although a person's utility is not observed, it can be modeled as a linear combination of observable nodal or dyadic covariates plus unobserved pair-specific heterogeneities. This framework allows generative probabilistic network models to be created for the RDS recruitment process. The models can incorporate observable characteristics of the population and have interpretable parameters. It greatly increases the sophistication of the modeling of the RDS sampling mechanism. Inference can be made about the preference coefficients by maximizing the likelihood of the observed recruitment chain given the observed covariates. As the likelihood is computationally intractable, I develop a Bayesian framework where inference is made feasible by approximating the posterior distribution of the preference coefficients via a Markov chain Monte Carlo algorithm. Each update step samples new values of the preference coefficients and utilities via Metropolis-Hastings, subject to constraints. New prevalence estimates can be calculated be generating many recruitment chains from the population using the RCPR coefficients, then directly obtaining the first-order and second-order inclusion probabilities. This framework allows the incorporation of covariates we think effect recruitment into the sample weights.
- Published
- 2016
248. Network sampling coverage II: The effect of non-random missing data on network measurement.
- Author
-
Smith JA, Moody J, and Morgan J
- Abstract
Missing data is an important, but often ignored, aspect of a network study. Measurement validity is affected by missing data, but the level of bias can be difficult to gauge. Here, we describe the effect of missing data on network measurement across widely different circumstances. In Part I of this study (Smith and Moody, 2013), we explored the effect of measurement bias due to randomly missing nodes. Here, we drop the assumption that data are missing at random: what happens to estimates of key network statistics when central nodes are more/less likely to be missing? We answer this question using a wide range of empirical networks and network measures. We find that bias is worse when more central nodes are missing. With respect to network measures, Bonacich centrality is highly sensitive to the loss of central nodes, while closeness centrality is not; distance and bicomponent size are more affected than triad summary measures and behavioral homophily is more robust than degree-homophily. With respect to types of networks, larger, directed networks tend to be more robust, but the relation is weak. We end the paper with a practical application, showing how researchers can use our results (translated into a publically available java application) to gauge the bias in their own data.
- Published
- 2017
- Full Text
- View/download PDF
249. Quantity Versus Quality: A Survey Experiment to Improve the Network Scale-up Method.
- Author
-
Feehan DM, Umubyeyi A, Mahy M, Hladik W, and Salganik MJ
- Subjects
- Epidemiologic Methods, Female, HIV Infections etiology, Humans, Male, Risk Assessment methods, Rwanda epidemiology, Substance Abuse, Intravenous complications, Surveys and Questionnaires, Drug Users statistics & numerical data, HIV Infections epidemiology, Homosexuality, Male statistics & numerical data, Sex Workers statistics & numerical data, Social Environment, Social Networking, Substance Abuse, Intravenous epidemiology
- Abstract
The network scale-up method is a promising technique that uses sampled social network data to estimate the sizes of epidemiologically important hidden populations, such as sex workers and people who inject illicit drugs. Although previous scale-up research has focused exclusively on networks of acquaintances, we show that the type of personal network about which survey respondents are asked to report is a potentially crucial parameter that researchers are free to vary. This generalization leads to a method that is more flexible and potentially more accurate. In 2011, we conducted a large, nationally representative survey experiment in Rwanda that randomized respondents to report about one of 2 different personal networks. Our results showed that asking respondents for less information can, somewhat surprisingly, produce more accurate size estimates. We also estimated the sizes of 4 key populations at risk for human immunodeficiency virus infection in Rwanda. Our estimates were higher than earlier estimates from Rwanda but lower than international benchmarks. Finally, in this article we develop a new sensitivity analysis framework and use it to assess the possible biases in our estimates. Our design can be customized and extended for other settings, enabling researchers to continue to improve the network scale-up method., (© The Author 2016. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health.)
- Published
- 2016
- Full Text
- View/download PDF
250. Statistical analysis of network data motivated by problems in online social media
- Author
-
Zhang, Yaonan
- Subjects
- Statistics, Degree distribution, Networks, Network sampling, Review ratings
- Abstract
Networks have been widely used to represent and analyze a system of connected elements. Online social media networks, as a result of the expansion of the Internet and increased need of communication, have become an increasingly important part of people's lives. This thesis focuses on the statistical analysis of network data motivated by problems in online social media. It discusses problems arising from both explicit network data and implicit network data. Explicit network data are data where network structures are observable, implicit network data are those that do not have a network structure but occur under the influence of an underlying network. For the explicit network data analysis, we develop a novel method of recovering a fundamental characteristic -- network degree distributions -- under sampling. We formulate the problem of estimating degree distribution as an inverse problem. We show that this problem is ill-conditioned for many sampling methods in practice, and accordingly propose a constrained, penalized weighted least-squares approach to solve this problem. We demonstrate the ability of our method to accurately reconstruct the degree distributions from simulated network data and real world social network data. We also propose practical usage of the estimates relevant to marketing and advertising. For the implicit network data analysis, we look at review data from the popular review websites. Motivated by articles from the popular press and the research community which publicized that the average rating for top review sites is above 4 out of 5 stars, we study the phenomena of review rating trends and convergence using restaurant review data from TripAdvisor. We analyze the trend on different levels -- a rough analysis of the characteristics of the ratings, and a subtler statistical modeling with ordinal logistic regressions. Taking into account the implicit network underlying the review data, we suggest the upward trend observed in restaurant review ratings may be explained by social influence on an individual's perception of qualities. We use the intensity of review postings as an indicator of how popular a restaurant is and to test to what extent the increase in review intensity explains increases in average rating. After that, we consider a more nuanced approach to the joint modeling of ratings and review intensity which would allow for interaction between the two, rather than intensity serving only as an explanatory variable to ratings. Specifically, a state-space model is used to test the interaction between review intensity and review ratings.
- Published
- 2015
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.