Author: "Oliveira, Natalia L." - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Oliveira, Natalia L."' showing total 16 results

Start Over Author "Oliveira, Natalia L."

16 results on '"Oliveira, Natalia L."'

1. Unbiased Test Error Estimation in the Poisson Means Problem via Coupled Bootstrap Techniques

Author: Oliveira, Natalia L., Lei, Jing, and Tibshirani, Ryan J.
Subjects: Statistics - Methodology, Mathematics - Statistics Theory
Abstract: We propose a coupled bootstrap (CB) method for the test error of an arbitrary algorithm that estimates the mean in a Poisson sequence, often called the Poisson means problem. The idea behind our method is to generate two carefully-designed data vectors from the original data vector, by using synthetic binomial noise. One such vector acts as the training sample and the second acts as the test sample. To stabilize the test error estimate, we average this over multiple bootstrap B of the synthetic noise. A key property of the CB estimator is that it is unbiased for the test error in a Poisson problem where the original mean has been shrunken by a small factor, driven by the success probability $p$ in the binomial noise. Further, in the limit as $B \to \infty$ and $p \to 0$, we show that the CB estimator recovers a known unbiased estimator for test error based on Hudson's lemma, under no assumptions on the given algorithm for estimating the mean (in particular, no smoothness assumptions). Our methodology applies to two central loss functions that can be used to define test error: Poisson deviance and squared loss. Via a bias-variance decomposition, for each loss function, we analyze the effects of the binomial success probability and the number of bootstrap samples and on the accuracy of the estimator. We also investigate our method empirically across a variety of settings, using simulated as well as real data., Comment: 26 pages, 9 figures
Published: 2022

2. Drone flight data reveal energy and greenhouse gas emissions savings for small package delivery

Author: Rodrigues, Thiago A., Patrikar, Jay, Oliveira, Natalia L., Matthews, H. Scott, Scherer, Sebastian, and Samaras, Constantine
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: The adoption of Uncrewed Aerial Vehicles (UAVs) for last-mile deliveries will affect the energy productivity of package delivery and require new methods to understand the associated energy consumption and greenhouse gas (GHG) emissions. Here we combine empirical testing of 187 quadcopter flights with first principles analysis to develop a usable energy model for drone package delivery. We develop a machine-learning algorithm to assess energy use across three different flight regimes: takeoff, cruise, and landing. Our model shows that, in the US, a small electric quadcopter drone with a payload of 1 kg would consume approximately 0.05 MJ/km and result in 41 g of CO$_{2}$e per package. The energy per package delivered by drones (0.19 MJ/package) can be up to 96\% lower than conventional transportation modes. Our open model and generalizable coefficients can assist stakeholders in understanding and improving the energy use of drone package delivery.
Published: 2021

3. Unbiased Risk Estimation in the Normal Means Problem via Coupled Bootstrap Techniques

Author: Oliveira, Natalia L., Lei, Jing, and Tibshirani, Ryan J.
Subjects: Mathematics - Statistics Theory, Statistics - Methodology
Abstract: We develop a new approach for estimating the risk of an arbitrary estimator of the mean vector in the classical normal means problem. The key idea is to generate two auxiliary data vectors, by adding carefully constructed normal noise vectors to the original data. We then train the estimator of interest on the first auxiliary vector and test it on the second. In order to stabilize the risk estimate, we average this procedure over multiple draws of the synthetic noise vector. A key aspect of this coupled bootstrap (CB) approach is that it delivers an unbiased estimate of risk under no assumptions on the estimator of the mean vector, albeit for a modified and slightly "harder" version of the original problem, where the noise variance is elevated. We prove that, under the assumptions required for the validity of Stein's unbiased risk estimator (SURE), a limiting version of the CB estimator recovers SURE exactly. We then analyze a bias-variance decomposition of the error of the CB estimator, which elucidates the effects of the variance of the auxiliary noise and the number of bootstrap samples on the accuracy of the estimator. Lastly, we demonstrate that the CB estimator performs favorably in various simulated experiments., Comment: 32 pages, 9 figures
Published: 2021

4. An open repository of real-time COVID-19 indicators

Author: Reinhart, Alex, Brooks, Logan, Jahja, Maria, Rumack, Aaron, Tang, Jingjing, Agrawal, Sumit, Saeed, Wael Al, Arnold, Taylor, Basu, Amartya, Bien, Jacob, Cabrera, Ángel A, Chin, Andrew, Chua, Eu Jing, Clark, Brian, Colquhoun, Sarah, DeFries, Nat, Farrow, David C, Forlizzi, Jodi, Grabman, Jed, Gratzl, Samuel, Green, Alden, Haff, George, Han, Robin, Harwood, Kate, Hu, Addison J, Hyde, Raphael, Hyun, Sangwon, Joshi, Ananya, Kim, Jimi, Kuznetsov, Andrew, La Motte-Kerr, Wichada, Lee, Yeon Jin, Lee, Kenneth, Lipton, Zachary C, Liu, Michael X, Mackey, Lester, Mazaitis, Kathryn, McDonald, Daniel J, McGuinness, Phillip, Narasimhan, Balasubramanian, O’Brien, Michael P, Oliveira, Natalia L, Patil, Pratik, Perer, Adam, Politsch, Collin A, Rajanala, Samyak, Rucker, Dawn, Scott, Chris, Shah, Nigam H, Shankar, Vishnu, Sharpnack, James, Shemetov, Dmitry, Simon, Noah, Smith, Benjamin Y, Srivastava, Vishakha, Tan, Shuyi, Tibshirani, Robert, Tuzhilina, Elena, Van Nortwick, Ana Karina, Ventura, Valérie, Wasserman, Larry, Weaver, Benjamin, Weiss, Jeremy C, Whitman, Spencer, Williams, Kristin, Rosenfeld, Roni, and Tibshirani, Ryan J
Subjects: Bioengineering, Ambulatory Care, COVID-19, Databases, Factual, Epidemiologic Methods, Health Status Indicators, Humans, Internet, Physical Distancing, Surveys and Questionnaires, Travel, United States, open data, digital surveillance, internet surveys, medical insurance claims
Abstract: The COVID-19 pandemic presented enormous data challenges in the United States. Policy makers, epidemiological modelers, and health researchers all require up-to-date data on the pandemic and relevant public behavior, ideally at fine spatial and temporal resolution. The COVIDcast API is our attempt to fill this need: Operational since April 2020, it provides open access to both traditional public health surveillance signals (cases, deaths, and hospitalizations) and many auxiliary indicators of COVID-19 activity, such as signals extracted from deidentified medical claims data, massive online surveys, cell phone mobility data, and internet search trends. These are available at a fine geographic resolution (mostly at the county level) and are updated daily. The COVIDcast API also tracks all revisions to historical data, allowing modelers to account for the frequent revisions and backfill that are common for many public health data sources. All of the data are available in a common format through the API and accompanying R and Python software packages. This paper describes the data sources and signals, and provides examples demonstrating that the auxiliary signals in the COVIDcast API present information relevant to tracking COVID activity, augmenting traditional public health reporting and empowering research and decision-making.
Published: 2021

5. Can auxiliary indicators improve COVID-19 forecasting and hotspot prediction?

Author: McDonald, Daniel J, Bien, Jacob, Green, Alden, Hu, Addison J, DeFries, Nat, Hyun, Sangwon, Oliveira, Natalia L, Sharpnack, James, Tang, Jingjing, Tibshirani, Robert, Ventura, Valérie, Wasserman, Larry, and Tibshirani, Ryan J
Subjects: Law and Legal Studies, Public Health, Health Sciences, Bioengineering, Good Health and Well Being, COVID-19, Epidemiologic Methods, Forecasting, Health Status Indicators, Humans, Internet, Models, Statistical, Surveys and Questionnaires, United States, forecasting, hotspot prediction, time series, digital surveillance
Abstract: Short-term forecasts of traditional streams from public health reporting (such as cases, hospitalizations, and deaths) are a key input to public health decision-making during a pandemic. Since early 2020, our research group has worked with data partners to collect, curate, and make publicly available numerous real-time COVID-19 indicators, providing multiple views of pandemic activity in the United States. This paper studies the utility of five such indicators-derived from deidentified medical insurance claims, self-reported symptoms from online surveys, and COVID-related Google search activity-from a forecasting perspective. For each indicator, we ask whether its inclusion in an autoregressive (AR) model leads to improved predictive accuracy relative to the same model excluding it. Such an AR model, without external features, is already competitive with many top COVID-19 forecasting models in use today. Our analysis reveals that 1) inclusion of each of these five indicators improves on the overall predictive accuracy of the AR model; 2) predictive gains are in general most pronounced during times in which COVID cases are trending in "flat" or "down" directions; and 3) one indicator, based on Google searches, seems to be particularly helpful during "up" trends.
Published: 2021

6. TRAP: A Predictive Framework for Trail Running Assessment of Performance

Author: Fogliato, Riccardo, Oliveira, Natalia L., and Yurko, Ronald
Subjects: Statistics - Applications
Abstract: Trail running is an endurance sport in which athletes face severe physical challenges. Due to the growing number of participants, the organization of limited staff, equipment, and medical support in these races now plays a key role. Monitoring runner's performance is a difficult task that requires knowledge of the terrain and of the runner's ability. In the past, choices were solely based on the organizers' experience without reliance on data. However, this approach is neither scalable nor transferable. Instead, we propose a firm statistical methodology to perform this task, both before and during the race. Our proposed framework, Trail Running Assessment of Performance (TRAP), studies (1) the the assessment of the runner's ability to reach the next checkpoint, (2) the prediction of the runner's expected passage time at the next checkpoint, and (3) corresponding prediction intervals for the passage time. To obtain data on the ability of runners, we introduce a Python package, ScrapITRA, to access the race history of runners from the International Trail Running Association (ITRA). We apply our methodology, using the ITRA data along with checkpoint and terrain-level information, to the "holy grail" of ultra-trail running, the Ultra-Trail du Mont-Blanc (UTMB) race, demonstrating the predictive power of our methodology.
Published: 2020

7. Longitudinal 5-year prediction of cognitive impairment among men with HIV disease

Author: Oliveira, Natalia L, Kennedy, Edward H, Tibshirani, Ryan, Levine, Andrew, Martin, Eileen, Munro, Cynthia, Ragin, Ann B, Rubin, Leah H, Sacktor, Ned, Seaberg, Eric C, Weinstein, Andrea, and Becker, James T
Subjects: Medical Microbiology, Biomedical and Clinical Sciences, Health Sciences, Brain Disorders, Mental Health, Liver Disease, Neurosciences, Digestive Diseases, Infectious Diseases, Acquired Cognitive Impairment, Clinical Research, Prevention, Behavioral and Social Science, HIV/AIDS, Aging, Dementia, Infection, Good Health and Well Being, Cognitive Dysfunction, Cohort Studies, HIV Infections, Homosexuality, Male, Humans, Longitudinal Studies, Male, Sexual and Gender Minorities, cognition, HIV, LASSO, risk prediction, Neuropsychology Working Group of the Multicenter AIDS Cohort Study, Biological Sciences, Medical and Health Sciences, Psychology and Cognitive Sciences, Virology, Biomedical and clinical sciences, Health sciences
Abstract: BackgroundAlthough combination antiretroviral therapy reduced the prevalence of HIV-associated dementia, milder syndromes persist. Our goals were to predict cognitive impairment of the Multicenter AIDS Cohort Study (MACS) participants 5 years ahead and from a large pool of factors, select the ones that mostly contributed to our predictions.DesignLongitudinal, natural and treated history of HIV infection among MSM.MethodsThe MACS is a longitudinal study of the natural and treated history of HIV disease in MSM; the neuropsychological substudy aims to characterize cognitive disorders in men with HIV disease.ResultsWe modeled on an annual basis the risk of cognitive impairment 5 years in the future. We were able to predict cognitive impairment at individual level with high precision and overperform default methods. We found that while a diagnosis of AIDS is a critical risk factor, HIV infection per se does not necessarily convey additional risk. Other infectious processes, most notably hepatitis B and C, are independently associated with increased risk of impairment. The relative importance of an AIDS diagnosis diminished across calendar time.ConclusionOur prediction models are a powerful tool to help clinicians address dementia in early stages for MACS paticipants. The strongest predictors of future cognitive impairment included the presence of clinical AIDS and hepatitis B or C infection. The fact that the pattern of predictive power differs by calendar year suggests a clinically critical change to the face of the epidemic.
Published: 2021

8. Drone flight data reveal energy and greenhouse gas emissions savings for very small package delivery

Author: Rodrigues, Thiago A., Patrikar, Jay, Oliveira, Natalia L., Matthews, H. Scott, Scherer, Sebastian, and Samaras, Constantine
Published: 2022
Full Text: View/download PDF

9. The Likelihood Ratio Test and Full Bayesian Significance Test under small sample sizes for contingency tables

Author: Oliveira, Natalia L., Pereira, Carlos A. de B., Diniz, Marcio A., and Polpo, Adriano
Subjects: Statistics - Methodology
Abstract: Hypothesis testing in contingency tables is usually based on asymptotic results, thereby restricting its proper use to large samples. To study these tests in small samples, we consider the likelihood ratio test and define an accurate index, the P-value, for the celebrated hypotheses of homogeneity, independence, and Hardy-Weinberg equilibrium. The aim is to understand the use of the asymptotic results of the frequentist Likelihood Ratio Test and the Bayesian FBST -- Full Bayesian Significance Test -- under small-sample scenarios. The proposed exact P-value is used as a benchmark to understand the other indices. We perform analysis in different scenarios, considering different sample sizes and different table dimensions. The exact Fisher test for $2 \times 2$ tables that drastically reduces the sample space is also discussed. The main message of this paper is that all indices have very similar behavior, so the tests based on asymptotic results are very good to be used in any circumstance, even with small sample sizes.
Published: 2016
Full Text: View/download PDF

10. Coupled Bootstrap Test Error Estimation for Poisson Variables

Author: Oliveira, Natalia L., Lei, Jing, and Tibshirani, Ryan J.
Subjects: Methodology (stat.ME), FOS: Computer and information sciences, FOS: Mathematics, Mathematics - Statistics Theory, Statistics Theory (math.ST), Statistics - Methodology
Abstract: Test error estimation is a fundamental problem in statistics and machine learning. Correctly assessing the future performance of an algorithm is an essential task, especially with the development of complex predictive algorithms that require data-driven parameter tuning. We propose a new coupled bootstrap estimator for the test error of Poisson-response algorithms, a fundamental model for count data and with applications such as signal processing, density estimation, and queue theory. The idea behind our estimator is to generate two carefully designed new random vectors from the original data, where one acts as a training sample and the other as a test set. It is unbiased for an intuitive parameter: the out-of-sample error of a Poisson random vector whose mean has been shrunken by a small factor. Moreover, in a limiting regime, the coupled bootstrap estimator recovers an exactly unbiased estimator for test error. Our framework is applicable to loss functions of the Bregman divergence family, and our analysis and examples focus on two important cases: Poisson likelihood deviance and squared loss. Through a bias-variance decomposition, we analyze the effect of the number of bootstrap samples and the added noise due to the two auxiliary variables. We then apply our method to different scenarios with both simulated and real data., 28 pages, 12 figures
Published: 2022

11. An Open Repository of Real-Time COVID-19 Indicators

Author: Reinhart, Alex, primary, Brooks, Logan, additional, Jahja, Maria, additional, Rumack, Aaron, additional, Tang, Jingjing, additional, Agrawal, Sumit, additional, Saeed, Wael Al, additional, Arnold, Taylor, additional, Basu, Amartya, additional, Bien, Jacob, additional, Cabrera, Ángel A., additional, Chin, Andrew, additional, Chua, Eu Jing, additional, Clark, Brian, additional, Colquhoun, Sarah, additional, DeFries, Nat, additional, Farrow, David C., additional, Forlizzi, Jodi, additional, Grabman, Jed, additional, Gratzl, Samuel, additional, Green, Alden, additional, Haff, George, additional, Han, Robin, additional, Harwood, Kate, additional, Hu, Addison J., additional, Hyde, Raphael, additional, Hyun, Sangwon, additional, Joshi, Ananya, additional, Kim, Jimi, additional, Kuznetsov, Andrew, additional, Motte-Kerr, Wichada La, additional, Lee, Yeon Jin, additional, Lee, Kenneth, additional, Lipton, Zachary C., additional, Liu, Michael X., additional, Mackey, Lester, additional, Mazaitis, Kathryn, additional, McDonald, Daniel J., additional, McGuinness, Phillip, additional, Narasimhan, Balasubramanian, additional, O’Brien, Michael P., additional, Oliveira, Natalia L., additional, Patil, Pratik, additional, Perer, Adam, additional, Politsch, Collin A., additional, Rajanala, Samyak, additional, Rucker, Dawn, additional, Scott, Chris, additional, Shah, Nigam H., additional, Shankar, Vishnu, additional, Sharpnack, James, additional, Shemetov, Dmitry, additional, Simon, Noah, additional, Smith, Benjamin Y., additional, Srivastava, Vishakha, additional, Tan, Shuyi, additional, Tibshirani, Robert, additional, Tuzhilina, Elena, additional, Van Nortwick, Ana Karina, additional, Ventura, Valérie, additional, Wasserman, Larry, additional, Weaver, Benjamin, additional, Weiss, Jeremy C., additional, Whitman, Spencer, additional, Williams, Kristin, additional, Rosenfeld, Roni, additional, and Tibshirani, Ryan J., additional
Published: 2021
Full Text: View/download PDF

12. Can Auxiliary Indicators Improve COVID-19 Forecasting and Hotspot Prediction?

Author: McDonald, Daniel J., primary, Bien, Jacob, additional, Green, Alden, additional, Hu, Addison J., additional, DeFries, Nat, additional, Hyun, Sangwon, additional, Oliveira, Natalia L., additional, Sharpnack, James, additional, Tang, Jingjing, additional, Tibshirani, Robert, additional, Ventura, Valérie, additional, Wasserman, Larry, additional, and Tibshirani, Ryan J., additional
Published: 2021
Full Text: View/download PDF

13. TRAP: a predictive framework for the Assessment of Performance in Trail Running

Author: Fogliato, Riccardo, primary, Oliveira, Natalia L., additional, and Yurko, Ronald, additional
Published: 2020
Full Text: View/download PDF

14. TRAP: a predictive framework for the Assessment of Performance in Trail Running.

Author: Fogliato, Riccardo, Oliveira, Natalia L., and Yurko, Ronald
Subjects: TRAIL running, ENDURANCE sports, LONG-distance running, RANDOM forest algorithms
Abstract: Trail running is an endurance sport in which athletes face severe physical challenges. Due to the growing number of participants, the organization of limited staff, equipment, and medical support in these races now plays a key role. Monitoring runner's performance is a difficult task that requires knowledge of the terrain and of the runner's ability. In the past, choices were solely based on the organizers' experience without reliance on data. However, this approach is neither scalable nor transferable. Instead, we propose a firm statistical methodology to perform this task, both before and during the race. Our proposed framework, Trail Running Assessment of Performance (TRAP), studies (1) the assessment of the runner's ability to reach the next checkpoint, (2) the prediction of the runner's expected passage time at the next checkpoint, and (3) corresponding prediction intervals for the passage time. We apply our methodology, using the race history of runners from the International Trail Running Association (ITRA) along with checkpoint and terrain-level information, to the "holy grail" of ultra-trail running, the Ultra-Trail du Mont-Blanc (UTMB) race, demonstrating the predictive power of our methodology. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

15. A discussion on significance indices for contingency tables under small sample sizes

Author: Oliveira, Natalia L., primary, Pereira, Carlos A. de B., additional, Diniz, Marcio A., additional, and Polpo, Adriano, additional
Published: 2018
Full Text: View/download PDF

16. Copper and cadmium complexation by Cylindrospermopsis raciborskii exudates

Author: Tonietto, Alessandra Emanuele, primary, Oliveira, Natalia L., additional, Lombardi, Ana Teresa, additional, and Polpo, Adriano, additional
Published: 2016
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

16 results on '"Oliveira, Natalia L."'

1. Unbiased Test Error Estimation in the Poisson Means Problem via Coupled Bootstrap Techniques

2. Drone flight data reveal energy and greenhouse gas emissions savings for small package delivery

3. Unbiased Risk Estimation in the Normal Means Problem via Coupled Bootstrap Techniques

4. An open repository of real-time COVID-19 indicators

5. Can auxiliary indicators improve COVID-19 forecasting and hotspot prediction?

6. TRAP: A Predictive Framework for Trail Running Assessment of Performance

7. Longitudinal 5-year prediction of cognitive impairment among men with HIV disease

8. Drone flight data reveal energy and greenhouse gas emissions savings for very small package delivery

9. The Likelihood Ratio Test and Full Bayesian Significance Test under small sample sizes for contingency tables

10. Coupled Bootstrap Test Error Estimation for Poisson Variables

11. An Open Repository of Real-Time COVID-19 Indicators

12. Can Auxiliary Indicators Improve COVID-19 Forecasting and Hotspot Prediction?

13. TRAP: a predictive framework for the Assessment of Performance in Trail Running

14. TRAP: a predictive framework for the Assessment of Performance in Trail Running.

15. A discussion on significance indices for contingency tables under small sample sizes

16. Copper and cadmium complexation by Cylindrospermopsis raciborskii exudates

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

16 results on '"Oliveira, Natalia L."'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources