2,595 results on '"Ferres A"'
Search Results
52. TorchGeo: Deep Learning With Geospatial Data
- Author
-
Stewart, Adam J., Robinson, Caleb, Corley, Isaac A., Ortiz, Anthony, Ferres, Juan M. Lavista, and Banerjee, Arindam
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
Remotely sensed geospatial data are critical for applications including precision agriculture, urban planning, disaster monitoring and response, and climate change research, among others. Deep learning methods are particularly promising for modeling many remote sensing tasks given the success of deep neural networks in similar computer vision tasks and the sheer volume of remotely sensed imagery available. However, the variance in data collection methods and handling of geospatial metadata make the application of deep learning methodology to remotely sensed data nontrivial. For example, satellite imagery often includes additional spectral bands beyond red, green, and blue and must be joined to other geospatial data sources that can have differing coordinate systems, bounds, and resolutions. To help realize the potential of deep learning for remote sensing applications, we introduce TorchGeo, a Python library for integrating geospatial data into the PyTorch deep learning ecosystem. TorchGeo provides data loaders for a variety of benchmark datasets, composable datasets for generic geospatial data sources, samplers for geospatial data, and transforms that work with multispectral imagery. TorchGeo is also the first library to provide pre-trained models for multispectral satellite imagery (e.g., models that use all bands from the Sentinel-2 satellites), allowing for advances in transfer learning on downstream remote sensing tasks with limited labeled data. We use TorchGeo to create reproducible benchmark results on existing datasets and benchmark our proposed method for preprocessing geospatial imagery on the fly. TorchGeo is open source and available on GitHub: https://github.com/microsoft/torchgeo.
- Published
- 2021
53. Interpretable and Explainable Machine Learning for Materials Science and Chemistry
- Author
-
Oviedo, Felipe, Ferres, Juan Lavista, Buonassisi, Tonio, and Butler, Keith
- Subjects
Condensed Matter - Materials Science ,Computer Science - Machine Learning - Abstract
While the uptake of data-driven approaches for materials science and chemistry is at an exciting, early stage, to realise the true potential of machine learning models for successful scientific discovery, they must have qualities beyond purely predictive power. The predictions and inner workings of models should provide a certain degree of explainability by human experts, permitting the identification of potential model issues or limitations, building trust on model predictions and unveiling unexpected correlations that may lead to scientific insights. In this work, we summarize applications of interpretability and explainability techniques for materials science and chemistry and discuss how these techniques can improve the outcome of scientific studies. We discuss various challenges for interpretable machine learning in materials science and, more broadly, in scientific settings. In particular, we emphasize the risks of inferring causation or reaching generalization by purely interpreting machine learning models and the need of uncertainty estimates for model explanations. Finally, we showcase a number of exciting developments in other fields that could benefit interpretability in material science and chemistry problems., Comment: Under review Accounts of Material Research
- Published
- 2021
- Full Text
- View/download PDF
54. Effects of the Use of Neuronavigation in Patients with Supratentorial Brain Gliomas: A Cohort Study
- Author
-
Perera Valdivia, Doriam, Zapata Vega, Luis, Herrera Pérez, Edgar, Toledo Cisneros, Francisco, Gómez López, Lorena, Guzmán Reynoso, Lagree, Rumià Arboix, Jordi, Di Somma, Alberto, Enseñat Nora, Joaquim, Ferrés Pijoan, Abel, and Roldán Ramos, Pedro
- Published
- 2024
- Full Text
- View/download PDF
55. Viral shedding and viraemia of Andes virus during acute hantavirus infection: a prospective study
- Author
-
Ferrés, Marcela, Martínez-Valdebenito, Constanza, Henriquez, Carolina, Marco, Claudia, Angulo, Jenniffer, Barrera, Aldo, Palma, Carlos, Barriga Pinto, Gonzalo, Cuiza, Analia, Ferreira, Leonila, Rioseco, María Luisa, Calvo, Mario, Fritz, Ricardo, Bravo, Sebastián, Bruhn, Alejandro, Graf, Jerónimo, Llancaqueo, Alvaro, Rivera, Gonzalo, Cerda, Carolina, Tischler, Nicole, Valdivieso, Francisca, Vial, Pablo, Mertz, Gregory, Vial, Cecilia, and Le Corre, Nicole
- Published
- 2024
- Full Text
- View/download PDF
56. Endonasal versus supraorbital approach for anterior skull base meningiomas: Results and quality of life assessment from a single-surgeon cohort
- Author
-
Torales, Jorge, Di Somma, Alberto, Alobid, Isam, Lopez, Mauricio, Hoyos, Jhon, Ferres, Abel, Morillas, Ruben, Reyes, Luis, Roldan, Pedro, Valero, Ricard, and Enseñat, Joaquim
- Published
- 2024
- Full Text
- View/download PDF
57. Structuring concrete boundary objects for project-to-project learning: a state-of-practice review
- Author
-
Ferres, Geoffrey Mark and Moehler, Robert C.
- Published
- 2023
- Full Text
- View/download PDF
58. Linking physical violence to women’s mobility in Chile
- Author
-
Contreras, Hugo, Candia, Cristian, Troncoso, Rodrigo, Ferres, Leo, Bravo, Loreto, Lepri, Bruno, and Rodriguez-Sickert, Carlos
- Published
- 2023
- Full Text
- View/download PDF
59. An observational, sequential analysis of the relationship between local economic distress and inequities in health outcomes, clinical care, health behaviors, and social determinants of health
- Author
-
Weeks, William B, Chang, Ji E, Pagán, José A, Aerts, Ann, Weinstein, James N, and Ferres, Juan Lavista
- Published
- 2023
- Full Text
- View/download PDF
60. Modeling to explore and challenge inherent assumptions when cultural norms have changed: a case study on left-handedness and life expectancy
- Author
-
Ferres, Juan Lavista, Nasir, Md, Bijral, Avleen, Subramanian, S V, and Weeks, William B
- Published
- 2023
- Full Text
- View/download PDF
61. A dataset to assess mobility changes in Chile following local quarantines
- Author
-
Pappalardo, Luca, Cornacchia, Giuliano, Navarro, Victor, Bravo, Loreto, and Ferres, Leo
- Published
- 2023
- Full Text
- View/download PDF
62. Predicting locations of cryptic pockets from single protein structures using the PocketMiner graph neural network
- Author
-
Meller, Artur, Ward, Michael, Borowsky, Jonathan, Kshirsagar, Meghana, Lotthammer, Jeffrey M., Oviedo, Felipe, Ferres, Juan Lavista, and Bowman, Gregory R.
- Published
- 2023
- Full Text
- View/download PDF
63. Evaluating the generalizability of deep learning image classification algorithms to detect middle ear disease using otoscopy
- Author
-
Habib, Al-Rahim, Xu, Yixi, Bock, Kris, Mohanty, Shrestha, Sederholm, Tina, Weeks, William B., Dodhia, Rahul, Ferres, Juan Lavista, Perry, Chris, Sacks, Raymond, and Singh, Narinder
- Published
- 2023
- Full Text
- View/download PDF
64. Challenges of COVID-19 Case Forecasting in the US, 2020-2021.
- Author
-
Velma K Lopez, Estee Y Cramer, Robert Pagano, John M Drake, Eamon B O'Dea, Madeline Adee, Turgay Ayer, Jagpreet Chhatwal, Ozden O Dalgic, Mary A Ladd, Benjamin P Linas, Peter P Mueller, Jade Xiao, Johannes Bracher, Alvaro J Castro Rivadeneira, Aaron Gerding, Tilmann Gneiting, Yuxin Huang, Dasuni Jayawardena, Abdul H Kanji, Khoa Le, Anja Mühlemann, Jarad Niemi, Evan L Ray, Ariane Stark, Yijin Wang, Nutcha Wattanachit, Martha W Zorn, Sen Pei, Jeffrey Shaman, Teresa K Yamana, Samuel R Tarasewicz, Daniel J Wilson, Sid Baccam, Heidi Gurung, Steve Stage, Brad Suchoski, Lei Gao, Zhiling Gu, Myungjin Kim, Xinyi Li, Guannan Wang, Lily Wang, Yueying Wang, Shan Yu, Lauren Gardner, Sonia Jindal, Maximilian Marshall, Kristen Nixon, Juan Dent, Alison L Hill, Joshua Kaminsky, Elizabeth C Lee, Joseph C Lemaitre, Justin Lessler, Claire P Smith, Shaun Truelove, Matt Kinsey, Luke C Mullany, Kaitlin Rainwater-Lovett, Lauren Shin, Katharine Tallaksen, Shelby Wilson, Dean Karlen, Lauren Castro, Geoffrey Fairchild, Isaac Michaud, Dave Osthus, Jiang Bian, Wei Cao, Zhifeng Gao, Juan Lavista Ferres, Chaozhuo Li, Tie-Yan Liu, Xing Xie, Shun Zhang, Shun Zheng, Matteo Chinazzi, Jessica T Davis, Kunpeng Mu, Ana Pastore Y Piontti, Alessandro Vespignani, Xinyue Xiong, Robert Walraven, Jinghui Chen, Quanquan Gu, Lingxiao Wang, Pan Xu, Weitong Zhang, Difan Zou, Graham Casey Gibson, Daniel Sheldon, Ajitesh Srivastava, Aniruddha Adiga, Benjamin Hurt, Gursharn Kaur, Bryan Lewis, Madhav Marathe, Akhil Sai Peddireddy, Przemyslaw Porebski, Srinivasan Venkatramanan, Lijing Wang, Pragati V Prasad, Jo W Walker, Alexander E Webber, Rachel B Slayton, Matthew Biggerstaff, Nicholas G Reich, and Michael A Johansson
- Subjects
Biology (General) ,QH301-705.5 - Abstract
During the COVID-19 pandemic, forecasting COVID-19 trends to support planning and response was a priority for scientists and decision makers alike. In the United States, COVID-19 forecasting was coordinated by a large group of universities, companies, and government entities led by the Centers for Disease Control and Prevention and the US COVID-19 Forecast Hub (https://covid19forecasthub.org). We evaluated approximately 9.7 million forecasts of weekly state-level COVID-19 cases for predictions 1-4 weeks into the future submitted by 24 teams from August 2020 to December 2021. We assessed coverage of central prediction intervals and weighted interval scores (WIS), adjusting for missing forecasts relative to a baseline forecast, and used a Gaussian generalized estimating equation (GEE) model to evaluate differences in skill across epidemic phases that were defined by the effective reproduction number. Overall, we found high variation in skill across individual models, with ensemble-based forecasts outperforming other approaches. Forecast skill relative to the baseline was generally higher for larger jurisdictions (e.g., states compared to counties). Over time, forecasts generally performed worst in periods of rapid changes in reported cases (either in increasing or decreasing epidemic phases) with 95% prediction interval coverage dropping below 50% during the growth phases of the winter 2020, Delta, and Omicron waves. Ideally, case forecasts could serve as a leading indicator of changes in transmission dynamics. However, while most COVID-19 case forecasts outperformed a naïve baseline model, even the most accurate case forecasts were unreliable in key phases. Further research could improve forecasts of leading indicators, like COVID-19 cases, by leveraging additional real-time data, addressing performance across phases, improving the characterization of forecast confidence, and ensuring that forecasts were coherent across spatial scales. In the meantime, it is critical for forecast users to appreciate current limitations and use a broad set of indicators to inform pandemic-related decision making.
- Published
- 2024
- Full Text
- View/download PDF
65. Land use and Europe’s renewable energy transition: identifying low-conflict areas for wind and solar development
- Author
-
Joseph M. Kiesecker, Jeffrey S. Evans, James R. Oakleaf, Kasandra Zorica Dropuljić, Igor Vejnović, Chris Rosslowe, Elisabeth Cremona, Aishwarya L. Bhattacharjee, Shivaprakash K. Nagaraju, Anthony Ortiz, Caleb Robinson, Juan Lavista Ferres, Mate Zec, and Kei Sochi
- Subjects
climate mitigation ,development scenarios ,energy impacts ,energy policy ,energy sprawl ,energy transition ,Environmental sciences ,GE1-350 - Abstract
Continued dependence on imported fossil fuels is rapidly becoming unsustainable in the face of the twin challenges of global climate change and energy security demands in Europe. Here we present scenarios in line with REPowerEU package to identify Renewables Acceleration Areas that support rapid renewable expansion, while ensuring minimal harm to places important for biodiversity and rural communities. We calculated the area needed to meet renewable energy objectives under Business-as-Usual (BAU) and Low-conflict (LCON) development scenarios within each country, providing a broad overview of the potential for renewable energy generation to reduce impacts when development is steered toward lower conflict lands. Our analysis shows that meeting renewable energy objectives would require a network of land-based wind turbines and solar arrays encompassing upwards of 164,789 km2 by 2030 and 445,654 km2 by 2050, the latter roughly equivalent to the land area of Sweden. Our results highlight that BAU development patterns disproportionately target high-conflict land cover types. By 2030, depending on the development pathway, solar and wind development are projected to impact approximately 4,386–20,996 km2 and 65,735–138,454 km2 of natural and agricultural lands, respectively. As renewable energy objectives increase from 2030 to 2050 impacts to natural and agricultural lands also increase, with upwards of 33,911 km2 from future solar development and 399,879 km2 from wind development. Despite this large footprint, low-conflict lands can generate substantial renewable energy: 6.6 million GWh of solar and 3.5 million GWh of wind, 8–31 times 2030 solar objectives and 3–5 times 2030 wind objectives. Given these patterns, we emphasize the need for careful planning in areas with greater impact potential, either due to a larger demand for land area or limited land availability. Top-emitting countries with large renewable energy objectives (Germany, Italy, Poland, France, Spain) and those with limited flexibility in meeting objectives on low-conflict land (Albania, Slovenia, Montenegro, Hungary, Croatia, Serbia, Bosnia Herzegovina, Finland, Greece, Portugal, and Norway) should be priorities for country-level customizations to guide low-conflict siting and avoid disproportionate impacts on high-value areas.
- Published
- 2024
- Full Text
- View/download PDF
66. Health and Wealth in America
- Author
-
William B. Weeks, Juan M. Lavista Ferres, and James N. Weinstein
- Subjects
public health ,social determinansts of health ,economic activity ,wealth ,income ,Public aspects of medicine ,RA1-1270 - Published
- 2024
- Full Text
- View/download PDF
67. Retinal Microvasculature as Biomarker for Diabetes and Cardiovascular Diseases
- Author
-
Trivedi, Anusua, Desbiens, Jocelyn, Gross, Ron, Gupta, Sunil, Dodhia, Rahul, and Ferres, Juan Lavista
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
Purpose: To demonstrate that retinal microvasculature per se is a reliable biomarker for Diabetic Retinopathy (DR) and, by extension, cardiovascular diseases. Methods: Deep Learning Convolutional Neural Networks (CNN) applied to color fundus images for semantic segmentation of the blood vessels and severity classification on both vascular and full images. Vessel reconstruction through harmonic descriptors is also used as a smoothing and de-noising tool. The mathematical background of the theory is also outlined. Results: For diabetic patients, at least 93.8% of DR No-Refer vs. Refer classification can be related to vasculature defects. As for the Non-Sight Threatening vs. Sight Threatening case, the ratio is as high as 96.7%. Conclusion: In the case of DR, most of the disease biomarkers are related topologically to the vasculature. Translational Relevance: Experiments conducted on eye blood vasculature reconstruction as a biomarker shows a strong correlation between vasculature shape and later stages of DR.
- Published
- 2021
68. Detecting Cattle and Elk in the Wild from Space
- Author
-
Robinson, Caleb, Ortiz, Anthony, Hughey, Lacey, Stabach, Jared A., and Ferres, Juan M. Lavista
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
Localizing and counting large ungulates -- hoofed mammals like cows and elk -- in very high-resolution satellite imagery is an important task for supporting ecological studies. Prior work has shown that this is feasible with deep learning based methods and sub-meter multi-spectral satellite imagery. We extend this line of work by proposing a baseline method, CowNet, that simultaneously estimates the number of animals in an image (counts), as well as predicts their location at a pixel level (localizes). We also propose an methodology for evaluating such models on counting and localization tasks across large scenes that takes the uncertainty of noisy labels and the information needed by stakeholders in ecological monitoring tasks into account. Finally, we benchmark our baseline method with state of the art vision methods for counting objects in scenes. We specifically test the temporal generalization of the resulting models over a large landscape in Point Reyes Seashore, CA. We find that the LC-FCN model performs the best and achieves an average precision between 0.56 and 0.61 and an average recall between 0.78 and 0.92 over three held out test scenes., Comment: Presented at the KDD 2021 Fragile Earth Workshop
- Published
- 2021
69. An Analysis of the Deployment of Models Trained on Private Tabular Synthetic Data: Unexpected Surprises
- Author
-
Pereira, Mayana, Kshirsagar, Meghana, Mukherjee, Sumit, Dodhia, Rahul, and Ferres, Juan Lavista
- Subjects
Statistics - Machine Learning ,Computer Science - Computers and Society ,Computer Science - Machine Learning - Abstract
Diferentially private (DP) synthetic datasets are a powerful approach for training machine learning models while respecting the privacy of individual data providers. The effect of DP on the fairness of the resulting trained models is not yet well understood. In this contribution, we systematically study the effects of differentially private synthetic data generation on classification. We analyze disparities in model utility and bias caused by the synthetic dataset, measured through algorithmic fairness metrics. Our first set of results show that although there seems to be a clear negative correlation between privacy and utility (the more private, the less accurate) across all data synthesizers we evaluated, more privacy does not necessarily imply more bias. Additionally, we assess the effects of utilizing synthetic datasets for model training and model evaluation. We show that results obtained on synthetic data can misestimate the actual model performance when it is deployed on real data. We hence advocate on the need for defining proper testing protocols in scenarios where differentially private synthetic datasets are utilized for model training and evaluation.
- Published
- 2021
70. A machine learning pipeline for aiding school identification from child trafficking images
- Author
-
Mukherjee, Sumit, Sederholm, Tina, Roman, Anthony C., Sankar, Ria, Caltagirone, Sherrie, and Ferres, Juan Lavista
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Child trafficking in a serious problem around the world. Every year there are more than 4 million victims of child trafficking around the world, many of them for the purposes of child sexual exploitation. In collaboration with UK Police and a non-profit focused on child abuse prevention, Global Emancipation Network, we developed a proof-of-concept machine learning pipeline to aid the identification of children from intercepted images. In this work, we focus on images that contain children wearing school uniforms to identify the school of origin. In the absence of a machine learning pipeline, this hugely time consuming and labor intensive task is manually conducted by law enforcement personnel. Thus, by automating aspects of the school identification process, we hope to significantly impact the speed of this portion of child identification. Our proposed pipeline consists of two machine learning models: i) to identify whether an image of a child contains a school uniform in it, and ii) identification of attributes of different school uniform items (such as color/texture of shirts, sweaters, blazers etc.). We describe the data collection, labeling, model development and validation process, along with strategies for efficient searching of schools using the model predictions.
- Published
- 2021
71. The Spanish adaptation of the relationship power inventory
- Author
-
Alonso-Ferres, María, Serrano-Montilla, Celia, Valor-Segura, Inmaculada, and Expósito, Francisca
- Subjects
Couples -- Psychological aspects -- Social aspects ,Psychological tests -- Evaluation ,Psychology and mental health - Abstract
Power dynamics are fundamental when negotiating conflicts. However, no instrument for measuring power in romantic relationships has been adequately adapted to Spanish culture. The goal of this research was to adapt the Relationship Power Inventory (RPI; Farrell et al., 2015) to Spanish culture and language, filling this gap by providing a rigorous instrument for evaluating this construct. Study 1 was conducted to obtain evidence based on Spanish adaptation of RPI content. Once the Spanish adaptation of the RPI was built and we obtained validity evidence based on the test content, in Study 2, the scale was administered to two different samples of the adult population following a cross-validation approach. Specifically, in Sample 1 (N = 400), the training sample, a statistical analysis and an exploration of the dimensional structure and reliability of the measure were carried out. In Sample 2 (N = 755), the validation sample, the internal structure of the scale was confirmed, and evidence of external validity and generalization was obtained. The exploratory and confirmatory factor analysis showed a good fit for the four-factor structure. These dimensions were invariant to gender and had adequate validity based on their relationship with other variables (dependence on the partner, conflict-resolution strategies, and psychological well-being). In sum, the Spanish version of the RPI (SARPI) is a reliable instrument with sufficient valid evidence to provide accurate measurement of power differences in the context of romantic relationships., Author(s): María Alonso-Ferres [sup.1] , Celia Serrano-Montilla [sup.2] , Inmaculada Valor-Segura [sup.1] , Francisca Expósito [sup.1] Author Affiliations: (1) grid.4489.1, 0000000121678994, Department of Social Psychology, Mind, Brain and Behavioral Research [...]
- Published
- 2023
- Full Text
- View/download PDF
72. An observational, sequential analysis of the relationship between local economic distress and inequities in health outcomes, clinical care, health behaviors, and social determinants of health
- Author
-
William B Weeks, Ji E Chang, José A Pagán, Ann Aerts, James N Weinstein, and Juan Lavista Ferres
- Subjects
Population health ,Health equity ,Socio-economic status ,Social determinants of health ,Public aspects of medicine ,RA1-1270 - Abstract
Abstract Background Socioeconomic status has long been associated with population health and health outcomes. While ameliorating social determinants of health may improve health, identifying and targeting areas where feasible interventions are most needed would help improve health equity. We sought to identify inequities in health and social determinants of health (SDOH) associated with local economic distress at the county-level. Methods For 3,131 counties in the 50 US states and Washington, DC (wherein approximately 325,711,203 people lived in 2019), we conducted a retrospective analysis of county-level data collected from County Health Rankings in two periods (centering around 2015 and 2019). We used ANOVA to compare thirty-three measures across five health and SDOH domains (Health Outcomes, Clinical Care, Health Behaviors, Physical Environment, and Social and Economic Factors) that were available in both periods, changes in measures between periods, and ratios of measures for the least to most prosperous counties across county-level prosperity quintiles, based on the Economic Innovation Group’s 2015–2019 Distressed Community Index Scores. Results With seven exceptions, in both periods, we found a worsening of values with each progression from more to less prosperous counties, with least prosperous counties having the worst values (ANOVA p
- Published
- 2023
- Full Text
- View/download PDF
73. How severity of intimate partner violence is perceived and related to attitudinal variables? A systematic review and meta-analysis
- Author
-
Badenes-Sastre, Marta, Spencer, Chelsea M., Alonso-Ferres, María, Lorente, Miguel, and Expósito, Francisca
- Published
- 2024
- Full Text
- View/download PDF
74. Height Estimation of Children under Five Years using Depth Images
- Author
-
Trivedi, Anusua, Jain, Mohit, Gupta, Nikhil Kumar, Hinsche, Markus, Singh, Prashant, Matiaschek, Markus, Behrens, Tristan, Militeri, Mirco, Birge, Cameron, Kaushik, Shivangi, Mohapatra, Archisman, Chatterjee, Rita, Dodhia, Rahul, and Ferres, Juan Lavista
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Malnutrition is a global health crisis and is the leading cause of death among children under five. Detecting malnutrition requires anthropometric measurements of weight, height, and middle-upper arm circumference. However, measuring them accurately is a challenge, especially in the global south, due to limited resources. In this work, we propose a CNN-based approach to estimate the height of standing children under five years from depth images collected using a smart-phone. According to the SMART Methodology Manual [5], the acceptable accuracy for height is less than 1.4 cm. On training our deep learning model on 87131 depth images, our model achieved an average mean absolute error of 1.64% on 57064 test images. For 70.3% test images, we estimated height accurately within the acceptable 1.4 cm range. Thus, our proposed solution can accurately detect stunting (low height-for-age) in standing children below five years of age.
- Published
- 2021
75. Becoming Good at AI for Good
- Author
-
Kshirsagar, Meghana, Robinson, Caleb, Yang, Siyu, Gholami, Shahrzad, Klyuzhin, Ivan, Mukherjee, Sumit, Nasir, Md, Ortiz, Anthony, Oviedo, Felipe, Tanner, Darren, Trivedi, Anusua, Xu, Yixi, Zhong, Ming, Dilkina, Bistra, Dodhia, Rahul, and Ferres, Juan M. Lavista
- Subjects
Computer Science - Computers and Society - Abstract
AI for good (AI4G) projects involve developing and applying artificial intelligence (AI) based solutions to further goals in areas such as sustainability, health, humanitarian aid, and social justice. Developing and deploying such solutions must be done in collaboration with partners who are experts in the domain in question and who already have experience in making progress towards such goals. Based on our experiences, we detail the different aspects of this type of collaboration broken down into four high-level categories: communication, data, modeling, and impact, and distill eleven takeaways to guide such projects in the future. We briefly describe two case studies to illustrate how some of these takeaways were applied in practice during our past collaborations., Comment: Accepted to AIES-2021
- Published
- 2021
76. U.S. Broadband Coverage Data Set: A Differentially Private Data Release
- Author
-
Pereira, Mayana, Kim, Allen, Allen, Joshua, White, Kevin, Ferres, Juan Lavista, and Dodhia, Rahul
- Subjects
Computer Science - Cryptography and Security ,Statistics - Applications - Abstract
Broadband connectivity is a key metric in today's economy. In an era of rapid expansion of the digital economy, it directly impacts GDP. Furthermore, with the COVID-19 guidelines of social distancing, internet connectivity became necessary to everyday activities such as work, learning, and staying in touch with family and friends. This paper introduces a publicly available U.S. Broadband Coverage data set that reports broadband coverage percentages at a zip code-level. We also explain how we used differential privacy to guarantee that the privacy of individual households is preserved. Our data set also contains error ranges estimates, providing information on the expected error introduced by differential privacy per zip code. We describe our error range calculation method and show that this additional data metric does not induce any privacy losses.
- Published
- 2021
77. Temporal Cluster Matching for Change Detection of Structures from Satellite Imagery
- Author
-
Robinson, Caleb, Ortiz, Anthony, Ferres, Juan M. Lavista, Anderson, Brandon, and Ho, Daniel E.
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Longitudinal studies are vital to understanding dynamic changes of the planet, but labels (e.g., buildings, facilities, roads) are often available only for a single point in time. We propose a general model, Temporal Cluster Matching (TCM), for detecting building changes in time series of remotely sensed imagery when footprint labels are observed only once. The intuition behind the model is that the relationship between spectral values inside and outside of building's footprint will change when a building is constructed (or demolished). For instance, in rural settings, the pre-construction area may look similar to the surrounding environment until the building is constructed. Similarly, in urban settings, the pre-construction areas will look different from the surrounding environment until construction. We further propose a heuristic method for selecting the parameters of our model which allows it to be applied in novel settings without requiring data labeling efforts (to fit the parameters). We apply our model over a dataset of poultry barns from 2016/2017 high-resolution aerial imagery in the Delmarva Peninsula and a dataset of solar farms from a 2020 mosaic of Sentinel 2 imagery in India. Our results show that our model performs as well when fit using the proposed heuristic as it does when fit with labeled data, and further, that supervised versions of our model perform the best among all the baselines we test against. Finally, we show that our proposed approach can act as an effective data augmentation strategy -- it enables researchers to augment existing structure footprint labels along the time dimension and thus use imagery from multiple points in time to train deep learning models. We show that this improves the spatial generalization of such models when evaluated on the same change detection task., Comment: Published in ACM COMPASS 2021
- Published
- 2021
78. Reducing bias and increasing utility by federated generative modeling of medical images using a centralized adversary
- Author
-
Rajotte, Jean-Francois, Mukherjee, Sumit, Robinson, Caleb, Ortiz, Anthony, West, Christopher, Ferres, Juan Lavista, and Ng, Raymond T
- Subjects
Statistics - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Machine Learning ,68W15 ,I.2.11 - Abstract
We introduce FELICIA (FEderated LearnIng with a CentralIzed Adversary) a generative mechanism enabling collaborative learning. In particular, we show how a data owner with limited and biased data could benefit from other data owners while keeping data from all the sources private. This is a common scenario in medical image analysis where privacy legislation prevents data from being shared outside local premises. FELICIA works for a large family of Generative Adversarial Networks (GAN) architectures including vanilla and conditional GANs as demonstrated in this work. We show that by using the FELICIA mechanism, a data owner with limited image samples can generate high-quality synthetic images with high utility while neither data owners has to provide access to its data. The sharing happens solely through a central discriminator that has access limited to synthetic data. Here, utility is defined as classification performance on a real test set. We demonstrate these benefits on several realistic healthcare scenarions using benchmark image datasets (MNIST, CIFAR-10) as well as on medical images for the task of skin lesion classification. With multiple experiments, we show that even in the worst cases, combining FELICIA with real data gracefully achieves performance on par with real data while most results significantly improves the utility., Comment: 10 pages, 10 figures
- Published
- 2021
79. Using Data Science and a Health Equity Lens to Identify Long-COVID Sequelae Among Medically Underserved Populations
- Author
-
Nasir, Md, Cook, Nicole, Parras, Daniel, Mukherjee, Sumit, Miller, Geralyn, Ferres, Juan Lavista, and Chung-Bridges, Katherine
- Published
- 2023
- Full Text
- View/download PDF
80. Machine Learning for Glacier Monitoring in the Hindu Kush Himalaya
- Author
-
Baraka, Shimaa, Akera, Benjamin, Aryal, Bibek, Sherpa, Tenzing, Shresta, Finu, Ortiz, Anthony, Sankaran, Kris, Ferres, Juan Lavista, Matin, Mir, and Bengio, Yoshua
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Glacier mapping is key to ecological monitoring in the hkh region. Climate change poses a risk to individuals whose livelihoods depend on the health of glacier ecosystems. In this work, we present a machine learning based approach to support ecological monitoring, with a focus on glaciers. Our approach is based on semi-automated mapping from satellite images. We utilize readily available remote sensing data to create a model to identify and outline both clean ice and debris-covered glaciers from satellite imagery. We also release data and develop a web tool that allows experts to visualize and correct model predictions, with the ultimate aim of accelerating the glacier mapping process., Comment: Accepted for a spotlight talk and a poster at the Tackling Climate Change with Machine Learning workshop at NeurIPS 2020
- Published
- 2020
81. Rapid building damage assessment workflow: An implementation for the 2023 Rolling Fork, Mississippi tornado event.
- Author
-
Caleb Robinson, Simone Fobi Nsutezo, Anthony Ortiz, Tina Sederholm, Rahul Dodhia, Cameron Birge, Kasie Richards, Kris Pitcher, Paulo Duarte, and Juan M. Lavista Ferres
- Published
- 2023
- Full Text
- View/download PDF
82. Poverty rate prediction using multi-modal survey and earth observation data.
- Author
-
Simone Fobi, Manuel Cardona, Elliott Collins, Caleb Robinson, Anthony Ortiz, Tina Sederholm, Rahul Dodhia, and Juan Lavista Ferres
- Published
- 2023
- Full Text
- View/download PDF
83. Smartphone Telemedicine Networks for Retinopathy of Prematurity (ROP) in Latin America : SP-ROP (Panamerican Society of ROP)
- Author
-
de Kartzow, Alejandro Vazquez, Acevedo, Pedro J., Saidman, Gabriela, Schbib, Vanina, Zuluaga, Claudia, Monteoliva, Guillermo, Carrascal, Marcelo, Salvatelli, Adrian, Patiño, Susana, Marmol, Juan, Ferres, Juan Lavista, Castellanos, Maria Ana Martinez, Yogesan, Kanagasingam, editor, Goldschmidt, Leonard, editor, Cuadros, Jorge, editor, and Ricur, Giselle, editor
- Published
- 2023
- Full Text
- View/download PDF
84. Writing to Understand and Being Understood: Basic Design Principles for Writing Instruction
- Author
-
Flores-Ferres, Magdalena, Van Weijen, Daphne, Van Ockenburg, Liselore, Ten Peze, Anouk, Alkema, Edith, Holdinga, Lieke, Rijlaarsdam, Gert, Spinillo, Alina Galvão, editor, and Sotomayor, Carmen, editor
- Published
- 2023
- Full Text
- View/download PDF
85. Modeling to explore and challenge inherent assumptions when cultural norms have changed: a case study on left-handedness and life expectancy
- Author
-
Juan Lavista Ferres, Md Nasir, Avleen Bijral, S V Subramanian, and William B Weeks
- Subjects
Epidemiology ,Modeling ,Handedness ,Life expectancy ,Public aspects of medicine ,RA1-1270 - Abstract
Abstract Background In 1991, Halpern and Coren claimed that left-handed people die nine years younger than right-handed people. Most subsequent studies did not find support for the difference in age of death or its magnitude, primarily because of the realization that there have been historical changes in reported rates of left-handedness. Methods We created a model that allowed us to determine whether the historical change in left-handedness explains the original finding of a nine-year difference in life expectancy. We calculated all deaths in the United States by birth year, gender, and handedness for 1989 (the Halpern and Coren study was based on data from that year) and contrasted those findings with the modeled age of death by reported and counterfactual estimated handedness for each birth year, 1900–1989. Results In 1989, 2,019,512 individuals died, of which 6.4% were reportedly left-handed based on concurrent annual handedness reporting. However, it is widely believed that cultural pressures may have caused an underestimation of the true rate of left-handedness. Using a simulation that assumed no age of death difference between left-handed and right-handed individuals in this cohort and adjusting for the reported rates of left-handedness, we found that left-handed individuals were expected to die 9.3 years earlier than their right-handed counterparts due to changes in the rate of left-handedness over time. This difference of 9.3 years was not found to be statistically significant compared to the 8.97 years reported by Halpern and Coren. When we assumed no change in the rate of left-handedness over time, the survival advantage for right-handed individuals was reduced to 0.02 years, solely driven by not controlling for gender. When we considered the estimated age of death for each birth cohort, we found a mean difference of 0.43 years between left-handed and right-handed individuals, also driven by handedness difference by gender. Conclusion We found that the changing rate of left-handedness reporting over the years entirely explains the originally reported observation of nine-year difference in life expectancy. In epidemiology, new information on past reporting biases could warrant re-exploration of initial findings. The simulation modeling approach that we use here might facilitate such analyses.
- Published
- 2023
- Full Text
- View/download PDF
86. Consensus document on the management of febrile neutropenia in paediatric haematology and oncology patients of the Spanish Society of Pediatric Infectious Diseases (SEIP) and the Spanish Society of Pediatric Hematology and Oncology (SEHOP)
- Author
-
Leticia Martínez Campos, Paula Pérez-Albert, Laia Ferres Ramis, Elena María Rincón-López, Natalia Mendoza-Palomar, Pere Soler-Palacin, and David Aguilera-Alonso
- Subjects
Neutropenia febril inducida por quimioterapia ,Neoplasias ,Pediatría ,Infección fúngica invasiva ,Antibióticos ,Antifúngicos ,Pediatrics ,RJ1-570 - Abstract
Febrile neutropenia is one of the main infectious complications experienced by paediatric patients with blood or solid tumours, which, despite the advances in diagnosis and treatment, are still associated with a significant morbidity and mortality. These patients have several risk factors for infection, chief of which are chemotherapy-induced neutropenia, the disruption of cutaneous and mucosal barriers and the use of intravascular devices. Early diagnosis and treatment of febrile neutropenia episodes based on the patient’s characteristics is essential in patients with blood and solid tumours to improve their outcomes. Therefore, it is important to develop protocols in order to optimise and standardise its management. In addition, the rational use of antibiotics, with careful adjustment of the duration of treatment and antimicrobial spectrum, is crucial to address the increase in antimicrobial drug resistance.The aim of this document, developed jointly by the Spanish Society of Pediatric Infectious Diseases and the Spanish Society of Pediatric Hematology and Oncology, is to provide consensus recommendations for the management of febrile neutropenia in paediatric oncology and haematology patients, including the initial evaluation, the stepwise approach to its treatment, supportive care and invasive fungal infection, which each facility then needs to adapt to the characteristics of its patients and local epidemiological trends. Resumen: La neutropenia febril es una de las principales complicaciones infecciosas que sufren los pacientes pediátricos oncohematológicos, y a pesar los avances en diagnóstico y tratamiento, siguen condicionando una mortalidad y morbilidad significativa. Estos pacientes agrupan una serie de factores de riesgo de infección, donde destaca la neutropenia asociada a quimioterapia, la disrupción de barreras cutáneo-mucosas y el uso de dispositivos intravasculares. El abordaje diagnóstico y terapéutico precoz de los episodios de neutropenia febril en los pacientes oncohematológicos, ajustado a las características individuales de cada paciente, es fundamental para mejorar su pronóstico. Por ello, diseñar protocolos de abordaje, que sistematicen su atención, permite optimizar y homogeneizar su abordaje. Además, racionalizar el uso de los antimicrobianos, ajustando la duración y el espectro de los mismos, es crucial para hacer frente al incremento de resistencias a antimicrobianos.El objetivo de este documento, elaborado entre la Sociedad Española de Infectología Pediátrica y la Sociedad Española de Hematología y Oncología Pediátrica, es dar recomendaciones de consenso sobre el manejo de la neutropenia febril en el paciente oncohematológico, respecto al abordaje inicial, terapia secuencial y de soporte e infección fúngica invasiva, que cada centro debe adaptar a las características de sus pacientes y epidemiología local.
- Published
- 2023
- Full Text
- View/download PDF
87. Assessment of differentially private synthetic data for utility and fairness in end-to-end machine learning pipelines for tabular data.
- Author
-
Mayana Pereira, Meghana Kshirsagar, Sumit Mukherjee, Rahul Dodhia, Juan Lavista Ferres, and Rafael de Sousa
- Subjects
Medicine ,Science - Abstract
Differentially private (DP) synthetic datasets are a solution for sharing data while preserving the privacy of individual data providers. Understanding the effects of utilizing DP synthetic data in end-to-end machine learning pipelines impacts areas such as health care and humanitarian action, where data is scarce and regulated by restrictive privacy laws. In this work, we investigate the extent to which synthetic data can replace real, tabular data in machine learning pipelines and identify the most effective synthetic data generation techniques for training and evaluating machine learning models. We systematically investigate the impacts of differentially private synthetic data on downstream classification tasks from the point of view of utility as well as fairness. Our analysis is comprehensive and includes representatives of the two main types of synthetic data generation algorithms: marginal-based and GAN-based. To the best of our knowledge, our work is the first that: (i) proposes a training and evaluation framework that does not assume that real data is available for testing the utility and fairness of machine learning models trained on synthetic data; (ii) presents the most extensive analysis of synthetic dataset generation algorithms in terms of utility and fairness when used for training machine learning models; and (iii) encompasses several different definitions of fairness. Our findings demonstrate that marginal-based synthetic data generators surpass GAN-based ones regarding model training utility for tabular data. Indeed, we show that models trained using data generated by marginal-based algorithms can exhibit similar utility to models trained using real data. Our analysis also reveals that the marginal-based synthetic data generated using AIM and MWEM PGM algorithms can train models that simultaneously achieve utility and fairness characteristics close to those obtained by models trained with real data.
- Published
- 2024
- Full Text
- View/download PDF
88. An Open One-Step RT-qPCR for SARS-CoV-2 detection.
- Author
-
Ariel Cerda, Maira Rivera, Grace Armijo, Catalina Ibarra-Henriquez, Javiera Reyes, Paula Blázquez-Sánchez, Javiera Avilés, Aníbal Arce, Aldo Seguel, Alexander J Brown, Yesseny Vásquez, Marcelo Cortez-San Martín, Francisco A Cubillos, Patricia García, Marcela Ferres, César A Ramírez-Sarmiento, Fernán Federici, and Rodrigo A Gutiérrez
- Subjects
Medicine ,Science - Abstract
The COVID-19 pandemic has resulted in millions of deaths globally, and while several diagnostic systems were proposed, real-time reverse transcription polymerase chain reaction (RT-PCR) remains the gold standard. However, diagnostic reagents, including enzymes used in RT-PCR, are subject to centralized production models and intellectual property restrictions, which present a challenge for less developed countries. With the aim of generating a standardized One-Step open RT-qPCR protocol to detect SARS-CoV-2 RNA in clinical samples, we purified and tested recombinant enzymes and a non-proprietary buffer. The protocol utilized M-MLV RT and Taq DNA pol enzymes to perform a Taqman probe-based assay. Synthetic RNA samples were used to validate the One-Step RT-qPCR components, demonstrating sensitivity comparable to a commercial kit routinely employed in clinical settings for patient diagnosis. Further evaluation on 40 clinical samples (20 positive and 20 negative) confirmed its comparable diagnostic accuracy. This study represents a proof of concept for an open approach to developing diagnostic kits for viral infections and diseases, which could provide a cost-effective and accessible solution for less developed countries.
- Published
- 2024
- Full Text
- View/download PDF
89. A dataset to assess mobility changes in Chile following local quarantines
- Author
-
Pappalardo, Luca, Cornacchia, Giuliano, Navarro, Victor, Bravo, Loreto, and Ferres, Leo
- Subjects
Physics - Physics and Society - Abstract
Fighting the COVID-19 pandemic, most countries have implemented non-pharmaceutical interventions like wearing masks, physical distancing, lockdown, and travel restrictions. Because of their economic and logistical effects, tracking mobility changes during quarantines is crucial in assessing their efficacy and predicting the virus spread. Chile, one of the worst-hit countries in the world, unlike many other countries, implemented quarantines at a more localized level, shutting down small administrative zones, rather than the whole country or large regions. Given the non-obvious effects of these localized quarantines, tracking mobility becomes even more critical in Chile. To assess the impact on human mobility of the localized quarantines in Chile, we analyze a mobile phone dataset made available by Telef\'onica Chile, which comprises 31 billion eXtended Detail Records and 5.4 million users covering the period February 26th to September 20th, 2020. From these records, we derive three epidemiologically relevant metrics describing the mobility within and between comunas. The datasets made available can be used to fight the COVID-19 epidemics, particularly for localized quarantines' less understood effect.
- Published
- 2020
90. An individual-level ground truth dataset for home location detection
- Author
-
Pappalardo, Luca, Ferres, Leo, Sacasa, Manuel, Cattuto, Ciro, and Bravo, Loreto
- Subjects
Computer Science - Computers and Society ,Physics - Physics and Society - Abstract
Home detection, assigning a phone device to its home antenna, is a ubiquitous part of most studies in the literature on mobile phone data. Despite its widespread use, home detection relies on a few assumptions that are difficult to check without ground truth, i.e., where the individual that owns the device resides. In this paper, we provide an unprecedented evaluation of the accuracy of home detection algorithms on a group of sixty-five participants for whom we know their exact home address and the antennas that might serve them. Besides, we analyze not only Call Detail Records (CDRs) but also two other mobile phone streams: eXtended Detail Records (XDRs, the ``data'' channel) and Control Plane Records (CPRs, the network stream). These data streams vary not only in their temporal granularity but also they differ in the data generation mechanism', e.g., CDRs are purely human-triggered while CPR is purely machine-triggered events. Finally, we quantify the amount of data that is needed for each stream to carry out successful home detection for each stream. We find that the choice of stream and the algorithm heavily influences home detection, with an hour-of-day algorithm for the XDRs performing the best, and with CPRs performing best for the amount of data needed to perform home detection. Our work is useful for researchers and practitioners in order to minimize data requests and to maximize the accuracy of home antenna location.
- Published
- 2020
91. Improving Lesion Detection by exploring bias on Skin Lesion dataset
- Author
-
Trivedi, Anusua, Muppalla, Sreya, Pathak, Shreyaan, Mobasher, Azadeh, Janowski, Pawel, Dodhia, Rahul, and Ferres, Juan M. Lavista
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
All datasets contain some biases, often unintentional, due to how they were acquired and annotated. These biases distort machine-learning models' performance, creating spurious correlations that the models can unfairly exploit, or, contrarily destroying clear correlations that the models could learn. With the popularity of deep learning models, automated skin lesion analysis is starting to play an essential role in the early detection of Melanoma. The ISIC Archive is one of the most used skin lesion sources to benchmark deep learning-based tools. Bissoto et al. experimented with different bounding-box based masks and showed that deep learning models could classify skin lesion images without clinically meaningful information in the input data. Their findings seem confounding since the ablated regions (random rectangular boxes) are not significant. The shape of the lesion is a crucial factor in the clinical characterization of a skin lesion. In that context, we performed a set of experiments that generate shape-preserving masks instead of rectangular bounding-box based masks. A deep learning model trained on these shape-preserving masked images does not outperform models trained on images without clinically meaningful information. That strongly suggests spurious correlations guiding the models. We propose use of general adversarial network (GAN) to mitigate the underlying bias.
- Published
- 2020
92. MACE: A Flexible Framework for Membership Privacy Estimation in Generative Models
- Author
-
Xu, Yixi, Mukherjee, Sumit, Liu, Xiyang, Tople, Shruti, Dodhia, Rahul, and Ferres, Juan Lavista
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Machine Learning - Abstract
Generative machine learning models are being increasingly viewed as a way to share sensitive data between institutions. While there has been work on developing differentially private generative modeling approaches, these approaches generally lead to sub-par sample quality, limiting their use in real world applications. Another line of work has focused on developing generative models which lead to higher quality samples but currently lack any formal privacy guarantees. In this work, we propose the first formal framework for membership privacy estimation in generative models. We formulate the membership privacy risk as a statistical divergence between training samples and hold-out samples, and propose sample-based methods to estimate this divergence. Compared to previous works, our framework makes more realistic and flexible assumptions. First, we offer a generalizable metric as an alternative to the accuracy metric especially for imbalanced datasets. Second, we loosen the assumption of having full access to the underlying distribution from previous studies , and propose sample-based estimations with theoretical guarantees. Third, along with the population-level membership privacy risk estimation via the optimal membership advantage, we offer the individual-level estimation via the individual privacy risk. Fourth, our framework allows adversaries to access the trained model via a customized query, while prior works require specific attributes.
- Published
- 2020
93. privGAN: Protecting GANs from membership inference attacks at low cost
- Author
-
Mukherjee, Sumit, Xu, Yixi, Trivedi, Anusua, and Ferres, Juan Lavista
- Subjects
Computer Science - Machine Learning ,Computer Science - Cryptography and Security ,Computer Science - Computer Vision and Pattern Recognition ,Statistics - Machine Learning - Abstract
Generative Adversarial Networks (GANs) have made releasing of synthetic images a viable approach to share data without releasing the original dataset. It has been shown that such synthetic data can be used for a variety of downstream tasks such as training classifiers that would otherwise require the original dataset to be shared. However, recent work has shown that the GAN models and their synthetically generated data can be used to infer the training set membership by an adversary who has access to the entire dataset and some auxiliary information. Current approaches to mitigate this problem (such as DPGAN) lead to dramatically poorer generated sample quality than the original non--private GANs. Here we develop a new GAN architecture (privGAN), where the generator is trained not only to cheat the discriminator but also to defend membership inference attacks. The new mechanism provides protection against this mode of attack while leading to negligible loss in downstream performances. In addition, our algorithm has been shown to explicitly prevent overfitting to the training set, which explains why our protection is so effective. The main contributions of this paper are: i) we propose a novel GAN architecture that can generate synthetic data in a privacy preserving manner without additional hyperparameter tuning and architecture selection, ii) we provide a theoretical understanding of the optimal solution of the privGAN loss function, iii) we demonstrate the effectiveness of our model against several white and black--box attacks on several benchmark datasets, iv) we demonstrate on three common benchmark datasets that synthetic images generated by privGAN lead to negligible loss in downstream performance when compared against non--private GANs.
- Published
- 2019
94. Prevalencia y características clínicas del dolor en pacientes con enfermedad crónica avanzada
- Author
-
Ballarín Castany, Angels, Serrà Rigol, Thaïs, Cereceda Ferrés, M., Serrarols Soldevila, M., Oller Piqué, Ramon, and Gómez-Batiste, Xavier
- Published
- 2023
- Full Text
- View/download PDF
95. Evaluating the generalizability of deep learning image classification algorithms to detect middle ear disease using otoscopy
- Author
-
Al-Rahim Habib, Yixi Xu, Kris Bock, Shrestha Mohanty, Tina Sederholm, William B. Weeks, Rahul Dodhia, Juan Lavista Ferres, Chris Perry, Raymond Sacks, and Narinder Singh
- Subjects
Medicine ,Science - Abstract
Abstract To evaluate the generalizability of artificial intelligence (AI) algorithms that use deep learning methods to identify middle ear disease from otoscopic images, between internal to external performance. 1842 otoscopic images were collected from three independent sources: (a) Van, Turkey, (b) Santiago, Chile, and (c) Ohio, USA. Diagnostic categories consisted of (i) normal or (ii) abnormal. Deep learning methods were used to develop models to evaluate internal and external performance, using area under the curve (AUC) estimates. A pooled assessment was performed by combining all cohorts together with fivefold cross validation. AI-otoscopy algorithms achieved high internal performance (mean AUC: 0.95, 95%CI: 0.80–1.00). However, performance was reduced when tested on external otoscopic images not used for training (mean AUC: 0.76, 95%CI: 0.61–0.91). Overall, external performance was significantly lower than internal performance (mean difference in AUC: −0.19, p ≤ 0.04). Combining cohorts achieved a substantial pooled performance (AUC: 0.96, standard error: 0.01). Internally applied algorithms for otoscopy performed well to identify middle ear disease from otoscopy images. However, external performance was reduced when applied to new test cohorts. Further efforts are required to explore data augmentation and pre-processing techniques that might improve external performance and develop a robust, generalizable algorithm for real-world clinical applications.
- Published
- 2023
- Full Text
- View/download PDF
96. Predicting locations of cryptic pockets from single protein structures using the PocketMiner graph neural network
- Author
-
Artur Meller, Michael Ward, Jonathan Borowsky, Meghana Kshirsagar, Jeffrey M. Lotthammer, Felipe Oviedo, Juan Lavista Ferres, and Gregory R. Bowman
- Subjects
Science - Abstract
Cryptic pockets enable targeting of proteins currently considered undruggable because they lack pockets in their ground state structures. Here, the authors develop a graph neural network that accurately predicts cryptic pockets in static structures by training using molecular simulation data alone.
- Published
- 2023
- Full Text
- View/download PDF
97. Hantavirus in humans: a review of clinical aspects and management
- Author
-
Vial, Pablo A, Ferrés, Marcela, Vial, Cecilia, Klingström, Jonas, Ahlm, Clas, López, René, Le Corre, Nicole, and Mertz, Gregory J
- Published
- 2023
- Full Text
- View/download PDF
98. Risks of Using Non-verified Open Data: A case study on using Machine Learning techniques for predicting Pregnancy Outcomes in India
- Author
-
Trivedi, Anusua, Mukherjee, Sumit, Tse, Edmund, Ewing, Anne, and Ferres, Juan Lavista
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Statistics - Machine Learning - Abstract
Artificial intelligence (AI) has evolved considerably in the last few years. While applications of AI is now becoming more common in fields like retail and marketing, application of AI in solving problems related to developing countries is still an emerging topic. Specially, AI applications in resource-poor settings remains relatively nascent. There is a huge scope of AI being used in such settings. For example, researchers have started exploring AI applications to reduce poverty and deliver a broad range of critical public services. However, despite many promising use cases, there are many dataset related challenges that one has to overcome in such projects. These challenges often take the form of missing data, incorrectly collected data and improperly labeled variables, among other factors. As a result, we can often end up using data that is not representative of the problem we are trying to solve. In this case study, we explore the challenges of using such an open dataset from India, to predict an important health outcome. We highlight how the use of AI without proper understanding of reporting metrics can lead to erroneous conclusions., Comment: Presented at NeurIPS 2019 Workshop on Machine Learning for the Developing World
- Published
- 2019
99. Endoscopic Transorbital Approach for the Management of Spheno-Orbital Meningiomas: Literature Review and Preliminary Experience
- Author
-
Di Somma, Alberto, De Rosa, Andrea, Ferrés, Abel, Mosteiro, Alejandra, Guizzardi, Giulia, Fassi, Jessica Matas, Topczewski, Thomaz E., Reyes, Luis, Roldán, Pedro, Torné, Ramon, Alobid, Isam, and Enseñat, Joaquim
- Published
- 2023
- Full Text
- View/download PDF
100. News and the city: understanding online press consumption patterns through mobile data
- Author
-
Vilella, Salvatore, Paolotti, Daniela, Ruffo, Giancarlo, and Ferres, Leo
- Subjects
Computer Science - Computers and Society ,Physics - Physics and Society - Abstract
The always increasing mobile connectivity affects every aspect of our daily lives, including how and when we keep ourselves informed and consult news media. By studying a DPI (deep packet inspection) dataset, provided by one of the major Chilean telecommunication companies, we investigate how different cohorts of the population of Santiago De Chile consume news media content through their smartphones. We find that some socio-demographic attributes are highly associated to specific news media consumption patterns. In particular, education and age play a significant role in shaping the consumers behaviour even in the digital context, in agreement with a large body of literature on off-line media distribution channels.
- Published
- 2019
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.