Author: "Rudin A" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Rudin A"' showing total 21,766 results

1. Interpretable Image Classification with Adaptive Prototype-based Vision Transformers

Author: Ma, Chiyu, Donnelly, Jon, Liu, Wenjun, Vosoughi, Soroush, Rudin, Cynthia, and Chen, Chaofan
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We present ProtoViT, a method for interpretable image classification combining deep learning and case-based reasoning. This method classifies an image by comparing it to a set of learned prototypes, providing explanations of the form "this looks like that." In our model, a prototype consists of parts, which can deform over irregular geometries to create a better comparison between images. Unlike existing models that rely on Convolutional Neural Network (CNN) backbones and spatially rigid prototypes, our model integrates Vision Transformer (ViT) backbones into prototype based models, while offering spatially deformed prototypes that not only accommodate geometric variations of objects but also provide coherent and clear prototypical feature representations with an adaptive number of prototypical parts. Our experiments show that our model can generally achieve higher performance than the existing prototype based models. Our comprehensive analyses ensure that the prototypes are consistent and the interpretations are faithful.
Published: 2024

2. Improving Decision Sparsity

Author: Sun, Yiyang, Wang, Tong, and Rudin, Cynthia
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Sparsity is a central aspect of interpretability in machine learning. Typically, sparsity is measured in terms of the size of a model globally, such as the number of variables it uses. However, this notion of sparsity is not particularly relevant for decision-making; someone subjected to a decision does not care about variables that do not contribute to the decision. In this work, we dramatically expand a notion of decision sparsity called the Sparse Explanation Value (SEV) so that its explanations are more meaningful. SEV considers movement along a hypercube towards a reference point. By allowing flexibility in that reference and by considering how distances along the hypercube translate to distances in feature space, we can derive sparser and more meaningful explanations for various types of function classes. We present cluster-based SEV and its variant tree-based SEV, introduce a method that improves credibility of explanations, and propose algorithms that optimize decision sparsity in machine learning models.
Comment: Accepted to 38th Conference on Neural Information Processing Systems (NeurIPS 2024)
Published: 2024

3. FastSurvival: Hidden Computational Blessings in Training Cox Proportional Hazards Models

Author: Liu, Jiachang, Zhang, Rui, and Rudin, Cynthia
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Survival analysis is an important research topic with applications in healthcare, business, and manufacturing. One essential tool in this area is the Cox proportional hazards (CPH) model, which is widely used for its interpretability, flexibility, and predictive performance. However, for modern data science challenges such as high dimensionality (both $n$ and $p$) and high feature correlations, current algorithms to train the CPH model have drawbacks, preventing us from using the CPH model at its full potential. The root cause is that the current algorithms, based on the Newton method, have trouble converging due to vanishing second order derivatives when outside the local region of the minimizer. To circumvent this problem, we propose new optimization methods by constructing and minimizing surrogate functions that exploit hidden mathematical structures of the CPH model. Our new methods are easy to implement and ensure monotonic loss decrease and global convergence. Empirically, we verify the computational efficiency of our methods. As a direct application, we show how our optimization methods can be used to solve the cardinality-constrained CPH problem, producing very sparse high-quality models that were not previously practical to construct. We list several extensions that our breakthrough enables, including optimization opportunities, theoretical questions on CPH's mathematical structure, as well as other CPH-related applications.
Comment: Accepted into NeurIPS 2024
Published: 2024

4. Students with Prior Anatomy Experience Start out Stronger in Medical School Gross Anatomy

Author: Marlene D. Louro, Grace Meegan, Lauren R. Rudin, Michael C. Granatosky, and Nathan E. Thompson
Abstract: Gross anatomy is a crucial course in medical school; it sets the foundation for future coursework and is highly valued by clinicians. While both medical students and faculty recognize the importance of pre-medical school anatomy experience, few medical schools require it as a prerequisite. Consequently, medical school gross anatomy courses have a diverse range of prior anatomy experience among students. Prior studies have shown mixed results regarding the impact of pre-medical school anatomy experience on medical school gross anatomy performance, often using final exam scores as the metric of analysis. In this study, we investigated the relationship between pre-medical school anatomy experience and medical school gross anatomy performance among students at New York Institute of Technology, College of Osteopathic Medicine. We surveyed students from all four matriculated years and analyzed their individual anatomy laboratory exam scores and final anatomy laboratory course scores. We found that students with prior anatomy experience performed significantly better on the first anatomy laboratory exam, leading to an overall positive effect on their final anatomy laboratory score. However, this advantage seemed to diminish in subsequent exams, suggesting that students without prior experience rapidly adjusted to the course challenges. Students with prior anatomy experience felt more prepared for the anatomy course, reported lower stress levels, and believed they had an advantage over peers without prior experience. Our study highlights the importance of pre-medical school anatomy experience, particularly for early performance in the anatomy course.
Published: 2024

5. Concept, Ritual, and Narrative: What Are Toddler Books?

Author: Shai Rudin
Abstract: Research on children's literature focuses on preschool children's books (children aged 3-6) and on juvenile fiction, yet is lacking in studies on toddler books (ages 0-3). In this paper, I present a definition of the genre of "toddler books" (often referred to as "boardbooks," despite it including non-boardbooks), while focusing on the illustrational aspects, that are at least as dominant as the written texts, if not more so. I also examine the plots and themes that are presented in toddlers' books, distinguishing between "concept books" (lexical books with no narrative), "ritual books" (which engage in instilling certain habits), and "narrative books" (toddlers' books that depict a complex narrative and serve as preparation for prereader books). The findings of the study indicate that toddlers' books are often infants' first encounter with both narrative thought and visual art, hence their significant place in their development.
Published: 2024

6. Phononic materials with effectively scale-separated hierarchical features using interpretable machine learning

Author: Bastawrous, Mary V., Chen, Zhi, Ogren, Alexander C., Daraio, Chiara, Rudin, Cynthia, and Brinson, L. Catherine
Subjects: Physics - Applied Physics, Computer Science - Machine Learning
Abstract: Manipulating the dispersive characteristics of vibrational waves is beneficial for many applications, e.g., high-precision instruments. Architected hierarchical phononic materials have shown promise for tunability of elastodynamic waves and vibrations over multiple frequency ranges. In this article, hierarchical unit-cells are obtained, where features at each length scale result in a band gap within a targeted frequency range. Our novel approach, the "hierarchical unit-cell template method," is an interpretable machine-learning approach that uncovers global unit-cell shape/topology patterns corresponding to predefined band-gap objectives. A scale-separation effect is observed where the coarse-scale band-gap objective is mostly unaffected by the fine-scale features despite the closeness of their length scales, thus enabling an efficient hierarchical algorithm. Moreover, the hierarchical patterns revealed are not predefined or self-similar hierarchies as common in current hierarchical phononic materials. Thus, our approach offers a flexible and efficient method for the exploration of new regions in the hierarchical design space, extracting minimal effective patterns for inverse design in applications targeting multiple frequency ranges.
Published: 2024

7. A New Dataset, Notation Software, and Representation for Computational Schenkerian Analysis

Author: Ni-Hahn, Stephen, Xu, Weihan, Yin, Jerry, Zhu, Rico, Mak, Simon, Jiang, Yue, and Rudin, Cynthia
Subjects: Computer Science - Sound, Computer Science - Artificial Intelligence
Abstract: Schenkerian Analysis (SchA) is a uniquely expressive method of music analysis, combining elements of melody, harmony, counterpoint, and form to describe the hierarchical structure supporting a work of music. However, despite its powerful analytical utility and potential to improve music understanding and generation, SchA has rarely been utilized by the computer music community. This is in large part due to the paucity of available high-quality data in a computer-readable format. With a larger corpus of Schenkerian data, it may be possible to infuse machine learning models with a deeper understanding of musical structure, thus leading to more "human" results. To encourage further research in Schenkerian analysis and its potential benefits for music informatics and generation, this paper presents three main contributions: 1) a new and growing dataset of SchAs, the largest in human- and computer-readable formats to date (>140 excerpts), 2) a novel software for visualization and collection of SchA data, and 3) a novel, flexible representation of SchA as a heterogeneous-edge graph data structure.
Published: 2024

8. Designing Pu Intermetallics with First Principle Calculations

Author: Cook, Matthew S., Arellano, David C., Prada, Derek V., Rudin, Sven P., Bauer, Eric D., and Phelan, W. Adam
Subjects: Condensed Matter - Materials Science, Condensed Matter - Strongly Correlated Electrons
Abstract: We present the ab initio supported discovery of two new Pu based intermetallic compounds, PuNiSn and PuPtSn. Using density functional theory, the formation energies within the relevant ternary phase diagrams were calculated to predict the stability of both compounds. Simultaneously, Pu-Ni-Sn and Pu-Pt-Sn materials were arc-melted and subsequently characterized with magnetization, specific heat, and resistivity measurements from 2-300 K. Magnetization measurements show that PuNiSn and PuPtSn order antiferromagnetically at TN = 11 K and TN = 15 K, respectively. Specific heat measurements show an enhanced residual electronic specific heat that is indicative of strong electron correlations. Resistivity measurements are indicative of Kondo behavior for PuNiSn while crystal field effects may play a role in the observed temperature dependence for PuPtSn.
Published: 2024

9. Axion Interactions with Domain and Bubble Walls

Author: Garcia, Isabel Garcia and Petrossian-Byrne, Rudin
Subjects: High Energy Physics - Phenomenology, Astrophysics - Cosmology and Nongalactic Astrophysics, High Energy Physics - Theory
Abstract: We show that interactions between axion-like particles (ALPs) and co-dimension one defects, such as phase-transition bubble walls and solitonic domain walls, can lead to important changes in the evolution of both walls and ALPs. The leading effect arises from the change in the ALP decay constant across the interface, which naturally follows from shift-symmetric interactions with the corresponding order parameter. Specifically, we show that for thin walls moving relativistically, an ALP background (such as axion dark matter) gives rise to a frictional force on the interface that is proportional to $\gamma^2$, with $\gamma$ the Lorentz factor of the wall, and that this effect is present in both the oscillating and frozen axion regimes. We explore the broader consequences of this effect for bubble and domain walls in the early universe, and show that this source of friction can be present even in the absence of a conventional medium such as radiation or matter. Possible implications include modifications to the dynamics of bubble and domain walls and their corresponding gravitational wave signatures, as well as the generation of a dark radiation component of ALPs in the form of ultra-relativistic "axion shells" with Lorentz factor $\gamma_\text{shell} \simeq 2\gamma^2 \gg 1$ that may remain relativistic until the present day.
Comment: 35 pages, 5 figures
Published: 2024

10. Amazing Things Come From Having Many Good Models

Author: Rudin, Cynthia, Zhong, Chudi, Semenova, Lesia, Seltzer, Margo, Parr, Ronald, Liu, Jiachang, Katta, Srikar, Donnelly, Jon, Chen, Harry, and Boner, Zachery
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: The Rashomon Effect, coined by Leo Breiman, describes the phenomenon that there exist many equally good predictive models for the same dataset. This phenomenon happens for many real datasets and when it does, it sparks both magic and consternation, but mostly magic. In light of the Rashomon Effect, this perspective piece proposes reshaping the way we think about machine learning, particularly for tabular data problems in the nondeterministic (noisy) setting. We address how the Rashomon Effect impacts (1) the existence of simple-yet-accurate models, (2) flexibility to address user preferences, such as fairness and monotonicity, without losing performance, (3) uncertainty in predictions, fairness, and explanations, (4) reliable variable importance, (5) algorithm choice, specifically, providing advanced knowledge of which algorithms might be suitable for a given problem, and (6) public policy. We also discuss a theory of when the Rashomon Effect occurs and why. Our goal is to illustrate how the Rashomon Effect can have a massive impact on the use of machine learning for complex problems in society.
Published: 2024

11. This Looks Better than That: Better Interpretable Models with ProtoPNeXt

Author: Willard, Frank, Moffett, Luke, Mokel, Emmanuel, Donnelly, Jon, Guo, Stark, Yang, Julia, Kim, Giyoung, Barnett, Alina Jade, and Rudin, Cynthia
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Prototypical-part models are a popular interpretable alternative to black-box deep learning models for computer vision. However, they are difficult to train, with high sensitivity to hyperparameter tuning, inhibiting their application to new datasets and our understanding of which methods truly improve their performance. To facilitate the careful study of prototypical-part networks (ProtoPNets), we create a new framework for integrating components of prototypical-part models -- ProtoPNeXt. Using ProtoPNeXt, we show that applying Bayesian hyperparameter tuning and an angular prototype similarity metric to the original ProtoPNet is sufficient to produce new state-of-the-art accuracy for prototypical-part models on CUB-200 across multiple backbones. We further deploy this framework to jointly optimize for accuracy and prototype interpretability as measured by metrics included in ProtoPNeXt. Using the same resources, this produces models with substantially superior semantics and changes in accuracy between +1.3% and -1.5%. The code and trained models will be made publicly available upon publication.
Published: 2024

12. FPN-IAIA-BL: A Multi-Scale Interpretable Deep Learning Model for Classification of Mass Margins in Digital Mammography

Author: Yang, Julia, Barnett, Alina Jade, Donnelly, Jon, Kishore, Satvik, Fang, Jerry, Schwartz, Fides Regina, Chen, Chaofan, Lo, Joseph Y., and Rudin, Cynthia
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Digital mammography is essential to breast cancer detection, and deep learning offers promising tools for faster and more accurate mammogram analysis. In radiology and other high-stakes environments, uninterpretable ("black box") deep learning models are unsuitable and there is a call in these fields to make interpretable models. Recent work in interpretable computer vision provides transparency to these formerly black boxes by utilizing prototypes for case-based explanations, achieving high accuracy in applications including mammography. However, these models struggle with precise feature localization, reasoning on large portions of an image when only a small part is relevant. This paper addresses this gap by proposing a novel multi-scale interpretable deep learning model for mammographic mass margin classification. Our contribution not only offers an interpretable model with reasoning aligned with radiologist practices, but also provides a general architecture for computer vision with user-configurable prototypes from coarse- to fine-grained prototypes.
Comment: 8 pages, 6 figures, Accepted for oral presentation at the 2024 CVPR Workshop on Domain adaptation, Explainability, Fairness in AI for Medical Image Analysis (DEF-AI-MIA)
Published: 2024

13. SiamQuality: a ConvNet-based foundation model for photoplethysmography signals.

Author: Ding, Cheng, Guo, Zhicheng, Chen, Zhaoliang, Lee, Randall, Rudin, Cynthia, and Hu, Xiao
Subjects: PPG signal quality, foundation model, physiological data, Photoplethysmography, Humans, Signal Processing, Computer-Assisted, Neural Networks, Computer
Abstract: Objective. Physiological data are often low quality, which compromises the effectiveness of related health monitoring. The primary goal of this study is to develop a robust foundation model that can effectively handle low-quality physiological data. Approach. We introduce SiamQuality, a self-supervised learning approach using convolutional neural networks (CNNs) as the backbone. SiamQuality learns to generate similar representations for both high and low quality photoplethysmography (PPG) signals that originate from similar physiological states. We leveraged a substantial dataset of PPG signals from hospitalized intensive care patients, comprising over 36 million 30 s PPG pairs. Main results. After pre-training, the SiamQuality model was fine-tuned and tested on six PPG downstream tasks focusing on cardiovascular monitoring. Notably, in tasks such as respiratory rate estimation and atrial fibrillation detection, the model's performance exceeded the state-of-the-art by 75% and 5%, respectively. The results highlight the effectiveness of our model across all evaluated tasks, demonstrating significant improvements, especially in applications for heart monitoring on wearable devices. Significance. This study underscores the potential of CNNs as a robust backbone for foundation models tailored to physiological data, emphasizing their capability to maintain performance despite variations in data quality. The success of the SiamQuality model in handling real-world, variable-quality data opens new avenues for the development of more reliable and efficient healthcare monitoring technologies.
Published: 2024

14. NLO friction in symmetry restoring phase transitions

Author: Azatov, Aleksandr, Barni, Giulio, and Petrossian-Byrne, Rudin
Subjects: High Energy Physics - Phenomenology, Astrophysics - Cosmology and Nongalactic Astrophysics, High Energy Physics - Theory
Abstract: Interactions between bubbles/domain walls and the surrounding medium are a topic of active research, particularly as they apply to friction effects on accelerated expansion during first-order phase transitions. In this paper, we analyze for the first time friction pressure on relativistic walls in phase transitions where gauge symmetry is restored, particularly motivated by the observation that this pressure can, in principle, be negative at leading order, since some particles lose mass by definition as they cross into the new phase. We find, however, that at NLO, the soft emission of vectors from a charged current leads to positive pressure scaling as the wall's Lorentz boost factor $\gamma_w$, similar to the case of gauge symmetry breaking. Contrary to the latter case, we find that the dominant contribution in single emission is safe from IR divergences and exhibits a much stronger dependence on the wall shape. Finally, we argue that in any phase transition, no multi-particle process on the wall can impart negative pressure greater than the leading order result, in the asymptotic limit of large velocity.
Comment: 17 pages + appendices
Published: 2024

15. A noncollinear density functional theory ansatz for the phononic and thermodynamic properties of $\alpha$-Pu

Author: Muñoz, Alexander R., Phelan, W. Adam, Cook, Matthew S., Chappell, Greta L., Tobash, Paul H., Arellano, David C., Prada, Derek V., Jones, Travis E., and Rudin, Sven P.
Subjects: Condensed Matter - Materials Science, Condensed Matter - Strongly Correlated Electrons
Abstract: Plutonium's phase diagram is host to complex structures and interactions that make the description of its ground state properties elusive. Using all-electron density functional theory, we study the thermodynamic properties of $\alpha$-Pu. To do this, we build on recent work in the literature by introducing a novel noncollinear magnetic ansatz for $\alpha$-Pu's ground state. The noncollinear ansatz accurately recovers the experimental phonon density of states, heat capacity, and thermal expansion. These new results on $\alpha$-Pu along with recent results on $\delta$-Pu demonstrate the efficacy of noncollinear ansatzes for the description of plutonium.
Published: 2024

16. SiamQuality: A ConvNet-Based Foundation Model for Imperfect Physiological Signals

Author: Ding, Cheng, Guo, Zhicheng, Chen, Zhaoliang, Lee, Randall J, Rudin, Cynthia, and Hu, Xiao
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Machine Learning
Abstract: Foundation models, especially those using transformers as backbones, have gained significant popularity, particularly in language and language-vision tasks. However, large foundation models are typically trained on high-quality data, which poses a significant challenge, given the prevalence of poor-quality real-world data. This challenge is more pronounced for developing foundation models for physiological data; such data are often noisy, incomplete, or inconsistent. The present work aims to provide a toolset for developing foundation models on physiological data. We leverage a large dataset of photoplethysmography (PPG) signals from hospitalized intensive care patients. For this data, we propose SiamQuality, a novel self-supervised learning task based on convolutional neural networks (CNNs) as the backbone to enforce representations to be similar for good and poor quality signals that are from similar physiological states. We pre-trained SiamQuality on over 36 million 30-second PPG pairs and then fine-tuned and tested on six downstream tasks using external datasets. The results demonstrate the superiority of the proposed approach on all the downstream tasks, which are extremely important for heart monitoring on wearable devices. Our method indicates that CNNs can be an effective backbone for foundation models that are robust to training data quality.
Published: 2024

17. Data Poisoning Attacks on Off-Policy Policy Evaluation Methods

Author: Lobo, Elita, Singh, Harvineet, Petrik, Marek, Rudin, Cynthia, and Lakkaraju, Himabindu
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Cryptography and Security
Abstract: Off-policy Evaluation (OPE) methods are a crucial tool for evaluating policies in high-stakes domains such as healthcare, where exploration is often infeasible, unethical, or expensive. However, the extent to which such methods can be trusted under adversarial threats to data quality is largely unexplored. In this work, we make the first attempt at investigating the sensitivity of OPE methods to marginal adversarial perturbations to the data. We design a generic data poisoning attack framework leveraging influence functions from robust statistics to carefully construct perturbations that maximize error in the policy value estimates. We carry out extensive experimentation with multiple healthcare and control datasets. Our results demonstrate that many existing OPE methods are highly prone to generating value estimates with large errors when subject to data poisoning attacks, even for small adversarial perturbations. These findings question the reliability of policy values derived using OPE methods and motivate the need for developing OPE methods that are statistically robust to train-time data poisoning attacks.
Comment: Accepted at UAI 2022
Published: 2024

18. What is different between these datasets?

Author: Babbar, Varun, Guo, Zhicheng, and Rudin, Cynthia
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: The performance of machine learning models heavily depends on the quality of input data, yet real-world applications often encounter various data-related challenges. One such challenge could arise when curating training data or deploying the model in the real world - two comparable datasets in the same domain may have different distributions. While numerous techniques exist for detecting distribution shifts, the literature lacks comprehensive approaches for explaining dataset differences in a human-understandable manner. To address this gap, we propose a suite of interpretable methods (toolbox) for comparing two datasets. We demonstrate the versatility of our approach across diverse data modalities, including tabular data, language, images, and signals in both low and high-dimensional settings. Our methods not only outperform comparable and related approaches in terms of explanation quality and correctness, but also provide actionable, complementary insights to understand and mitigate dataset differences effectively.
Published: 2024

19. Symmetry Considerations for Learning Task Symmetric Robot Policies

Author: Mittal, Mayank, Rudin, Nikita, Klemm, Victor, Allshire, Arthur, and Hutter, Marco
Subjects: Computer Science - Robotics, Computer Science - Artificial Intelligence
Abstract: Symmetry is a fundamental aspect of many real-world robotic tasks. However, current deep reinforcement learning (DRL) approaches can seldom harness and exploit symmetry effectively. Often, the learned behaviors fail to achieve the desired transformation invariances and suffer from motion artifacts. For instance, a quadruped may exhibit different gaits when commanded to move forward or backward, even though it is symmetrical about its torso. This issue becomes further pronounced in high-dimensional or complex environments, where DRL methods are prone to local optima and fail to explore regions of the state space equally. Past methods on encouraging symmetry for robotic tasks have studied this topic mainly in a single-task setting, where symmetry usually refers to symmetry in the motion, such as the gait patterns. In this paper, we revisit this topic for goal-conditioned tasks in robotics, where symmetry lies mainly in task execution and not necessarily in the learned motions themselves. In particular, we investigate two approaches to incorporate symmetry invariance into DRL -- data augmentation and mirror loss function. We provide a theoretical foundation for using augmented samples in an on-policy setting. Based on this, we show that the corresponding approach achieves faster convergence and improves the learned behaviors in various challenging robotic tasks, from climbing boxes with a quadruped to dexterous manipulation.
Comment: M. Mittal and N. Rudin contributed equally. Accepted for ICRA 2024
Published: 2024

20. SpaceHopper: A Small-Scale Legged Robot for Exploring Low-Gravity Celestial Bodies

Author: Spiridonov, Alexander, Buehler, Fabio, Berclaz, Moriz, Schelbert, Valerio, Geurts, Jorit, Krasnova, Elena, Steinke, Emma, Toma, Jonas, Wuethrich, Joschua, Polat, Recep, Zimmermann, Wim, Arm, Philip, Rudin, Nikita, Kolvenbach, Hendrik, and Hutter, Marco
Subjects: Computer Science - Robotics
Abstract: We present SpaceHopper, a three-legged, small-scale robot designed for future mobile exploration of asteroids and moons. The robot weighs 5.2kg and has a body size of 245mm while using space-qualifiable components. Furthermore, SpaceHopper's design and controls make it well-adapted for investigating dynamic locomotion modes with extended flight-phases. Instead of gyroscopes or fly-wheels, the system uses its three legs to reorient the body during flight in preparation for landing. We control the leg motion for reorientation using Deep Reinforcement Learning policies. In a simulation of Ceres' gravity (0.029g), the robot can reliably jump to commanded positions up to 6m away. Our real-world experiments show that SpaceHopper can successfully reorient to a safe landing orientation within 9.7 degrees inside a rotational gimbal and jump in a counterweight setup in Earth's gravity. Overall, we consider SpaceHopper an important step towards controlled jumping locomotion in low-gravity environments.
Comment: To be published in the 2024 IEEE International Conference on Robotics and Automation
Published: 2024

21. Automated real-world data integration improves cancer outcome prediction

Author: Jee, Justin, Fong, Christopher, Pichotta, Karl, Tran, Thinh Ngoc, Luthra, Anisha, Waters, Michele, Fu, Chenlian, Altoe, Mirella, Liu, Si-Yang, Maron, Steven B., Ahmed, Mehnaj, Kim, Susie, Pirun, Mono, Chatila, Walid K., de Bruijn, Ino, Pasha, Arfath, Kundra, Ritika, Gross, Benjamin, Mastrogiacomo, Brooke, Aprati, Tyler J., Liu, David, Gao, JianJiong, Capelletti, Marzia, Pekala, Kelly, Loudon, Lisa, Perry, Maria, Bandlamudi, Chaitanya, Donoghue, Mark, Satravada, Baby Anusha, Martin, Axel, Shen, Ronglai, Chen, Yuan, Brannon, A. Rose, Chang, Jason, Braunstein, Lior, Li, Anyi, Safonov, Anton, Stonestrom, Aaron, Sanchez-Vela, Pablo, Wilhelm, Clare, Robson, Mark, Scher, Howard, Ladanyi, Marc, Reis-Filho, Jorge S., Solit, David B., Jones, David R., Gomez, Daniel, Yu, Helena, Chakravarty, Debyani, Yaeger, Rona, Abida, Wassim, Park, Wungki, O’Reilly, Eileen M., Garcia-Aguilar, Julio, Socci, Nicholas, Sanchez-Vega, Francisco, Carrot-Zhang, Jian, Stetson, Peter D., Levine, Ross, Rudin, Charles M., Berger, Michael F., Shah, Sohrab P., Schrag, Deborah, Razavi, Pedram, Kehl, Kenneth L., Li, Bob T., Riely, Gregory J., and Schultz, Nikolaus
Published: 2024

22. Cellular ATP demand creates metabolically distinct subpopulations of mitochondria

Author: Ryu, Keun Woo, Fung, Tak Shun, Baker, Daphne C., Saoi, Michelle, Park, Jinsung, Febres-Aldana, Christopher A., Aly, Rania G., Cui, Ruobing, Sharma, Anurag, Fu, Yi, Jones, Olivia L., Cai, Xin, Pasolli, H. Amalia, Cross, Justin R., Rudin, Charles M., and Thompson, Craig B.
Published: 2024

23. CTLA4 blockade abrogates KEAP1/STK11-related resistance to PD-(L)1 inhibitors

Author: Skoulidis, Ferdinandos, Araujo, Haniel A., Do, Minh Truong, Qian, Yu, Sun, Xin, Cobo, Ana Galan, Le, John T., Montesion, Meagan, Palmer, Rachael, Jahchan, Nadine, Juan, Joseph M., Min, Chengyin, Yu, Yi, Pan, Xuewen, Arbour, Kathryn C., Vokes, Natalie, Schmidt, Stephanie T., Molkentine, David, Owen, Dwight H., Memmott, Regan, Patil, Pradnya D., Marmarelis, Melina E., Awad, Mark M., Murray, Joseph C., Hellyer, Jessica A., Gainor, Justin F., Dimou, Anastasios, Bestvina, Christine M., Shu, Catherine A., Riess, Jonathan W., Blakely, Collin M., Pecot, Chad V., Mezquita, Laura, Tabbó, Fabrizio, Scheffler, Matthias, Digumarthy, Subba, Mooradian, Meghan J., Sacher, Adrian G., Lau, Sally C. M., Saltos, Andreas N., Rotow, Julia, Johnson, Rocio Perez, Liu, Corinne, Stewart, Tyler, Goldberg, Sarah B., Killam, Jonathan, Walther, Zenta, Schalper, Kurt, Davies, Kurtis D., Woodcock, Mark G., Anagnostou, Valsamo, Marrone, Kristen A., Forde, Patrick M., Ricciuti, Biagio, Venkatraman, Deepti, Van Allen, Eliezer M., Cummings, Amy L., Goldman, Jonathan W., Shaish, Hiram, Kier, Melanie, Katz, Sharyn, Aggarwal, Charu, Ni, Ying, Azok, Joseph T., Segal, Jeremy, Ritterhouse, Lauren, Neal, Joel W., Lacroix, Ludovic, Elamin, Yasir Y., Negrao, Marcelo V., Le, Xiuning, Lam, Vincent K., Lewis, Whitney E., Kemp, Haley N., Carter, Brett, Roth, Jack A., Swisher, Stephen, Lee, Richard, Zhou, Teng, Poteete, Alissa, Kong, Yifan, Takehara, Tomohiro, Paula, Alvaro Guimaraes, Parra Cuentas, Edwin R., Behrens, Carmen, Wistuba, Ignacio I., Zhang, Jianjun, Blumenschein, George R., Gay, Carl, Byers, Lauren A., Gibbons, Don L., Tsao, Anne, Lee, J. Jack, Bivona, Trever G., Camidge, D. Ross, Gray, Jhannelle E., Lieghl, Natasha, Levy, Benjamin, Brahmer, Julie R., Garassino, Marina C., Gandara, David R., Garon, Edward B., Rizvi, Naiyer A., Scagliotti, Giorgio Vittorio, Wolf, Jürgen, Planchard, David, Besse, Benjamin, Herbst, Roy S., Wakelee, Heather A., Pennell, Nathan A., Shaw, Alice T., Jänne, Pasi A., Carbone, David P., Hellmann, Matthew D., Rudin, Charles M., Albacker, Lee, Mann, Helen, Zhu, Zhou, Lai, Zhongwu, Stewart, Ross, Peters, Solange, Johnson, Melissa L., Wong, Kwok K., Huang, Alan, Winslow, Monte M., Rosen, Michael J., Winters, Ian P., Papadimitrakopoulou, Vassiliki A., Cascone, Tina, Jewsbury, Philip, and Heymach, John V.
Published: 2024
Full Text: View/download PDF

24. Sparse learned kernels for interpretable and efficient medical time series processing

Author: Chen, Sully F., Guo, Zhicheng, Ding, Cheng, Hu, Xiao, and Rudin, Cynthia
Published: 2024
Full Text: View/download PDF

25. Derivative applications to asset allocation and multi-asset management

Author: Cazalet, William, Curtil, Dimitri, Fabozzi, Frank J., Hixon, Scott, Rudin, Alexander, Sathyajit, Rahul, Stavena, James, and Upadhyay, Shubham
Published: 2024
Full Text: View/download PDF

26. Concept, Ritual, and Narrative: What Are Toddler Books?

Author: Rudin, Shai
Published: 2024
Full Text: View/download PDF

27. Sparse and Faithful Explanations Without Sparse Models

Author: Sun, Yiyang, Chen, Zhi, Orlandi, Vittorio, Wang, Tong, and Rudin, Cynthia
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Even if a model is not globally sparse, it is possible for decisions made from that model to be accurately and faithfully described by a small number of features. For instance, an application for a large loan might be denied to someone because they have no credit history, which overwhelms any evidence towards their creditworthiness. In this work, we introduce the Sparse Explanation Value (SEV), a new way of measuring sparsity in machine learning models. In the loan denial example above, the SEV is 1 because only one factor is needed to explain why the loan was denied. SEV is a measure of decision sparsity rather than overall model sparsity, and we are able to show that many machine learning models -- even if they are not sparse -- actually have low decision sparsity, as measured by SEV. SEV is defined using movements over a hypercube, allowing SEV to be defined consistently over various model classes, with movement restrictions reflecting real-world constraints. We propose algorithms that reduce SEV without sacrificing accuracy, providing sparse and completely faithful explanations, even without globally sparse models., Comment: Accepted in AISTATS 2024
Published: 2024

28. Optimal Sparse Survival Trees

Author: Zhang, Rui, Xin, Rui, Seltzer, Margo, and Rudin, Cynthia
Subjects: Computer Science - Machine Learning
Abstract: Interpretability is crucial for doctors, hospitals, pharmaceutical companies and biotechnology corporations to analyze and make decisions for high stakes problems that involve human health. Tree-based methods have been widely adopted for survival analysis due to their appealing interpretability and their ability to capture complex relationships. However, most existing methods to produce survival trees rely on heuristic (or greedy) algorithms, which risk producing sub-optimal models. We present a dynamic-programming-with-bounds approach that finds provably-optimal sparse survival tree models, frequently in only a few seconds., Comment: AISTATS2024 camera ready version. arXiv admin note: text overlap with arXiv:2211.14980
Published: 2024

29. Interpretable Causal Inference for Analyzing Wearable, Sensor, and Distributional Data

Author: Katta, Srikar, Parikh, Harsh, Rudin, Cynthia, and Volfovsky, Alexander
Subjects: Computer Science - Machine Learning, Electrical Engineering and Systems Science - Signal Processing, Statistics - Methodology
Abstract: Many modern causal questions ask how treatments affect complex outcomes that are measured using wearable devices and sensors. Current analysis approaches require summarizing these data into scalar statistics (e.g., the mean), but these summaries can be misleading. For example, disparate distributions can have the same means, variances, and other statistics. Researchers can overcome the loss of information by instead representing the data as distributions. We develop an interpretable method for distributional data analysis that ensures trustworthy and robust decision-making: Analyzing Distributional Data via Matching After Learning to Stretch (ADD MALTS). We (i) provide analytical guarantees of the correctness of our estimation strategy, (ii) demonstrate via simulation that ADD MALTS outperforms other distributional data analysis methods at estimating treatment effects, and (iii) illustrate ADD MALTS' ability to verify whether there is enough cohesion between treatment and control units within subpopulations to trustworthily estimate treatment effects. We demonstrate ADD MALTS' utility by studying the effectiveness of continuous glucose monitors in mitigating diabetes risks.
Published: 2023

30. Reconsideration on evaluation of machine learning models in continuous monitoring using wearables

Author: Ding, Cheng, Guo, Zhicheng, Rudin, Cynthia, Xiao, Ran, Nahab, Fadi B, and Hu, Xiao
Subjects: Computer Science - Machine Learning, Electrical Engineering and Systems Science - Signal Processing
Abstract: This paper explores the challenges in evaluating machine learning (ML) models for continuous health monitoring using wearable devices beyond conventional metrics. We state the complexities posed by real-world variability, disease dynamics, user-specific characteristics, and the prevalence of false notifications, necessitating novel evaluation strategies. Drawing insights from large-scale heart studies, the paper offers a comprehensive guideline for robust ML model evaluation on continuous health monitoring.
Published: 2023

31. ProtoEEGNet: An Interpretable Approach for Detecting Interictal Epileptiform Discharges

Author: Tang, Dennis, Willard, Frank, Tegerdine, Ronan, Triplett, Luke, Donnelly, Jon, Moffett, Luke, Semenova, Lesia, Barnett, Alina Jade, Jing, Jin, Rudin, Cynthia, and Westover, Brandon
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Machine Learning
Abstract: In electroencephalogram (EEG) recordings, the presence of interictal epileptiform discharges (IEDs) serves as a critical biomarker for seizures or seizure-like events. Detecting IEDs can be difficult; even highly trained experts disagree on the same sample. As a result, specialists have turned to machine-learning models for assistance. However, many existing models are black boxes and do not provide any human-interpretable reasoning for their decisions. In high-stakes medical applications, it is critical to have interpretable models so that experts can validate the reasoning of the model before making important diagnoses. We introduce ProtoEEGNet, a model that achieves state-of-the-art accuracy for IED detection while additionally providing an interpretable justification for its classifications. Specifically, it can reason that one EEG looks similar to another ''prototypical'' EEG that is known to contain an IED. ProtoEEGNet can therefore help medical professionals effectively detect IEDs while maintaining a transparent decision-making process., Comment: 11 pages, 4 figures
Published: 2023

32. Acute effects of R-MDMA, S-MDMA, and racemic MDMA in a randomized double-blind cross-over trial in healthy participants

Author: Straumann, Isabelle, Avedisian, Isidora, Klaiber, Aaron, Varghese, Nimmy, Eckert, Anne, Rudin, Deborah, Luethi, Dino, and Liechti, Matthias E.
Published: 2024
Full Text: View/download PDF

33. Fast and Interpretable Mortality Risk Scores for Critical Care Patients

Author: Zhu, Chloe Qinyu, Tian, Muhang, Semenova, Lesia, Liu, Jiachang, Xu, Jack, Scarpa, Joseph, and Rudin, Cynthia
Subjects: Computer Science - Machine Learning, Computer Science - Computers and Society
Abstract: Prediction of mortality in intensive care unit (ICU) patients is an important task in critical care medicine. Prior work in creating mortality risk models falls into two major categories: domain-expert-created scoring systems, and black box machine learning (ML) models. Both of these have disadvantages: black box models are unacceptable for use in hospitals, whereas manual creation of models (including hand-tuning of logistic regression parameters) relies on humans to perform high-dimensional constrained optimization, which leads to a loss in performance. In this work, we bridge the gap between accurate black box models and hand-tuned interpretable models. We build on modern interpretable ML techniques to design accurate and interpretable mortality risk scores. We leverage the largest existing public ICU monitoring datasets, namely the MIMIC III and eICU datasets. By evaluating risk across medical centers, we are able to study generalization across domains. In order to customize our risk score models, we develop a new algorithm, GroupFasterRisk, which has several important benefits: (1) it uses a hard sparsity constraint, allowing users to directly control the number of features; (2) it incorporates group sparsity to allow more cohesive models; (3) it allows for monotonicity correction on models for including domain knowledge; (4) it produces many equally-good models at once, which allows domain experts to choose among them. GroupFasterRisk creates its risk scores within hours, even on the large datasets we study here. GroupFasterRisk's risk scores perform better than risk scores currently used in hospitals, and have similar prediction performance to black box ML models (despite being much sparser). Because GroupFasterRisk produces a variety of risk scores and handles constraints, it allows design flexibility, which is the key enabler of practical and trustworthy model creation.
Published: 2023

34. Learning Agile Locomotion on Risky Terrains

Author: Zhang, Chong, Rudin, Nikita, Hoeller, David, and Hutter, Marco
Subjects: Computer Science - Robotics
Abstract: Quadruped robots have shown remarkable mobility on various terrains through reinforcement learning. Yet, in the presence of sparse footholds and risky terrains such as stepping stones and balance beams, which require precise foot placement to avoid falls, model-based approaches are often used. In this paper, we show that end-to-end reinforcement learning can also enable the robot to traverse risky terrains with dynamic motions. To this end, our approach involves training a generalist policy for agile locomotion on disorderly and sparse stepping stones before transferring its reusable knowledge to various more challenging terrains by finetuning specialist policies from it. Given that the robot needs to rapidly adapt its velocity on these terrains, we formulate the task as a navigation task instead of the commonly used velocity tracking which constrains the robot's behavior and propose an exploration strategy to overcome sparse rewards and achieve high robustness. We validate our proposed method through simulation and real-world experiments on an ANYmal-D robot achieving peak forward velocity of >= 2.5 m/s on sparse stepping stones and narrow balance beams. Video: youtu.be/Z5X0J8OH6z4, Comment: 8 pages, 11 figures. IROS 2024
Published: 2023

35. A Path to Simpler Models Starts With Noise

Author: Semenova, Lesia, Chen, Harry, Parr, Ronald, and Rudin, Cynthia
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: The Rashomon set is the set of models that perform approximately equally well on a given dataset, and the Rashomon ratio is the fraction of all models in a given hypothesis space that are in the Rashomon set. Rashomon ratios are often large for tabular datasets in criminal justice, healthcare, lending, education, and in other areas, which has practical implications about whether simpler models can attain the same level of accuracy as more complex models. An open question is why Rashomon ratios often tend to be large. In this work, we propose and study a mechanism of the data generation process, coupled with choices usually made by the analyst during the learning process, that determines the size of the Rashomon ratio. Specifically, we demonstrate that noisier datasets lead to larger Rashomon ratios through the way that practitioners train models. Additionally, we introduce a measure called pattern diversity, which captures the average difference in predictions between distinct classification patterns in the Rashomon set, and motivate why it tends to increase with label noise. Our results explain a key aspect of why simpler models often tend to perform as well as black box models on complex, noisier datasets., Comment: NeurIPS 2023
Published: 2023

36. This Looks Like Those: Illuminating Prototypical Concepts Using Multiple Visualizations

Author: Ma, Chiyu, Zhao, Brandon, Chen, Chaofan, and Rudin, Cynthia
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We present ProtoConcepts, a method for interpretable image classification combining deep learning and case-based reasoning using prototypical parts. Existing work in prototype-based image classification uses a ``this looks like that'' reasoning process, which dissects a test image by finding prototypical parts and combining evidence from these prototypes to make a final classification. However, all of the existing prototypical part-based image classifiers provide only one-to-one comparisons, where a single training image patch serves as a prototype to compare with a part of our test image. With these single-image comparisons, it can often be difficult to identify the underlying concept being compared (e.g., ``is it comparing the color or the shape?''). Our proposed method modifies the architecture of prototype-based networks to instead learn prototypical concepts which are visualized using multiple image patches. Having multiple visualizations of the same prototype allows us to more easily identify the concept captured by that prototype (e.g., ``the test image and the related training patches are all the same shade of blue''), and allows our model to create richer, more interpretable visual explanations. Our experiments show that our ``this looks like those'' reasoning process can be applied as a modification to a wide range of existing prototypical image classification networks while achieving comparable accuracy on benchmark datasets.
Published: 2023

37. Safe and Interpretable Estimation of Optimal Treatment Regimes

Author: Parikh, Harsh, Lanners, Quinn, Akras, Zade, Zafar, Sahar F., Westover, M. Brandon, Rudin, Cynthia, and Volfovsky, Alexander
Subjects: Computer Science - Machine Learning, Statistics - Applications, Statistics - Methodology
Abstract: Recent statistical and reinforcement learning methods have significantly advanced patient care strategies. However, these approaches face substantial challenges in high-stakes contexts, including missing data, inherent stochasticity, and the critical requirements for interpretability and patient safety. Our work operationalizes a safe and interpretable framework to identify optimal treatment regimes. This approach involves matching patients with similar medical and pharmacological characteristics, allowing us to construct an optimal policy via interpolation. We perform a comprehensive simulation study to demonstrate the framework's ability to identify optimal policies even in complex settings. Ultimately, we operationalize our approach to study regimes for treating seizures in critically ill patients. Our findings strongly support personalized treatment strategies based on a patient's medical history and pharmacological features. Notably, we identify that reducing medication doses for patients with mild and brief seizure episodes while adopting aggressive treatment for patients in intensive care unit experiencing intense seizures leads to more favorable outcomes., Comment: Accepted for publication in the proceedings of AISTATS 2025
Published: 2023

38. Uncertainty Quantification of Bandgaps in Acoustic Metamaterials with Stochastic Geometric Defects and Material Properties

Author: Zhang, Han, Mahabadi, Rayehe Karimi, Rudin, Cynthia, Guilleminot, Johann, and Brinson, L. Catherine
Subjects: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, Physics - Applied Physics, Physics - Data Analysis, Statistics and Probability
Abstract: This paper studies the utility of techniques within uncertainty quantification, namely spectral projection and polynomial chaos expansion, in reducing sampling needs for characterizing acoustic metamaterial dispersion band responses given stochastic material properties and geometric defects. A novel method of encoding geometric defects in an interpretable, resolution-independent manner is showcased in the formation of input space probability distributions. Orders of magnitude sampling reductions down to $\sim10^0$ and $\sim10^1$ are achieved in the 1D and 7D input space scenarios respectively while maintaining accurate output space probability distributions through combining Monte Carlo, quadrature rule, and sparse grid sampling with surrogate model fitting.
Published: 2023

39. SiamAF: Learning Shared Information from ECG and PPG Signals for Robust Atrial Fibrillation Detection

Author: Guo, Zhicheng, Ding, Cheng, Do, Duc H., Shah, Amit, Lee, Randall J., Hu, Xiao, and Rudin, Cynthia
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Atrial fibrillation (AF) is the most common type of cardiac arrhythmia. It is associated with an increased risk of stroke, heart failure, and other cardiovascular complications, but can be clinically silent. Passive AF monitoring with wearables may help reduce adverse clinical outcomes related to AF. Detecting AF in noisy wearable data poses a significant challenge, leading to the emergence of various deep learning techniques. Previous deep learning models learn from a single modality, either electrocardiogram (ECG) or photoplethysmography (PPG) signals. However, deep learning models often struggle to learn generalizable features and rely on features that are more susceptible to corruption from noise, leading to sub-optimal performances in certain scenarios, especially with low-quality signals. Given the increasing availability of ECG and PPG signal pairs from wearables and bedside monitors, we propose a new approach, SiamAF, leveraging a novel Siamese network architecture and joint learning loss function to learn shared information from both ECG and PPG signals. At inference time, the proposed model is able to predict AF from either PPG or ECG and outperforms baseline methods on three external test sets. It learns medically relevant features as a result of our novel architecture design. The proposed model also achieves comparable performance to traditional learning regimes while requiring much fewer training labels, providing a potential approach to reduce future reliance on manual labeling.
Published: 2023

40. Quantisation Across Bubble Walls and Friction

Author: Azatov, Aleksandr, Barni, Giulio, Petrossian-Byrne, Rudin, and Vanvlasselaer, Miguel
Subjects: High Energy Physics - Phenomenology, Astrophysics - Cosmology and Nongalactic Astrophysics, High Energy Physics - Theory
Abstract: We quantise from first principles field theories living on the background of a bubble wall in the planar limit with particular focus on the case of spontaneous breaking of gauge symmetry. Using these tools, we compute the average momentum transfer from transition radiation: the soft emission of radiation by an energetic particle passing across the wall, with a particular focus on the longitudinal polarisation of vectors. We find these to be comparable to transverse polarisations in symmetry-breaking transitions with mild super-cooling, and dominant in broken to broken transitions with thin wall. Our results have phenomenological applications for the expansion of bubbles during first order phase transitions. Our general framework allows for the robust calculation of any particle processes of interest in such translation breaking backgrounds., Comment: 42 pages + appendices. Updated to match version published in JHEP
Published: 2023
Full Text: View/download PDF

41. Resilient Legged Local Navigation: Learning to Traverse with Compromised Perception End-to-End

Author: Jin, Jin, Zhang, Chong, Frey, Jonas, Rudin, Nikita, Mattamala, Matias, Cadena, Cesar, and Hutter, Marco
Subjects: Computer Science - Robotics, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Systems and Control
Abstract: Autonomous robots must navigate reliably in unknown environments even under compromised exteroceptive perception, or perception failures. Such failures often occur when harsh environments lead to degraded sensing, or when the perception algorithm misinterprets the scene due to limited generalization. In this paper, we model perception failures as invisible obstacles and pits, and train a reinforcement learning (RL) based local navigation policy to guide our legged robot. Unlike previous works relying on heuristics and anomaly detection to update navigational information, we train our navigation policy to reconstruct the environment information in the latent space from corrupted perception and react to perception failures end-to-end. To this end, we incorporate both proprioception and exteroception into our policy inputs, thereby enabling the policy to sense collisions on different body parts and pits, prompting corresponding reactions. We validate our approach in simulation and on the real quadruped robot ANYmal running in real-time (<10 ms CPU inference). In a quantitative comparison with existing heuristic-based locally reactive planners, our policy increases the success rate over 30% when facing perception failures. Project Page: https://bit.ly/45NBTuh., Comment: Website and videos are available at our Project Page: https://bit.ly/45NBTuh
Published: 2023

42. The Rashomon Importance Distribution: Getting RID of Unstable, Single Model-based Variable Importance

Author: Donnelly, Jon, Katta, Srikar, Rudin, Cynthia, and Browne, Edward P.
Subjects: Computer Science - Machine Learning, Quantitative Biology - Genomics, Statistics - Machine Learning
Abstract: Quantifying variable importance is essential for answering high-stakes questions in fields like genetics, public policy, and medicine. Current methods generally calculate variable importance for a given model trained on a given dataset. However, for a given dataset, there may be many models that explain the target outcome equally well; without accounting for all possible explanations, different researchers may arrive at many conflicting yet equally valid conclusions given the same data. Additionally, even when accounting for all possible explanations for a given dataset, these insights may not generalize because not all good explanations are stable across reasonable data perturbations. We propose a new variable importance framework that quantifies the importance of a variable across the set of all good models and is stable across the data distribution. Our framework is extremely flexible and can be integrated with most existing model classes and global variable importance metrics. We demonstrate through experiments that our framework recovers variable importance rankings for complex simulation setups where other methods fail. Further, we show that our framework accurately estimates the true importance of a variable for the underlying data distribution. We provide theoretical guarantees on the consistency and finite sample error rates for our estimator. Finally, we demonstrate its utility with a real-world case study exploring which genes are important for predicting HIV load in persons with HIV, highlighting an important gene that has not previously been studied in connection with HIV. Code is available at https://github.com/jdonnelly36/Rashomon_Importance_Distribution., Comment: Appeared in NeurIPS 2023 as a spotlight paper
Published: 2023

43. Correction: SMARCA4 controls state plasticity in small cell lung cancer through regulation of neuroendocrine transcription factors and REST splicing

Author: Esther Redin, Harsha Sridhar, Yingqian A. Zhan, Barbara Pereira Mello, Hong Zhong, Vidushi Durani, Amin Sabet, Parvathy Manoj, Irina Linkov, Juan Qiu, Richard P. Koche, Elisa de Stanchina, Maider Astorkia, Doron Betel, Álvaro Quintanal-Villalonga, and Charles M. Rudin
Subjects: Diseases of the blood and blood-forming organs, RC633-647.5, Neoplasms. Tumors. Oncology. Including cancer and carcinogens, RC254-282
Published: 2024
Full Text: View/download PDF

44. An Ambivalent Story: Queer Children's Literature in Israel between 1986 and 2022

Author: Shai Rudin
Abstract: Since the establishment of the State of Israel in 1948, only 23 Queer literary works have been published for children and adolescents in Hebrew. This paper examines the characteristics of these works in light of the ambivalent and controversial status of the LGBTQ+ community in Israel. The findings show that the few works that have been published and that present same-sex families or LGBTQ+ characters, and the manner in which Queer themes are presented in those texts, are in line with how these issues are perceived and addressed by Israeli society. Moreover, this article divides the Queer Israeli literature for children and youth into three categories: implied Queer literature that hides the Queer existence through the allegoric genre; Queer-phobic literature that seemingly presents the Queer family but in actual fact promotes homophobic stereotypes; and Queer literature that places the LGBTQ+ character or family in the center of the story without passing judgment or criticism.
Published: 2024
Full Text: View/download PDF

45. A Self-Supervised Algorithm for Denoising Photoplethysmography Signals for Heart Rate Estimation from Wearables

Author: Jain, Pranay, Ding, Cheng, Rudin, Cynthia, and Hu, Xiao
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Machine Learning
Abstract: Smart watches and other wearable devices are equipped with photoplethysmography (PPG) sensors for monitoring heart rate and other aspects of cardiovascular health. However, PPG signals collected from such devices are susceptible to corruption from noise and motion artifacts, which cause errors in heart rate estimation. Typical denoising approaches filter or reconstruct the signal in ways that eliminate much of the morphological information, even from the clean parts of the signal that would be useful to preserve. In this work, we develop an algorithm for denoising PPG signals that reconstructs the corrupted parts of the signal, while preserving the clean parts of the PPG signal. Our novel framework relies on self-supervised training, where we leverage a large database of clean PPG signals to train a denoising autoencoder. As we show, our reconstructed signals provide better estimates of heart rate from PPG signals than the leading heart rate estimation methods. Further experiments show significant improvement in Heart Rate Variability (HRV) estimation from PPG signals using our algorithm. We conclude that our algorithm denoises PPG signals in a way that can improve downstream analysis of many different health metrics from wearable devices., Comment: 13 pages, 6 figures
Published: 2023

46. Sparse learned kernels for interpretable and efficient medical time series processing

Author: Chen, Sully F., Guo, Zhicheng, Ding, Cheng, Hu, Xiao, and Rudin, Cynthia
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Rapid, reliable, and accurate interpretation of medical time-series signals is crucial for high-stakes clinical decision-making. Deep learning methods offered unprecedented performance in medical signal processing but at a cost: they were compute-intensive and lacked interpretability. We propose Sparse Mixture of Learned Kernels (SMoLK), an interpretable architecture for medical time series processing. SMoLK learns a set of lightweight flexible kernels that form a single-layer sparse neural network, providing not only interpretability, but also efficiency, robustness, and generalization to unseen data distributions. We introduce parameter reduction techniques to reduce the size of SMoLK's networks while maintaining performance. We test SMoLK on two important tasks common to many consumer wearables: photoplethysmography (PPG) artifact detection and atrial fibrillation detection from single-lead electrocardiograms (ECGs). We find that SMoLK matches the performance of models orders of magnitude larger. It is particularly suited for real-time applications using low-power devices, and its interpretability benefits high-stakes situations., Comment: Published as an article in Nature Machine Intelligence (https://doi.org/10.1038/s42256-024-00898-4). 23 pages, 9 figures
Published: 2023
Full Text: View/download PDF

47. A Double Machine Learning Approach to Combining Experimental and Observational Data

Author: Parikh, Harsh, Morucci, Marco, Orlandi, Vittorio, Roy, Sudeepa, Rudin, Cynthia, and Volfovsky, Alexander
Subjects: Statistics - Methodology, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Economics - Econometrics
Abstract: Experimental and observational studies often lack validity due to untestable assumptions. We propose a double machine learning approach to combine experimental and observational studies, allowing practitioners to test for assumption violations and estimate treatment effects consistently. Our framework tests for violations of external validity and ignorability under milder assumptions. When only one of these assumptions is violated, we provide semiparametrically efficient treatment effect estimators. However, our no-free-lunch theorem highlights the necessity of accurately identifying the violated assumption for consistent treatment effect estimation. Through comparative analyses, we show our framework's superiority over existing data fusion methods. The practical utility of our approach is further exemplified by three real-world case studies, underscoring its potential for widespread application in empirical research.
Published: 2023

48. ANYmal Parkour: Learning Agile Navigation for Quadrupedal Robots

Author: Hoeller, David, Rudin, Nikita, Sako, Dhionis, and Hutter, Marco
Subjects: Computer Science - Robotics
Abstract: Performing agile navigation with four-legged robots is a challenging task due to the highly dynamic motions, contacts with various parts of the robot, and the limited field of view of the perception sensors. In this paper, we propose a fully-learned approach to train such robots and conquer scenarios that are reminiscent of parkour challenges. The method involves training advanced locomotion skills for several types of obstacles, such as walking, jumping, climbing, and crouching, and then using a high-level policy to select and control those skills across the terrain. Thanks to our hierarchical formulation, the navigation policy is aware of the capabilities of each skill, and it will adapt its behavior depending on the scenario at hand. Additionally, a perception module is trained to reconstruct obstacles from highly occluded and noisy sensory data and endows the pipeline with scene understanding. Compared to previous attempts, our method can plan a path for challenging scenarios without expert demonstration, offline computation, a priori knowledge of the environment, or taking contacts explicitly into account. While these modules are trained from simulated data only, our real-world experiments demonstrate successful transfer on hardware, where the robot navigates and crosses consecutive challenging obstacles with speeds of up to two meters per second. The supplementary video can be found on the project website: https://sites.google.com/leggedrobotics.com/agile-navigation
Published: 2023

49. Cohort profile: the Swiss Mother and Child HIV Cohort Study (MoCHiV)

Author: PA Crisinel, P Paioni, N Müller, A Anagnostopoulos, M Hoffmann, Huldrych Günthard, J Böni, J Schüpbach, M Baumann, D Nadal, Christoph Rudin, M Egger, C-A Siegrist, N Wagner, J Németh, A Duppenthaler, J Notter, L Kaiser, J-J CHESEAUX, P Schmid, A Rauch, M Huber, C Rudin, B Ledergerber, A Calmy, M Cavassini, M Rickenbach, H Furrer, M Battegay, L Elzi, Andri Rauch, B Hirschel, E Bernasconi, HC Bucher, J Fehr, J Fellay, CA Fux, D Haerry, B Hasse, HH Hirsch, I Hösli, O Keiser, T Klimkait, H Kovari, B Martinez de Tejada, G Pantaleo, R Speck, P Tarr, A Trkola, S Yerly, P Francioli, Andrea Duppenthaler, Karoline Aebi-Popp, Noemie Wagner, Paolo Paioni, Murezi Capaul, Anja Brunner, Anna Traytel, Pierre-Alex Crisinel, Begona Martinez De Tejada, Lisa Kottanattu, Marcel Stöckle, Irene Hösli, Alexandra Scherrer, Katharina Kusejko, Christian R Kahlert, I Abela, K Aebi-Popp, DL Braun, A Ciuffi, K Darling, G Dollenmaier, K Francini, HF Günthard, A Hachfeld, D Jackson-Perry, CR Kahlert, E Kapfhammer, M Kohns, L Kottanattu, RD Kouyos, K Kusejko, N Labhardt, C Marzolini, KJ Metzner, D Nicca, M Perreau, Polli Ch, L Salazar-Vizcaya, M Stöckle, M Thanh Lecompte, G Wandeler, M Weisser, and C Kind
Subjects: Medicine
Abstract: Purpose: Prospective, multicentric observational cohort study in Switzerland investigating measures to prevent mother-to-child transmission in pregnant women with HIV (WWH) and assessing health and development of their exposed children as well as of children with HIV (CWH) in general. Participants: Between January 1986 and December 2022, a total of 1446 mother–child pairs were enrolled. During the same period, the study also registered 187 CWH and 521 HIV-exposed but uninfected children (HEU), for whom detailed maternal information was not available. Consequently, the cohort comprises a total of 2154 children. Findings to date: During these 37 years, research by the Swiss Mother and Child HIV Cohort Study (MoCHiV) and its international collaborators has strongly influenced the prevention of vertical transmission of HIV (eg, introduction and discontinuation of elective caesarean section, neonatal postexposure prophylaxis and breastfeeding). Contributions have also been made to the management of diagnostics (eg, p24 antigen assay) and the effects of antiretroviral treatment (eg, prematurity, growth) in HEU and CWH. Future plans: Most children present within the cohort are now HEU, highlighting the need to investigate other vertically transmitted pathogens such as hepatitis B and C viruses, cytomegalovirus or Treponema pallidum. In addition, analyses are planned on the longitudinal health status of CWH (eg, resistance and prolonged exposure to antiretroviral therapy), on social aspects including stigma in CWH and HEU, and on interventions to further optimise antenatal and postpartum care in WWH.
Published: 2024

50. Correction: SMARCA4 controls state plasticity in small cell lung cancer through regulation of neuroendocrine transcription factors and REST splicing

Author: Redin, Esther, Sridhar, Harsha, Zhan, Yingqian A., Pereira Mello, Barbara, Zhong, Hong, Durani, Vidushi, Sabet, Amin, Manoj, Parvathy, Linkov, Irina, Qiu, Juan, Koche, Richard P., de Stanchina, Elisa, Astorkia, Maider, Betel, Doron, Quintanal-Villalonga, Álvaro, and Rudin, Charles M.
Published: 2024
