Author: "Smyth P" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Smyth P"' showing total 15,363 results

1. Benchmark Data Repositories for Better Benchmarking

Author: Longjohn, Rachel, Kelly, Markelle, Singh, Sameer, and Smyth, Padhraic
Subjects: Computer Science - Machine Learning, Computer Science - Digital Libraries
Abstract: In machine learning research, it is common to evaluate algorithms via their performance on standard benchmark datasets. While a growing body of work establishes guidelines for -- and levies criticisms at -- data and benchmarking practices in machine learning, comparatively less attention has been paid to the data repositories where these datasets are stored, documented, and shared. In this paper, we analyze the landscape of these $\textit{benchmark data repositories}$ and the role they can play in improving benchmarking. This role includes addressing issues with both datasets themselves (e.g., representational harms, construct validity) and the manner in which evaluation is carried out using such datasets (e.g., overemphasis on a few datasets and metrics, lack of reproducibility). To this end, we identify and discuss a set of considerations surrounding the design and use of benchmark data repositories, with a focus on improving benchmarking practices in machine learning.
Comment: Accepted to NeurIPS Datasets and Benchmarks 2024
Published: 2024

2. ELBOing Stein: Variational Bayes with Stein Mixture Inference

Author: Rønning, Ola, Nalisnick, Eric, Ley, Christophe, Smyth, Padhraic, and Hamelryck, Thomas
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Stein variational gradient descent (SVGD) [Liu and Wang, 2016] performs approximate Bayesian inference by representing the posterior with a set of particles. However, SVGD suffers from variance collapse, i.e. poor predictions due to underestimating uncertainty [Ba et al., 2021], even for moderately-dimensional models such as small Bayesian neural networks (BNNs). To address this issue, we generalize SVGD by letting each particle parameterize a component distribution in a mixture model. Our method, Stein Mixture Inference (SMI), optimizes a lower bound to the evidence (ELBO) and introduces user-specified guides parameterized by particles. SMI extends the Nonlinear SVGD framework [Wang and Liu, 2019] to the case of variational Bayes. SMI effectively avoids variance collapse, judging by a previously described test developed for this purpose, and performs well on standard data sets. In addition, SMI requires considerably fewer particles than SVGD to accurately estimate uncertainty for small BNNs. The synergistic combination of NSVGD, ELBO optimization and user-specified guides establishes a promising approach towards variational Bayesian inference in the case of tall and wide data.
Published: 2024

3. EventFlow: Forecasting Continuous-Time Event Data with Flow Matching

Author: Kerrigan, Gavin, Nelson, Kai, and Smyth, Padhraic
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Continuous-time event sequences, in which events occur at irregular intervals, are ubiquitous across a wide range of industrial and scientific domains. The contemporary modeling paradigm is to treat such data as realizations of a temporal point process, and in machine learning it is common to model temporal point processes in an autoregressive fashion using a neural network. While autoregressive models are successful in predicting the time of a single subsequent event, their performance can be unsatisfactory in forecasting longer horizons due to cascading errors. We propose EventFlow, a non-autoregressive generative model for temporal point processes. Our model builds on the flow matching framework in order to directly learn joint distributions over event times, side-stepping the autoregressive process. EventFlow is likelihood-free, easy to implement and sample from, and either matches or surpasses the performance of state-of-the-art models in both unconditional and conditional generation tasks on a set of standard benchmarks.
Published: 2024

4. A Generative Diffusion Model for Probabilistic Ensembles of Precipitation Maps Conditioned on Multisensor Satellite Observations

Author: Guilloteau, Clement, Kerrigan, Gavin, Nelson, Kai, Migliorini, Giosue, Smyth, Padhraic, Li, Runze, and Foufoula-Georgiou, Efi
Subjects: Physics - Atmospheric and Oceanic Physics, Physics - Data Analysis, Statistics and Probability
Abstract: A generative diffusion model is used to produce probabilistic ensembles of precipitation intensity maps at the 1-hour 5-km resolution. The generation is conditioned on infrared and microwave radiometric measurements from the GOES and DMSP satellites and is trained with merged ground radar and gauge data over the southeastern United States. The generated precipitation maps reproduce the spatial autocovariance and other multiscale statistical properties of the gauge-radar reference fields on average. Conditioning the generation on the satellite measurements allows us to constrain the magnitude and location of each generated precipitation feature. The mean of the 128-member ensemble shows high spatial coherence with the reference fields with 0.82 linear correlation between the two. On average, the coherence between any two ensemble members is approximately the same as the coherence between any ensemble member and the ground reference, attesting that the ensemble dispersion is a proper measure of the estimation uncertainty. From the generated ensembles we can easily derive the probability of the precipitation intensity exceeding any given intensity threshold, at the 5-km resolution of the generation or at any desired aggregated resolution.
Published: 2024

5. Real-time observation of frustrated ultrafast recovery from ionisation in nanostructured SiO2 using laser driven accelerators

Author: Kennedy, J. P., Coughlan, M., Fitzpatrick, C. R. J., Huddleston, H. M., Smyth, J., Breslin, N., Donnelly, H., Arthur, C., Villagomez, B., Rosmej, O. N., Currell, F., Stella, L., Riley, D., Zepf, M., Yeung, M., Lewis, C. L. S., and Dromey, B.
Subjects: Physics - Accelerator Physics, Condensed Matter - Materials Science, Physics - Plasma Physics
Abstract: Ionising radiation interactions in matter can trigger a cascade of processes that underpin long-lived damage in the medium. To date, however, a lack of suitable methodologies has precluded our ability to understand the role that material nanostructure plays in this cascade. Here, we use transient photoabsorption to track the lifetime of free electrons (t_c) in bulk and nanostructured SiO2 (aerogel) irradiated by picosecond-scale (10^-12 s) bursts of X-rays and protons from a laser-driven accelerator. Optical streaking reveals a sharp increase in t_c from < 1 ps to > 50 ps over a narrow average density (p_av) range spanning the expected phonon-fracton crossover in aerogels. Numerical modelling suggests that this discontinuity can be understood by a quenching of rapid, phonon-assisted recovery in irradiated nanostructured SiO2. This is shown to lead to an extended period of enhanced energy density in the excited electron population. Overall, these results open a direct route to tracking how low-level processes in complex systems can underpin macroscopically observed phenomena and, importantly, the conditions that permit them to emerge.
Published: 2024

6. V-Words, Lyndon Words and Galois Words

Author: Daykin, Jacqueline W., Mhaskar, Neerja, and Smyth, W. F.
Subjects: Computer Science - Data Structures and Algorithms
Abstract: We say that a family $\mathcal{W}$ of strings over $\Sigma^+$ forms a Unique Maximal Factorization Family (UMFF) if and only if every $w \in \mathcal{W}$ has a unique maximal factorization. Further, a UMFF $\mathcal{W}$ is called a circ-UMFF whenever it contains exactly one rotation of every primitive string $x \in \Sigma^+$. $V$-order is a non-lexicographical total ordering on strings that determines a circ-UMFF. In this paper we propose a generalization of circ-UMFF called the substring circ-UMFF and extend combinatorial research on $V$-order by investigating connections to Lyndon words. Then we extend these concepts to any total order. Applications of this research arise in efficient text indexing, compression, and search problems.
Comment: 30 pages
Published: 2024

7. DNA Replication across α-l-(3'-2')-Threofuranosyl Nucleotides Mediated by Human DNA Polymerase η.

Author: Tomar, Rachana, Ghodke, Pratibha, Patra, Amritraj, Smyth, Elizabeth, Pontarelli, Alexander, Copp, William, Guengerich, F, Chaput, John, Wilds, Christopher, Stone, Michael, and Egli, Martin
Subjects: Humans, DNA-Directed DNA Polymerase, DNA Replication, DNA, Nucleotides, Crystallography, X-Ray, Models, Molecular
Abstract: α-l-(3'-2')-Threofuranosyl nucleic acid (TNA) pairs with itself, cross-pairs with DNA and RNA, and shows promise as a tool in synthetic genetics, diagnostics, and oligonucleotide therapeutics. We studied in vitro primer insertion and extension reactions catalyzed by human trans-lesion synthesis (TLS) DNA polymerase η (hPol η) opposite a TNA-modified template strand without and in combination with O4-alkyl thymine lesions. Across TNA-T (tT), hPol η inserted mostly dAMP and dGMP, dTMP and dCMP with lower efficiencies, followed by extension of the primer to a full-length product. hPol η inserted dAMP opposite O4-methyl and -ethyl analogs of tT, albeit with reduced efficiencies relative to tT. Crystal structures of ternary hPol η complexes with template tT and O4-methyl tT at the insertion and extension stages demonstrated that the shorter backbone and different connectivity of TNA compared to DNA (3' → 2' versus 5' → 3', respectively) result in local differences in sugar orientations, adjacent phosphate spacings, and directions of glycosidic bonds. The 3'-OH of the primer's terminal thymine was positioned at 3.4 Å on average from the α-phosphate of the incoming dNTP, consistent with insertion opposite and extension past the TNA residue by hPol η. Conversely, the crystal structure of a ternary hPol η·DNA·tTTP complex revealed that the primer's terminal 3'-OH was too distant from the tTTP α-phosphate, consistent with the inability of the polymerase to incorporate TNA. Overall, our study provides a better understanding of the tolerance of a TLS DNA polymerase vis-à-vis unnatural nucleotides in the template and as the incoming nucleoside triphosphate.
Published: 2024

8. Airflow Modeling for Citrus under Protective Screens.

Author: Kurafeeva, Liubov, Wolski, Rich, Krintz, Chandra, and Smyth, Thomas
Subjects: CFD, citrus crop, controlled environment agriculture, validation, wind modeling, Citrus, Agriculture, Models, Theoretical, Wind, Hydrodynamics
Abstract: This study explores the development and validation of an airflow model to support climate prediction for Citrus Under Protective Screens (CUPS) in California. CUPS is a permeable screen structure designed to protect a field of citrus trees from large insects including the vector that causes the devastating citrus greening disease. Because screen structures modify the environmental conditions (e.g., temperature, relative humidity, airflow), farm management and treatment strategies (e.g., pesticide spraying events) must be modified to account for these differences. Toward this end, we develop a model for predicting wind speed and direction in a commercial-scale research CUPS, using a computational fluid dynamics (CFD) model. We describe the model and validate it in two ways. In the first, we model a small-scale replica CUPS under controlled conditions and compare modeled and measured airflow in and around the replica structure. In the second, we model the full-scale CUPS and use historical measurements to back-test the model's accuracy. In both settings, the modeled airflow values fall within statistical confidence intervals generated from the corresponding measurements of the conditions being modeled. These findings suggest that the model can aid decision support and smart agriculture solutions for farmers as they adapt their farm management practices for CUPS structures.
Published: 2024

9. An Analysis of the Impact of Gold Open Access Publications in Computer Science

Author: Cunningham, Padraig and Smyth, Barry
Subjects: Computer Science - Digital Libraries
Abstract: There has been some concern about the impact of predatory publishers on scientific research for some time. Recently, publishers that might previously have been considered 'predatory' have established their bona fides, at least to the extent that they are included in citation impact scores such as the field-weighted citation impact (FWCI). These are sometimes called 'grey' publishers (MDPI, Frontiers, Hindawi). In this paper, we show that the citation landscape for these grey publications is significantly different from the mainstream landscape and that affording publications in these venues the same status as publications in mainstream journals may significantly distort metrics such as the FWCI.
Comment: 12 pages, 8 figures
Published: 2024

10. SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic Scenes

Author: Khalili, Boshra and Smyth, Andrew W.
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Object detection as part of computer vision can be crucial for traffic management, emergency response, autonomous vehicles, and smart cities. Despite significant advances in object detection, detecting small objects in images captured by distant cameras remains challenging due to their size, distance from the camera, varied shapes, and cluttered backgrounds. To address these challenges, we propose Small Object Detection YOLOv8 (SOD-YOLOv8), a novel model specifically designed for scenarios involving numerous small objects. Inspired by Efficient Generalized Feature Pyramid Networks (GFPN), we enhance multi-path fusion within YOLOv8 to integrate features across different levels, preserving details from shallower layers and improving small object detection accuracy. Also, a fourth detection layer is added to leverage high-resolution spatial information effectively. The Efficient Multi-Scale Attention Module (EMA) in the C2f-EMA module enhances feature extraction by redistributing weights and prioritizing relevant features. We introduce Powerful-IoU (PIoU) as a replacement for CIoU, focusing on moderate-quality anchor boxes and adding a penalty based on differences between predicted and ground truth bounding box corners. This approach simplifies calculations, speeds up convergence, and enhances detection accuracy. SOD-YOLOv8 significantly improves small object detection, surpassing widely used models in various metrics, without substantially increasing computational cost or latency compared to YOLOv8s. Specifically, it increases recall from 40.1% to 43.9%, precision from 51.2% to 53.9%, $\text{mAP}_{0.5}$ from 40.6% to 45.1%, and $\text{mAP}_{0.5:0.95}$ from 24% to 26.6%. In dynamic real-world traffic scenes, SOD-YOLOv8 demonstrated notable improvements in diverse conditions, proving its reliability and effectiveness in detecting small objects even in challenging environments.
Comment: 15 pages, 14 figures
Published: 2024

11. Contrastive Learning of Asset Embeddings from Financial Time Series

Author: Dolphin, Rian, Smyth, Barry, and Dong, Ruihai
Subjects: Computer Science - Machine Learning, Quantitative Finance - Statistical Finance
Abstract: Representation learning has emerged as a powerful paradigm for extracting valuable latent features from complex, high-dimensional data. In financial domains, learning informative representations for assets can be used for tasks like sector classification and risk management. However, the complex and stochastic nature of financial markets poses unique challenges. We propose a novel contrastive learning framework to generate asset embeddings from financial time series data. Our approach leverages the similarity of asset returns over many subwindows to generate informative positive and negative samples, using a statistical sampling strategy based on hypothesis testing to address the noisy nature of financial data. We explore various contrastive loss functions that capture the relationships between assets in different ways to learn a discriminative representation space. Experiments on real-world datasets demonstrate the effectiveness of the learned asset embeddings on benchmark industry classification and portfolio optimization tasks. In each case our novel approaches significantly outperform existing baselines, highlighting the potential for contrastive learning to capture meaningful and actionable relationships in financial data.
Comment: 9 pages, 4 figures, 4 tables
Published: 2024

12. Perceptions of Linguistic Uncertainty by Language Models and Humans

Author: Belem, Catarina G, Kelly, Markelle, Steyvers, Mark, Singh, Sameer, and Smyth, Padhraic
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Uncertainty expressions such as "probably" or "highly unlikely" are pervasive in human language. While prior work has established that there is population-level agreement in terms of how humans quantitatively interpret these expressions, there has been little inquiry into the abilities of language models in the same context. In this paper, we investigate how language models map linguistic expressions of uncertainty to numerical responses. Our approach assesses whether language models can employ theory of mind in this setting: understanding the uncertainty of another agent about a particular statement, independently of the model's own certainty about that statement. We find that 7 out of 10 models are able to map uncertainty expressions to probabilistic responses in a human-like manner. However, we observe systematically different behavior depending on whether a statement is actually true or false. This sensitivity indicates that language models are substantially more susceptible to bias based on their prior knowledge (as compared to humans). These findings raise important questions and have broad implications for human-AI and AI-AI communication.
Comment: Accepted at EMNLP 2024 (Main)
Published: 2024

13. JANET: Joint Adaptive predictioN-region Estimation for Time-series

Author: English, Eshant, Wong-Toi, Eliot, Fontana, Matteo, Mandt, Stephan, Smyth, Padhraic, and Lippert, Christoph
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning, Statistics - Methodology
Abstract: Conformal prediction provides machine learning models with prediction sets that offer theoretical guarantees, but the underlying assumption of exchangeability limits its applicability to time series data. Furthermore, existing approaches struggle to handle multi-step ahead prediction tasks, where uncertainty estimates across multiple future time points are crucial. We propose JANET (Joint Adaptive predictioN-region Estimation for Time-series), a novel framework for constructing conformal prediction regions that are valid for both univariate and multivariate time series. JANET generalises the inductive conformal framework and efficiently produces joint prediction regions with controlled K-familywise error rates, enabling flexible adaptation to specific application needs. Our empirical evaluation demonstrates JANET's superior performance in multi-step prediction tasks across diverse time series datasets, highlighting its potential for reliable and interpretable uncertainty quantification in sequential data.
Comment: Alternate Title: Conformalised Joint Prediction Region for Time Series
Published: 2024

14. Anomaly Detection of Tabular Data Using LLMs

Author: Li, Aodong, Zhao, Yunhan, Qiu, Chen, Kloft, Marius, Smyth, Padhraic, Rudolph, Maja, and Mandt, Stephan
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Large language models (LLMs) have shown their potential in long-context understanding and mathematical reasoning. In this paper, we study the problem of using LLMs to detect tabular anomalies and show that pre-trained LLMs are zero-shot batch-level anomaly detectors. That is, without extra distribution-specific model fitting, they can discover hidden outliers in a batch of data, demonstrating their ability to identify low-density data regions. For LLMs that are not well aligned with anomaly detection and frequently output factual errors, we apply simple yet effective data-generating processes to simulate synthetic batch-level anomaly detection datasets and propose an end-to-end fine-tuning strategy to bring out the potential of LLMs in detecting real anomalies. Experiments on a large anomaly detection benchmark (ODDS) showcase i) GPT-4 has on-par performance with the state-of-the-art transductive learning-based anomaly detection methods and ii) the efficacy of our synthetic dataset and fine-tuning strategy in aligning LLMs to this task.
Comment: Accepted at the Anomaly Detection with Foundation Models workshop
Published: 2024

15. Perils of current DAO governance

Author: Kharman, Aida Manzano and Smyth, Ben
Subjects: Computer Science - Computers and Society
Abstract: DAO Governance is currently broken. We survey the state of the art and find worrying conclusions. Vote buying, vote selling and coercion are easy. The wealthy rule, decentralisation is a myth. Hostile take-overs are incentivised. Ballot secrecy is non-existent or short-lived, despite being a human right. Verifiability is achieved at the expense of privacy. These privacy concerns are highlighted with case study analyses of Vocdoni's governance protocol. This work presents two contributions: firstly a review of current DAO governance protocols, and secondly, an illustration of their vulnerabilities, showcasing the privacy and security threats these entail.
Published: 2024

16. Ocean weather, biological rates, and unexplained global ecological patterns.

Author: Li Shing Hiung, Darren, Schuster, Jasmin, Duncan, Murray, Payne, Nicholas, Helmuth, Brian, Chu, Jackson, Baum, Julia, Brambilla, Viviana, Bruno, John, Davies, Sarah, Dornelas, Maria, Gagnon, Patrick, Guy-Haim, Tamar, Jackson, Jennifer, Leichter, James, Madin, Joshua, Monteith, Zachary, Queirós, Ana, Schneider, Eric, Starko, Samuel, Talwar, Brendan, Wyatt, Alex, Aichelman, Hannah, Bensoussan, Nathaniel, Caruso, Carlo, Castillo, Karl, Choi, Francis, Dong, Yun-Wei, Garrabou, Joaquim, Guillemain, Dorian, Higgs, Nicholas, Jiang, Yuwu, Kersting, Diego, Kushner, David, Longo, Guilherme, Neufeld, Christopher, Peirache, Marion, Smyth, Tim, Sprague, Joshua, Urvoy, Gaëlle, Zuberer, Frederic, and Bates, Amanda
Subjects: biological rate, climate variability hypothesis, high frequency, in situ, ocean temperature
Abstract: As on land, oceans exhibit high temporal and spatial temperature variation. This ocean weather contributes to the physiological and ecological processes that ultimately determine the patterns of species distribution and abundance, yet is often unrecognized, especially in tropical oceans. Here, we tested the paradigm of temperature stability in shallow waters (
Published: 2024

17. Smooth connectivity in real algebraic varieties

Author: Cummings, Joseph, Hauenstein, Jonathan D., Hong, Hoon, and Smyth, Clifford D.
Subjects: Mathematics - Algebraic Geometry, 65H14, 14Q30
Abstract: A standard question in real algebraic geometry is to compute the number of connected components of a real algebraic variety in affine space. By adapting an approach for determining connectivity in complements of real hypersurfaces by Hong, Rohal, Safey El Din, and Schost, algorithms are presented for computing the number of connected components, the Euler characteristic, and deciding the connectivity between two points for a smooth manifold arising as the complement of a real hypersurface of a real algebraic variety. When taking such a real hypersurface to be the set of singular points, this yields an approach for determining smooth connectivity in a real algebraic variety. The method is based upon gradient ascent/descent paths on the real algebraic variety and several examples are included to demonstrate the approach.
Comment: 19 pages, 7 figures
Published: 2024

18. Searching for Free-Floating Planets with TESS: A Few Words of Clarification

Author: Kunimoto, Michelle, DeRocco, William, Smyth, Nolan, and Bryson, Steve
Subjects: Astrophysics - Earth and Planetary Astrophysics, Astrophysics - Instrumentation and Methods for Astrophysics
Abstract: We recently described the results of an initial search through TESS Sector 61 for free-floating planets. In this short note, we provide important context for our results and clarify the language used in our initial manuscript to ensure that our intended message is appropriately conveyed.
Comment: 2 pages; note regarding arXiv:2404.11666
Published: 2024

19. Fine-tuning Protein Language Models with Deep Mutational Scanning improves Variant Effect Prediction

Author: Lafita, Aleix, Gonzalez, Ferran, Hossam, Mahmoud, Smyth, Paul, Deasy, Jacob, Allyn-Feuer, Ari, Seaton, Daniel, and Young, Stephen
Subjects: Quantitative Biology - Genomics, Computer Science - Machine Learning
Abstract: Protein Language Models (PLMs) have emerged as performant and scalable tools for predicting the functional impact and clinical significance of protein-coding variants, but they still lag experimental accuracy. Here, we present a novel fine-tuning approach to improve the performance of PLMs with experimental maps of variant effects from Deep Mutational Scanning (DMS) assays using a Normalised Log-odds Ratio (NLR) head. We find consistent improvements in a held-out protein test set, and on independent DMS and clinical variant annotation benchmarks from ProteinGym and ClinVar. These findings demonstrate that DMS is a promising source of sequence diversity and supervised training data for improving the performance of PLMs for variant effect prediction.
Comment: Machine Learning for Genomics Explorations workshop at ICLR 2024
Published: 2024

20. Surrogate Modeling of Trajectory Map-matching in Urban Road Networks using Transformer Sequence-to-Sequence Model

Author: Mohammadi, Sevin and Smyth, Andrew W.
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computational Engineering, Finance, and Science
Abstract: Large-scale geolocation telematics data acquired from connected vehicles has the potential to significantly enhance mobility infrastructures and operational systems within smart cities. To effectively utilize this data, it is essential to accurately match the geolocation data to the road segments. However, this matching is often not trivial due to the low sampling rate and errors exacerbated by multipath effects in urban environments. Traditionally, statistical modeling techniques such as hidden Markov models incorporating domain knowledge into the matching process have been extensively used for map-matching tasks. However, rule-based map-matching tasks are noise-sensitive and inefficient in processing large-scale trajectory data. Deep learning techniques directly learn the relationship between observed data and road networks from the data, often without the need for hand-crafted rules or domain knowledge. This renders them an efficient approach for map-matching large-scale datasets and more robust to noise. This paper introduces a deep-learning model, specifically the transformer-based encoder-decoder model, to perform as a surrogate for offline map-matching algorithms. The encoder-decoder architecture initially encodes the series of noisy GPS points into a representation that automatically captures autoregressive behavior and spatial correlations between GPS points. Subsequently, the decoder associates data points with the road network features and thus transforms these representations into a sequence of road segments. The model is trained and evaluated using GPS traces collected in Manhattan, New York. Achieving an accuracy of 75%, transformer-based encoder-decoder models, extensively employed in natural language processing, showed promising performance for translating noisy GPS data to the navigated routes in urban road networks.
Comment: 15 pages, 10 figures
Published: 2024

21. Searching for Free-Floating Planets with TESS: I. Discovery of a First Terrestrial-Mass Candidate

Author: Kunimoto, Michelle, DeRocco, William, Smyth, Nolan, and Bryson, Steve
Subjects: Astrophysics - Earth and Planetary Astrophysics, Astrophysics - Astrophysics of Galaxies, Astrophysics - Instrumentation and Methods for Astrophysics
Abstract: Though free-floating planets (FFPs) that have been ejected from their natal star systems may outpopulate their bound counterparts in the terrestrial-mass range, they remain one of the least explored exoplanet demographics. Due to their negligible electromagnetic emission at all wavelengths, the only observational technique able to detect these worlds is gravitational microlensing. Microlensing by terrestrial-mass FFPs induces rare, short-duration magnifications of background stars, requiring high-cadence, wide-field surveys to detect these events. The Transiting Exoplanet Survey Satellite (TESS), though designed to detect close-bound exoplanets via the transit technique, boasts a cadence as short as 200 seconds and has monitored hundreds of millions of stars, making it well-suited to search for short-duration microlensing events as well. We have used existing data products from the TESS Quick-Look Pipeline (QLP) to perform a preliminary search for FFP microlensing candidates in 1.3 million light curves from TESS Sector 61. We find one compelling candidate associated with TIC-107150013, a source star at $d_s = 3.194$ kpc. The event has a duration $t_E = 0.074^{+0.002}_{-0.002}$ days and shows prominent finite-source features ($\rho = 4.55^{+0.08}_{-0.07}$), making it consistent with an FFP in the terrestrial-mass range. This exciting result indicates that our ongoing search through all TESS sectors has the opportunity to shed new light on this enigmatic population of worlds.
Comment: 10 pages, 7 figures, submitted to MNRAS
Published: 2024

22. Dynamic Conditional Optimal Transport through Simulation-Free Flows

Author: Kerrigan, Gavin, Migliorini, Giosue, and Smyth, Padhraic
Subjects: Computer Science - Machine Learning
Abstract: We study the geometry of conditional optimal transport (COT) and prove a dynamical formulation which generalizes the Benamou-Brenier Theorem. Equipped with these tools, we propose a simulation-free flow-based method for conditional generative modeling. Our method couples an arbitrary source distribution to a specified target distribution through a triangular COT plan, and a conditional generative model is obtained by approximating the geodesic path of measures induced by this COT plan. Our theory and methods are applicable in infinite-dimensional settings, making them well suited for a wide class of Bayesian inverse problems. Empirically, we demonstrate that our method is competitive on several challenging conditional generation tasks, including an infinite-dimensional inverse problem.
Published: 2024

23. Constraints on Primordial Black Holes from $N$-body simulations of the Eridanus II Stellar Cluster

Author: Koulen, Julia Monika, Profumo, Stefano, and Smyth, Nolan
Subjects: Astrophysics - Cosmology and Nongalactic Astrophysics, High Energy Physics - Phenomenology
Abstract: The tidal disruption of old, compact stellar structures provides strong constraints on macroscopic dark matter candidates such as primordial black holes. In view of recent, new observational data on the Eridanus II dwarf galaxy and on its central stellar cluster, we employ, for the first time, $N$-body simulations to assess the impact of compact massive dark matter candidates on the gravitational stability of the cluster. We find evidence that such candidates must be lighter than about one solar mass if they constitute the totality of the dark matter. We additionally derive robust constraints on the fraction of the dark matter in macroscopic objects as a function of mass, by suitably modeling the remainder of the dark matter as standard fluid-like cold dark matter.
Comment: 17 pages, 6 figures
Published: 2024

24. ‘Why can't people see or understand or make that effort to see who I am?’: documenting the experiences of low socioeconomic students at an elite tertiary institution through a social identity lens

Author: Walker, Sarah, Campbell, Sai, Smyth, Lillian, Platow, Michael J., Venville, Grady, and Willis, Tania
Published: 2024
Full Text: View/download PDF

25. Identification and epidemiological analysis of a putative novel hantavirus in Australian flying foxes

Author: Smith, Craig S., Underwood, Darren J., Gordon, Anita, Pyne, Michael J., Smyth, Anna, Genge, Benjamin, Driver, Luke, Mayer, David G., and Oakey, Jane
Published: 2024
Full Text: View/download PDF

26. Heterogeneous genetic architectures of prostate cancer susceptibility in sub-Saharan Africa

Author: Janivara, Rohini, Chen, Wenlong C., Hazra, Ujani, Baichoo, Shakuntala, Agalliu, Ilir, Kachambwa, Paidamoyo, Simonti, Corrine N., Brown, Lyda M., Tambe, Saanika P., Kim, Michelle S., Harlemon, Maxine, Jalloh, Mohamed, Muzondiwa, Dillon, Naidoo, Daphne, Ajayi, Olabode O., Snyper, Nana Yaa, Niang, Lamine, Diop, Halimatou, Ndoye, Medina, Mensah, James E., Abrahams, Afua O. D., Biritwum, Richard, Adjei, Andrew A., Adebiyi, Akindele O., Shittu, Olayiwola, Ogunbiyi, Olufemi, Adebayo, Sikiru, Nwegbu, Maxwell M., Ajibola, Hafees O., Oluwole, Olabode P., Jamda, Mustapha A., Pentz, Audrey, Haiman, Christopher A., Spies, Petrus V., van der Merwe, André, Cook, Michael B., Chanock, Stephen J., Berndt, Sonja I., Watya, Stephen, Lubwama, Alexander, Muchengeti, Mazvita, Doherty, Sean, Smyth, Natalie, Lounsbury, David, Fortier, Brian, Rohan, Thomas E., Jacobson, Judith S., Neugut, Alfred I., Hsing, Ann W., Gusev, Alexander, Aisuodionoe-Shadrach, Oseremen I., Joffe, Maureen, Adusei, Ben, Gueye, Serigne M., Fernandez, Pedro W., McBride, Jo, Andrews, Caroline, Petersen, Lindsay N., Lachance, Joseph, and Rebbeck, Timothy R.
Published: 2024
Full Text: View/download PDF

27. Mouse models to investigate in situ cell fate decisions induced by p53

Author: Lieschke, Elizabeth, Thomas, Annabella F, Kueh, Andrew, Atkin-Smith, Georgia K, Baldoni, Pedro L, La Marca, John E, Young, Savannah, Huang, Allan Shuai, Ross, Aisling M, Whelan, Lauren, Kaloni, Deeksha, Tai, Lin, Smyth, Gordon K, Herold, Marco J, Hawkins, Edwin D, Strasser, Andreas, and Kelly, Gemma L
Published: 2024
Full Text: View/download PDF

28. ACE Enquiry in Primary care: A Qualitative Exploration of the Perspective of General Practitioners in Northern Ireland

Author: Smyth, Rafael and McSherry, Dominic
Published: 2024
Full Text: View/download PDF

29. Implementation of homogeneous and heterogeneous tidal arrays in the Inner Sound of the Pentland Firth

Author: Patel, Misha D., Smyth, Amanda S. M., Angeloudis, Athanasios, and Adcock, Thomas A. A.
Published: 2024
Full Text: View/download PDF

30. The Risks for HIV and Sexually Transmitted Infections Among Men Who Have Sex with Men Who Engage in Chemsex in Low- and Middle-Income Countries: A Mixed Methods Systematic Review and Meta-Analysis

Author: Eustaquio, Patrick C., Smyth, Jamie, and Salisi, James A.
Published: 2024
Full Text: View/download PDF

31. Vasospasm and subsequent stroke from paraneoplastic syndrome in a pediatric patient with an intracranial mature teratoma: a case report

Author: Jenson, Amanda V., Rizvi, Ali Yunus, Reynolds, Rebecca A., Hartnett-Wright, Sara, Gellar, Thomas J., Stapleton, Stacie, Gonzalez-Gomez, Ignacio, Akbari, S. Hassan A., and Smyth, Matthew D.
Published: 2024
Full Text: View/download PDF

32. Gafchromic EBT3 film provides equivalent dosimetric performance to EBT-XD film for stereotactic radiosurgery dosimetry

Author: Smyth, Lloyd, Alves, Andrew, Collins, Katherine, and Beveridge, Sabeena
Published: 2024
Full Text: View/download PDF

33. Language-Driven Engineering: An Interdisciplinary Software Development Paradigm

Author: Steffen, Bernhard, Margaria, Tiziana, Bainczyk, Alexander, Boßelmann, Steve, Busch, Daniel, Driessen, Marc, Frohme, Markus, Howar, Falk, Jörges, Sven, Krause, Marvin, Krumrey, Marco, Lamprecht, Anna-Lena, Lybecait, Michael, Murtovi, Alnis, Naujokat, Stefan, Neubauer, Johannes, Schieweck, Alexander, Schürmann, Jonas, Smyth, Steven, Steffen, Barbara, Storek, Fabian, Tegeler, Tim, Teumert, Sebastian, Wirkner, Dominic, and Zweihoff, Philip
Subjects: Computer Science - Software Engineering, Computer Science - Programming Languages
Abstract: We illustrate how purpose-specific, graphical modeling enables application experts with different levels of expertise to collaboratively design and then produce complex applications using their individual, purpose-specific modeling language. Our illustration includes seven graphical Integrated Modeling Environments (IMEs) that support full code generation, as well as four browser-based applications that were modeled and then fully automatically generated and produced using DIME, our most complex graphical IME. While the seven IMEs were chosen to illustrate the types of languages we support with our Language-Driven Engineering (LDE) approach, the four DIME products were chosen to give an impression of the power of our LDE-generated IMEs. In fact, Equinocs, Springer Nature's future editorial system for proceedings, is also being fully automatically generated and then deployed at their Dordrecht site using a deployment pipeline generated with Rig, one of the IMEs presented. Our technology is open source and the products presented are currently in use. Comment: 43 pages, 30 figures
Published: 2024

34. The Calibration Gap between Model and Human Confidence in Large Language Models

Author: Steyvers, Mark, Tejeda, Heliodoro, Kumar, Aakriti, Belem, Catarina, Karny, Sheer, Hu, Xinyue, Mayer, Lukas, and Smyth, Padhraic
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Human-Computer Interaction
Abstract: For large language models (LLMs) to be trusted by humans they need to be well-calibrated in the sense that they can accurately assess and communicate how likely it is that their predictions are correct. Recent work has focused on the quality of internal LLM confidence assessments, but the question remains of how well LLMs can communicate this internal model confidence to human users. This paper explores the disparity between external human confidence in an LLM's responses and the internal confidence of the model. Through experiments involving multiple-choice questions, we systematically examine human users' ability to discern the reliability of LLM outputs. Our study focuses on two key areas: (1) assessing users' perception of true LLM confidence and (2) investigating the impact of tailored explanations on this perception. The research highlights that default explanations from LLMs often lead to user overestimation of both the model's confidence and its accuracy. By modifying the explanations to more accurately reflect the LLM's internal confidence, we observe a significant shift in user perception, aligning it more closely with the model's actual confidence levels. This adjustment in explanatory approach demonstrates potential for enhancing user trust and accuracy in assessing LLM outputs. The findings underscore the importance of transparent communication of confidence levels in LLMs, particularly in high-stakes applications where understanding the reliability of AI-generated information is essential. Comment: 27 pages, 10 figures
Published: 2024

35. Generalized sleep decoding with basal ganglia signals in multiple movement disorders

Author: Yin, Zixiao, Yu, Huiling, Yuan, Tianshuo, Smyth, Clay, Anjum, Md, Zhu, Guanyu, Ma, Ruoyu, Xu, Yichen, An, Qi, Gan, Yifei, Merk, Timon, Qin, Guofan, Xie, Hutao, Zhang, Ning, Wang, Chunxue, Jiang, Yin, Meng, Fangang, Yang, Anchao, Neumann, Wolf-Julian, Li, Luming, Zhang, Jianguo, Starr, Philip, and Little, Simon
Abstract: Sleep disturbances profoundly affect the quality of life in individuals with neurological disorders. Closed-loop deep brain stimulation (DBS) holds promise for alleviating sleep symptoms; however, this technique requires automated sleep stage decoding from intracranial signals. We leveraged overnight data from 121 patients with movement disorders (Parkinson's disease, essential tremor, dystonia, Huntington's disease, and Tourette's syndrome) in whom synchronized polysomnograms and basal ganglia local field potentials were recorded, to develop a generalized, multi-class, sleep-specific decoder, BGOOSE. This generalized model achieved 85% average accuracy across patients and across disease conditions, even in the presence of recordings from different basal ganglia targets. Furthermore, we investigated the role of electrocorticography in decoding performance and proposed an optimal decoding map, which was shown to facilitate channel selection for optimal model performance. BGOOSE emerges as a powerful tool for generalized sleep decoding, offering exciting potential for precise delivery of DBS stimulation and better management of sleep disturbances in movement disorders.
Published: 2024

36. Probabilistic Modeling for Sequences of Sets in Continuous-Time

Author: Chang, Yuxin, Boyd, Alex, and Smyth, Padhraic
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Neural marked temporal point processes have been a valuable addition to the existing toolbox of statistical parametric models for continuous-time event data. These models are useful for sequences where each event is associated with a single item (a single type of event or a "mark") -- but such models are not suited for the practical situation where each event is associated with a set of items. In this work, we develop a general framework for modeling set-valued data in continuous-time, compatible with any intensity-based recurrent neural point process model. In addition, we develop inference methods that can use such models to answer probabilistic queries such as "the probability of item $A$ being observed before item $B$," conditioned on sequence history. Computing exact answers for such queries is generally intractable for neural models due to both the continuous-time nature of the problem setting and the combinatorially-large space of potential outcomes for each event. To address this, we develop a class of importance sampling methods for querying with set-based sequences and demonstrate orders-of-magnitude improvements in efficiency over direct sampling via systematic experiments with four real-world datasets. We also illustrate how to use this framework to perform model selection using likelihoods that do not involve one-step-ahead prediction. Comment: Oral presentation at AISTATS 2024
Published: 2023

37. New Light on Dark Extended Lenses with the Roman Space Telescope

Author: DeRocco, William, Smyth, Nolan, and Takhistov, Volodymyr
Subjects: Astrophysics - Cosmology and Nongalactic Astrophysics, High Energy Physics - Phenomenology
Abstract: The Roman Space Telescope's Galactic Bulge Time Domain Survey will constitute the most sensitive microlensing survey of the Galactic Bulge to date, opening up new opportunities to search for dark matter (DM). Many extensions of the Standard Model predict the formation of extended DM substructures, such as DM subhalos, boson/axion stars, and halo-dressed primordial black holes. We demonstrate that for such targets, Roman will be sensitive to a broad parameter space up to four orders of magnitude below existing constraints. Our analysis can be readily applied to other extended DM configurations as well. Comment: 9 pages, 2 figures; v2 updated to match accepted ApJL version
Published: 2023
Full Text: View/download PDF

38. Penetration Testing and Legacy Systems

Author: Smyth, Sandra
Subjects: Computer Science - Software Engineering
Abstract: As per Adusumilli (2015), '70% of corporate business systems today are legacy applications. Recent statistics prove that over 60% of IT budget is spent on maintaining these Legacy systems, showing the rigidity and the fragile nature of these systems.' Usually, testing is included during the software development cycle, using testing techniques such as unit testing, integration testing, and system testing before releasing the product. After the software product is released to production, no additional testing is done; testing resumes only when modifications are made. Techniques such as regression testing are included to ensure the changes do not affect existing functionality, but nonfunctional features are rarely included in the scope of such regression tests. Schrader (2021) affirms that 'legacy systems are often maintained only to ensure function,' and IT organizations may fail to consider the cybersecurity perspective needed to remain secure. Legacy systems are a high-risk component for the organization that must be carefully considered when structuring a cybersecurity strategy. This paper aims to help the reader understand some measures that can be taken to secure legacy systems, explaining what penetration testing is and how this testing technique can help secure legacy systems. Keywords: Testing, legacy, security, risks, prevention, mitigation, pentesting.
Published: 2023

39. Bayesian Online Learning for Consensus Prediction

Author: Showalter, Sam, Boyd, Alex, Smyth, Padhraic, and Steyvers, Mark
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Given a pre-trained classifier and multiple human experts, we investigate the task of online classification where model predictions are provided for free but querying humans incurs a cost. In this practical but under-explored setting, oracle ground truth is not available. Instead, the prediction target is defined as the consensus vote of all experts. Given that querying full consensus can be costly, we propose a general framework for online Bayesian consensus estimation, leveraging properties of the multivariate hypergeometric distribution. Based on this framework, we propose a family of methods that dynamically estimate expert consensus from partial feedback by producing a posterior over expert and model beliefs. Analyzing this posterior induces an interpretable trade-off between querying cost and classification performance. We demonstrate the efficacy of our framework against a variety of baselines on CIFAR-10H and ImageNet-16H, two large-scale crowdsourced datasets.
Published: 2023

40. Peer Leader Perspectives from a PLTL Implementation in a Hispanic-Serving Institution

Author: Madhavan Narayanan, Kasey Powers, Dhananjaya Premawardena, Kelly Colby, Janet Liou Mark, Nagaraj Rao, Davida S. Smyth, and Mary Knopp-Kelly
Abstract: Peer-Led Team Learning (PLTL) is a pedagogical approach that has been shown to benefit all students, especially underrepresented minority students and peer leaders in Science, Technology, Engineering, and Mathematics (STEM) disciplines. In this work, we present results from our study of the impact of PLTL on our peer leaders from a controlled implementation in general biology, general chemistry, and statistics courses at a Hispanic-serving, minority-serving institution. More specifically, we have measured our PLTL program's impact on our peer leaders' skill development, engagement with the subject material, and sense of belonging as peer leaders. Weekly peer leader reflections analyzed using the Dreyfus model exhibited a consistent set of skills, while those analyzed using the Pazos model revealed a consistent type of student-peer leader interactions, allowing for peer leaders to be assigned to specific levels in the hierarchy of each of the models. Analysis of eight skill-based Likert-scale questions on the SALG survey showed an overall positive shift at the highest level. Independent of the skill or interaction level of the peer leader, we observed several instances of peer leaders acknowledging development in their communication skills, sincere attempts at creating an engaging classroom, and a deep investment in their students' success. Peer leaders also reported improvements in understanding of the subjects they were teaching, wanting to persevere and solve problems independently, and feeling passionate about helping other students.
Published: 2023

41. Metformin and small for gestational age babies: findings of a randomised placebo-controlled clinical trial of metformin in gestational diabetes (EMERGE)

Author: Dunne, Fidelma, Newman, Christine, Alvarez-Iglesias, Alberto, O’Shea, Paula, Devane, Declan, Gillespie, Paddy, Egan, Aoife, O’Donnell, Martin, and Smyth, Andrew
Published: 2024
Full Text: View/download PDF

42. Subtropical stormwater ponds are more frequently net nitrogen fixing compared to natural ponds

Author: Goeckner, Audrey H., Smyth, Ashley R., Holgerson, Meredith A., and Reisinger, Alexander J.
Published: 2024
Full Text: View/download PDF

43. Feasibility and acceptability of measuring prenatal stress in daily life using smartphone-based ecological momentary assessment and wearable physiological monitors

Author: Tung, Irene, Balaji, Uma, Hipwell, Alison E., Low, Carissa A., and Smyth, Joshua M.
Published: 2024
Full Text: View/download PDF

44. Higher education retention in Ireland and Scotland: the role of admissions policies

Author: Iannelli, Cristina, McMullin, Patricia, and Smyth, Emer
Published: 2024
Full Text: View/download PDF

45. Isoform-specific RNA structure determination using Nano-DMS-MaP

Author: Gribling-Burrer, Anne-Sophie, Bohn, Patrick, and Smyth, Redmond P.
Published: 2024
Full Text: View/download PDF

46. Enablers and Barriers of Online Mindfulness-Based Interventions for Informal Carers: A Mixed-Methods Systematic Review

Author: Abeysinghe Mudiyanselage, Charunya Amilani Kumarihami Rambukwella, Ewens, Beverley, Smyth, Aisling, Dickson, Joanne, and Ang, Seng Giap Marcus
Published: 2024
Full Text: View/download PDF

47. MRI and pathology comparisons in Rasmussen’s encephalitis: a multi-institutional examination of hemispherotomy outcomes relative to imaging and histological severity

Author: Doherty, Alexander, Knudson, Kathleen, Fuller, Christine, Leach, James L., Wang, Anthony C., Marupudi, Neena, Han, Rowland H., Tomko, Stuart, Ojemann, Jeff, Smyth, Matthew D., Mangano, Francesco, and Skoch, Jesse
Published: 2024
Full Text: View/download PDF

48. Treatment Modalities for Insomnia in Adults Aged 55 and Older: A Systematic Review of Literature from 2018 to 2023

Author: McPhillips, Miranda V., Petrovsky, Darina V., Lorenz, Rebecca, Lee, Jiwon, George, Tessy, Smyth, Aisling, Bubu, Omonigho Michael, and Brewster, Glenna S.
Published: 2024
Full Text: View/download PDF

49. Is your vote truly secret? Ballot Secrecy iff Ballot Independence: Proving necessary conditions and analysing case studies

Author: Kharman, Aida Manzano, Smyth, Ben, and Page, Freddie
Subjects: Computer Science - Cryptography and Security
Abstract: We formalise definitions of ballot secrecy and ballot independence by Smyth, JCS'21 as indistinguishability games in the computational model of security. These definitions improve upon Smyth, draft '21 to consider a wider class of voting systems. Both Smyth, JCS'21 and Smyth, draft '21 improve on earlier works by considering a more realistic adversary model wherein they have access to the ballot collection. We prove that ballot secrecy implies ballot independence. We say ballot independence holds if a system has non-malleable ballots. We construct games for ballot secrecy and non-malleability and show that voting schemes with malleable ballots do not preserve ballot secrecy. We demonstrate that Helios does not satisfy our definition of ballot secrecy. Furthermore, the Python framework we constructed for our case study shows that if an attack exists against non-malleability, this attack can be used to break ballot secrecy.
Published: 2023

50. Rogue worlds meet the dark side: revealing terrestrial-mass primordial black holes with the Nancy Grace Roman Space Telescope

Author: DeRocco, William, Frangipane, Evan, Hamer, Nick, Profumo, Stefano, and Smyth, Nolan
Subjects: Astrophysics - Cosmology and Nongalactic Astrophysics, Astrophysics - Earth and Planetary Astrophysics, Astrophysics - Astrophysics of Galaxies, High Energy Physics - Phenomenology
Abstract: Gravitational microlensing is one of the strongest observational techniques to observe non-luminous astrophysical bodies. Existing microlensing observations provide tantalizing evidence of a population of low-mass objects whose origin is unknown. These events may be caused by terrestrial-mass free-floating planets or by exotic objects such as primordial black holes. However, the nature of these objects cannot be resolved on an event-by-event basis, as the induced light curve is degenerate for lensing bodies of identical mass. One must instead statistically compare distributions of lensing events to determine the nature of the lensing population. While existing surveys lack the statistics required to identify multiple subpopulations of lenses, this will change with the launch of the Nancy Grace Roman Space Telescope. Roman's Galactic Bulge Time Domain Survey is expected to observe hundreds of low-mass microlensing events, enabling a robust statistical characterization of this population. In this paper, we show that by exploiting features in the distribution of lensing event durations, Roman will be sensitive to a subpopulation of primordial black holes hidden amongst a background of free-floating planets. Roman's reach will extend to primordial black hole dark matter fractions as low as $f_\text{PBH} = 10^{-4}$ at peak sensitivity, and will be able to conclusively determine the origin of existing ultrashort-timescale microlensing events. A positive detection would provide evidence that a significant fraction of the cosmological dark matter consists of macroscopic, non-luminous objects. Comment: 11 pages, 6 figures
Published: 2023
Full Text: View/download PDF
