Author: "Winston, A." - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Winston, A."' showing total 97,763 results

Start Over Author "Winston, A."

97,763 results on '"Winston, A."'

1. Vocational-Technical Physics Project. Instructor's Manual. Field Test Edition.

Author: Forsyth Technical Inst., Winston-Salem, NC.
Abstract: This instructor's manual in vocational physics consists of five modules: Jacks, Thermometers, The Alternator, The Pool Table, and The Radiator. It is an individualized approach, designed for use with accompanying student manuals on each of the individual modules. Each module in the instructor's manual consists of a general description plus an outline of student objectives, prerequisites laboratory exercises, equipment and supplies, audiovisual materials, tests, instructional strategies, and estimated completion time. Appended to the manual is more specific information about equipment and supplies, including sources of supplies and costs. Tests and keys for each of the modules are appended. (NJ)
Published: 2024

2. Vocational-Technical Physics Project. Thermometers: I. Temperature and Heat, II. Expansion Thermometers, III. Electrical Thermometers. Field Test Edition.

Author: Forsyth Technical Inst., Winston-Salem, NC.
Abstract: This vocational physics individualized student instructional module on thermometers consists of the three units: Temperature and heat, expansion thermometers, and electrical thermometers. Designed with a laboratory orientation, experiments are included on linear expansion; making a bimetallic thermometer, a liquid-in-gas thermometer, and a gas thermometer; making, testing, and using thermocouples; comparing thermistors with ordinary materials, and calibrating a thermistor. Laboratory data sheets, illustrative drawings, review questions, student prerequisites, and objectives are also included in the module. (NJ)
Published: 2024

3. Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Author: Liu, Zuyan, Dong, Yuhao, Liu, Ziwei, Hu, Winston, Lu, Jiwen, and Rao, Yongming
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Visual data comes in various forms, ranging from small icons of just a few pixels to long videos spanning hours. Existing multi-modal LLMs usually standardize these diverse visual inputs to a fixed resolution for visual encoders and yield similar numbers of tokens for LLMs. This approach is non-optimal for multimodal understanding and inefficient for processing inputs with long and short visual contents. To solve the problem, we propose Oryx, a unified multimodal architecture for the spatial-temporal understanding of images, videos, and multi-view 3D scenes. Oryx offers an on-demand solution to seamlessly and efficiently process visual inputs with arbitrary spatial sizes and temporal lengths through two core innovations: 1) a pre-trained OryxViT model that can encode images at any resolution into LLM-friendly visual representations; 2) a dynamic compressor module that supports 1x to 16x compression on visual tokens by request. These design features enable Oryx to accommodate extremely long visual contexts, such as videos, with lower resolution and high compression while maintaining high recognition precision for tasks like document understanding with native resolution and no compression. Beyond the architectural improvements, enhanced data curation and specialized training on long-context retrieval and spatial-aware data help Oryx achieve strong capabilities in image, video, and 3D multimodal understanding simultaneously. Our work is open-sourced at https://github.com/Oryx-mllm/Oryx.
Published: 2024

4. Revisiting Semi-supervised Adversarial Robustness via Noise-aware Online Robust Distillation

Author: Wu, Tsung-Han, Su, Hung-Ting, Chen, Shang-Tse, and Hsu, Winston H.
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: The robust self-training (RST) framework has emerged as a prominent approach for semi-supervised adversarial training. To explore the possibility of tackling more complicated tasks with even lower labeling budgets, unlike prior approaches that rely on robust pretrained models, we present SNORD - a simple yet effective framework that introduces contemporary semi-supervised learning techniques into the realm of adversarial training. By enhancing pseudo labels and managing noisy training data more effectively, SNORD showcases impressive, state-of-the-art performance across diverse datasets and labeling budgets, all without the need for pretrained models. Compared to full adversarial supervision, SNORD achieves a 90% relative robust accuracy under epsilon = 8/255 AutoAttack, requiring less than 0.1%, 2%, and 10% labels for CIFAR-10, CIFAR-100, and TinyImageNet-200, respectively. Additional experiments confirm the efficacy of each component and demonstrate the adaptability of integrating SNORD with existing adversarial pretraining strategies to further bolster robustness., Comment: 12 pages, 4 figures, 9 tables
Published: 2024

5. Synchronization Control-Plane Protocol for Quantum Link Layer

Author: Ru, Brandon, Seah, Winston K. G., and Valera, Alvin C.
Subjects: Computer Science - Networking and Internet Architecture
Abstract: Heralded entanglement generation between nodes of a future quantum internet is a fundamental operation that unlocks the potential for quantum communication. In this paper, we propose a decentralized synchronization protocol that operates at the classical control-plane of the link layer, to navigate the coordination challenges of generating heralded entanglement across few-qubit quantum network nodes. Additionally, with quantum network simulations using NetSquid, we show that our protocol achieves lower entanglement request latencies than a naive distributed queue approach. We observe a sixfold reduction in average request latency growth as the number of quantum network links increases. The Eventual Synchronization Protocol (ESP) allows nodes to coordinate on heralded entanglement generation in a scalable manner within multi-peer quantum networks. To the best of our knowledge, this is the first decentralized synchronization protocol for managing heralded entanglement requests., Comment: accepted by the 20th International Conference on Network and Service Management (CNSM), 2024
Published: 2024

6. DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement

Author: Liao, Jia-Wei, Wang, Winston, Wang, Tzu-Sian, Peng, Li-Xuan, Weng, Ju-Hsuan, Chou, Cheng-Fu, and Chen, Jun-Cheng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: With the success of Diffusion Models for image generation, the technologies also have revolutionized the aesthetic Quick Response (QR) code generation. Despite significant improvements in visual attractiveness for the beautified codes, their scannabilities are usually sacrificed and thus hinder their practical uses in real-world scenarios. To address this issue, we propose a novel Diffusion-based QR Code generator (DiffQRCoder) to effectively craft both scannable and visually pleasing QR codes. The proposed approach introduces Scanning-Robust Perceptual Guidance (SRPG), a new diffusion guidance for Diffusion Models to guarantee the generated aesthetic codes to obey the ground-truth QR codes while maintaining their attractiveness during the denoising process. Additionally, we present another post-processing technique, Scanning Robust Manifold Projected Gradient Descent (SR-MPGD), to further enhance their scanning robustness through iterative latent space optimization. With extensive experiments, the results demonstrate that our approach not only outperforms other compared methods in Scanning Success Rate (SSR) with better or comparable CLIP aesthetic score (CLIP-aes.) but also significantly improves the SSR of the ControlNet-only approach from 60% to 99%. The subjective evaluation indicates that our approach achieves promising visual attractiveness to users as well. Finally, even with different scanning angles and the most rigorous error tolerance settings, our approach robustly achieves over 95% SSR, demonstrating its capability for real-world applications.
Published: 2024

7. Distribution Discrepancy and Feature Heterogeneity for Active 3D Object Detection

Author: Chen, Huang-Yu, Yeh, Jia-Fong, Liao, Jia-Wei, Peng, Pin-Hsuan, and Hsu, Winston H.
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: LiDAR-based 3D object detection is a critical technology for the development of autonomous driving and robotics. However, the high cost of data annotation limits its advancement. We propose a novel and effective active learning (AL) method called Distribution Discrepancy and Feature Heterogeneity (DDFH), which simultaneously considers geometric features and model embeddings, assessing information from both the instance-level and frame-level perspectives. Distribution Discrepancy evaluates the difference and novelty of instances within the unlabeled and labeled distributions, enabling the model to learn efficiently with limited data. Feature Heterogeneity ensures the heterogeneity of intra-frame instance features, maintaining feature diversity while avoiding redundant or similar instances, thus minimizing annotation costs. Finally, multiple indicators are efficiently aggregated using Quantile Transform, providing a unified measure of informativeness. Extensive experiments demonstrate that DDFH outperforms the current state-of-the-art (SOTA) methods on the KITTI and Waymo datasets, effectively reducing the bounding box annotation cost by 56.3% and showing robustness when working with both one-stage and two-stage models., Comment: Accepted to CoRL 2024
Published: 2024

8. Context-Aware Replanning with Pre-explored Semantic Map for Object Navigation

Author: Su, Hung-Ting, Chen, Ching-Yuan, Ko, Po-Chen, Yeh, Jia-Fong, Sun, Min, and Hsu, Winston H.
Subjects: Computer Science - Robotics
Abstract: Pre-explored Semantic Maps, constructed through prior exploration using visual language models (VLMs), have proven effective as foundational elements for training-free robotic applications. However, existing approaches assume the map's accuracy and do not provide effective mechanisms for revising decisions based on incorrect maps. To address this, we introduce Context-Aware Replanning (CARe), which estimates map uncertainty through confidence scores and multi-view consistency, enabling the agent to revise erroneous decisions stemming from inaccurate maps without requiring additional labels. We demonstrate the effectiveness of our proposed method by integrating it with two modern mapping backbones, VLMaps and OpenMask3D, and observe significant performance improvements in object navigation tasks. More details can be found on the project page: https://carmaps.github.io/supplements/., Comment: CoRL 2024. The first three authors contributed equally, and their order of authorship is interchangeable. Project page: https://carmaps.github.io/supplements/
Published: 2024

9. Incorporating Like-Minded Peers to Overcome Friend Data Sparsity in Session-Based Social Recommendations

Author: An, Chunyan, Li, Yunhan, Yang, Qiang, Seah, Winston K. G., Li, Zhixu, and Yang, Conghao
Subjects: Computer Science - Social and Information Networks, Computer Science - Artificial Intelligence
Abstract: Session-based Social Recommendation (SSR) leverages social relationships within online networks to enhance the performance of Session-based Recommendation (SR). However, existing SSR algorithms often encounter the challenge of "friend data sparsity". Moreover, significant discrepancies can exist between the purchase preferences of social network friends and those of the target user, reducing the influence of friends relative to the target user's own preferences. To address these challenges, this paper introduces the concept of "Like-minded Peers" (LMP), representing users whose preferences align with the target user's current session based on their historical sessions. This is the first work, to our knowledge, that uses LMP to enhance the modeling of social influence in SSR. This approach not only alleviates the problem of friend data sparsity but also effectively incorporates users with similar preferences to the target user. We propose a novel model named Transformer Encoder with Graph Attention Aggregator Recommendation (TEGAARec), which includes the TEGAA module and the GAT-based social aggregation module. The TEGAA module captures and merges both long-term and short-term interests for target users and LMP users. Concurrently, the GAT-based social aggregation module is designed to aggregate the target users' dynamic interests and social influence in a weighted manner. Extensive experiments on four real-world datasets demonstrate the efficacy and superiority of our proposed model and ablation studies are done to illustrate the contributions of each component in TEGAARec., Comment: None
Published: 2024

10. HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics

Author: Faure, Gueter Josmy, Yeh, Jia-Fong, Chen, Min-Hung, Su, Hung-Ting, Hsu, Winston H., and Lai, Shang-Hong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Existing research often treats long-form videos as extended short videos, leading to several limitations: inadequate capture of long-range dependencies, inefficient processing of redundant information, and failure to extract high-level semantic concepts. To address these issues, we propose a novel approach that more accurately reflects human cognition. This paper introduces HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics, a model that simulates episodic memory accumulation to capture action sequences and reinforces them with semantic knowledge dispersed throughout the video. Our work makes two key contributions: First, we develop an Episodic COmpressor (ECO) that efficiently aggregates crucial representations from micro to semi-macro levels, overcoming the challenge of long-range dependencies. Second, we propose a Semantics ReTRiever (SeTR) that enhances these aggregated representations with semantic information by focusing on the broader context, dramatically reducing feature dimensionality while preserving relevant macro-level information. This addresses the issues of redundancy and lack of high-level concept extraction. Extensive experiments demonstrate that HERMES achieves state-of-the-art performance across multiple long-video understanding benchmarks in both zero-shot and fully-supervised settings., Comment: This is an improved and expanded version of our EVAL-FoMo Workshop at ECCV'24 (v1 of this paper). Project page: https://joslefaure.github.io/assets/html/hermes.html
Published: 2024

11. Error-controlled non-additive interaction discovery in machine learning models

Author: Chen, Winston, Jiang, Yifan, Noble, William Stafford, and Lu, Yang Young
Subjects: Computer Science - Machine Learning, Statistics - Applications, Statistics - Machine Learning
Abstract: Machine learning (ML) models are powerful tools for detecting complex patterns within data, yet their "black box" nature limits their interpretability, hindering their use in critical domains like healthcare and finance. To address this challenge, interpretable ML methods have been developed to explain how features influence model predictions. However, these methods often focus on univariate feature importance, overlooking the complex interactions between features that ML models are capable of capturing. Recognizing this limitation, recent efforts have aimed to extend these methods to discover feature interactions, but existing approaches struggle with robustness and error control, especially under data perturbations. In this study, we introduce Diamond, a novel method for trustworthy feature interaction discovery. Diamond uniquely integrates the model-X knockoffs framework to control the false discovery rate (FDR), ensuring that the proportion of falsely discovered interactions remains low. We further address the challenges of using off-the-shelf interaction importance measures by proposing a calibration procedure that refines these measures to maintain the desired FDR. Diamond's applicability spans a wide range of ML models, including deep neural networks, tree-based models, and factorization-based models. Our empirical evaluations on both simulated and real datasets across various biomedical studies demonstrate Diamond's utility in enabling more reliable data-driven scientific discoveries. This method represents a significant step forward in the deployment of ML models for scientific innovation and hypothesis generation.
Published: 2024

12. Parallel Speculative Decoding with Adaptive Draft Length

Author: Liu, Tianyu, Li, Yun, Lv, Qitan, Liu, Kai, Zhu, Jianchen, and Hu, Winston
Subjects: Computer Science - Computation and Language
Abstract: Speculative decoding (SD), where an extra draft model is employed to provide multiple \textit{draft} tokens first and then the original target model verifies these tokens in parallel, has shown great power for LLM inference acceleration. However, existing SD methods suffer from the mutual waiting problem, i.e., the target model gets stuck when the draft model is \textit{guessing} tokens, and vice versa. This problem is directly incurred by the asynchronous execution of the draft model and the target model, and is exacerbated due to the fixed draft length in speculative decoding. To address these challenges, we propose a conceptually simple, flexible, and general framework to boost speculative decoding, namely \textbf{P}arallel sp\textbf{E}culative decoding with \textbf{A}daptive d\textbf{R}aft \textbf{L}ength (PEARL). Specifically, PEARL proposes \textit{pre-verify} to verify the first draft token in advance during the drafting phase, and \textit{post-verify} to generate more draft tokens during the verification phase. PEARL parallels the drafting phase and the verification phase via applying the two strategies, and achieves adaptive draft length for different scenarios, which effectively alleviates the mutual waiting problem. Moreover, we theoretically demonstrate that the mean accepted tokens of PEARL is more than existing \textit{draft-then-verify} works. Experiments on various text generation benchmarks demonstrate the effectiveness of our \name, leading to a superior speedup performance up to \textbf{3.79$\times$} and \textbf{1.52$\times$}, compared to auto-regressive decoding and vanilla speculative decoding, respectively.
Published: 2024

13. The Orbit and Mass of the Cepheid AW Per

Author: Evans, Nancy Remage, Gallenne, Alexandre, Kervella, Pierre, Mérand, Antoine, Monnier, John, Anderson, Richard I, Günther, H. Moritz, Proffitt, Charles, Winston, Elaine M., Pietrzynski, Grzegorz, Gieren, Wolfgang, Kuraszkiewicz, Joanna, Anugu, Narsireddy, Roettenbacher, Rachael M., Lanthermann, Cyprien, Gutierrez, Mayra, Schaefer, Gail, Setterholm, Benjamin R., Ibrahim, Noura, and Kraus, Stefan
Subjects: Astrophysics - Solar and Stellar Astrophysics
Abstract: The Cepheid AW Per is a component in a multiple system with a long period orbit. The radial velocities of Griffin (2016) cover the 38 year orbit well. An extensive program of interferometry with the CHARA array is reported here, from which the long period orbit is determined. In addition, a {\it Hubble Space Telescope} high resolution spectrum in the ultraviolet demonstrates that the companion is itself a binary with nearly equal mass components. These data combined with a distance from {\it Gaia} provide a mass of the Cepheid (primary) of M$_1$ = 6.79 $\pm$ 0.85 $M_\odot$. The combined mass of the secondary is M$_S$ = 8.79 $\pm$ 0.50 $M_\odot$. The accuracy of the mass will be improved after the fourth Gaia data release expected in approximately two years., Comment: Accepted for ApJ
Published: 2024

14. Investigating Video Reasoning Capability of Large Language Models with Tropes in Movies

Author: Su, Hung-Ting, Chao, Chun-Tong, Hsu, Ya-Ching, Lin, Xudong, Niu, Yulei, Lee, Hung-Yi, and Hsu, Winston H.
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Large Language Models (LLMs) have demonstrated effectiveness not only in language tasks but also in video reasoning. This paper introduces a novel dataset, Tropes in Movies (TiM), designed as a testbed for exploring two critical yet previously overlooked video reasoning skills: (1) Abstract Perception: understanding and tokenizing abstract concepts in videos, and (2) Long-range Compositional Reasoning: planning and integrating intermediate reasoning steps for understanding long-range videos with numerous frames. Utilizing tropes from movie storytelling, TiM evaluates the reasoning capabilities of state-of-the-art LLM-based approaches. Our experiments show that current methods, including Captioner-Reasoner, Large Multimodal Model Instruction Fine-tuning, and Visual Programming, only marginally outperform a random baseline when tackling the challenges of Abstract Perception and Long-range Compositional Reasoning. To address these deficiencies, we propose Face-Enhanced Viper of Role Interactions (FEVoRI) and Context Query Reduction (ConQueR), which enhance Visual Programming by fostering role interaction awareness and progressively refining movie contexts and trope queries during reasoning processes, significantly improving performance by 15 F1 points. However, this performance still lags behind human levels (40 vs. 65 F1). Additionally, we introduce a new protocol to evaluate the necessity of Abstract Perception and Long-range Compositional Reasoning for task resolution. This is done by analyzing the code generated through Visual Programming using an Abstract Syntax Tree (AST), thereby confirming the increased complexity of TiM. The dataset and code are available at: https://ander1119.github.io/TiM, Comment: Project page: https://ander1119.github.io/TiM
Published: 2024

15. The Imaging Database for Epilepsy And Surgery (IDEAS)

Author: Taylor, Peter N., Wang, Yujiang, Simpson, Callum, Janiukstyte, Vytene, Horsley, Jonathan, Leiberg, Karoline, Little, Beth, Clifford, Harry, Adler, Sophie, Vos, Sjoerd B., Winston, Gavin P, McEvoy, Andrew W, Miserocchi, Anna, de Tisi, Jane, and Duncan, John S
Subjects: Quantitative Biology - Neurons and Cognition
Abstract: Magnetic resonance imaging (MRI) is a crucial tool to identify brain abnormalities in a wide range of neurological disorders. In focal epilepsy MRI is used to identify structural cerebral abnormalities. For covert lesions, machine learning and artificial intelligence algorithms may improve lesion detection if abnormalities are not evident on visual inspection. The success of this approach depends on the volume and quality of training data. Herein, we release an open-source dataset of preprocessed MRI scans from 442 individuals with drug-refractory focal epilepsy who had neurosurgical resections, and detailed demographic information. The MRI scan data includes the preoperative 3D T1 and where available 3D FLAIR, as well as a manually inspected complete surface reconstruction and volumetric parcellations. Demographic information includes age, sex, age of onset of epilepsy, location of surgery, histopathology of resected specimen, occurrence and frequency of focal seizures with and without impairment of awareness, focal to bilateral tonic-clonic seizures, number of anti-seizure medications (ASMs) at time of surgery, and a total of 1764 patient years of post-surgical follow up. Crucially, we also include resection masks delineated from post-surgical imaging. To demonstrate the veracity of our data, we successfully replicated previous studies showing long-term outcomes of seizure freedom in the range of around 50%. Our imaging data replicates findings of group level atrophy in patients compared to controls. Resection locations in the cohort were predominantly in the temporal and frontal lobes. We envisage our dataset, shared openly with the community, will catalyse the development and application of computational methods in clinical neurology.
Published: 2024

16. Brain Morphology Normative modelling platform for abnormality and Centile estimation: Brain MoNoCle

Author: Little, Bethany, Alyas, Nida, Surtees, Alexander, Winston, Gavin P, Duncan, John S, Cousins, David A, Taylor, John-Paul, Taylor, Peter, Leiberg, Karoline, and Wang, Yujiang
Subjects: Quantitative Biology - Neurons and Cognition
Abstract: Normative models of brain structure estimate the effects of covariates such as age and sex using large samples of healthy controls. These models can then be applied to smaller clinical cohorts to distinguish disease effects from other covariates. However, these advanced statistical modelling approaches can be difficult to access, and processing large healthy cohorts is computationally demanding. Thus, accessible platforms with pre-trained normative models are needed. We present such a platform for brain morphology analysis as an open-source web application https://cnnplab.shinyapps.io/normativemodelshiny/, with six key features: (i) user-friendly web interface, (ii) individual and group outputs, (iii) multi-site analysis, (iv) regional and whole-brain analysis, (v) integration with existing tools, and (vi) featuring multiple morphology metrics. Using a diverse sample of 3,276 healthy controls across 21 sites, we pre-trained normative models on various metrics. We validated the models with a small clinical sample of individuals with bipolar disorder, showing outputs that aligned closely with existing literature only after applying our normative modelling. Further validation with a cohort of temporal lobe epilepsy showed agreement with previous group-level findings and individual-level seizure lateralisation. Finally, with the ability to investigate multiple morphology measures in the same framework, we found that biological covariates are better explained in specific morphology measures, and for clinical applications, only some measures are sensitive to the disease process. Our platform offers a comprehensive framework to analyse brain morphology in clinical and research settings. Validations confirm the superiority of normative models and the advantage of investigating a range of brain morphology metrics together.
Published: 2024

17. Shared-unique Features and Task-aware Prioritized Sampling on Multi-task Reinforcement Learning

Author: Lin, Po-Shao, Yeh, Jia-Fong, Chen, Yi-Ting, and Hsu, Winston H.
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: We observe that current state-of-the-art (SOTA) methods suffer from the performance imbalance issue when performing multi-task reinforcement learning (MTRL) tasks. While these methods may achieve impressive performance on average, they perform extremely poorly on a few tasks. To address this, we propose a new and effective method called STARS, which consists of two novel strategies: a shared-unique feature extractor and task-aware prioritized sampling. First, the shared-unique feature extractor learns both shared and task-specific features to enable better synergy of knowledge between different tasks. Second, the task-aware sampling strategy is combined with the prioritized experience replay for efficient learning on tasks with poor performance. The effectiveness and stability of our STARS are verified through experiments on the mainstream Meta-World benchmark. From the results, our STARS statistically outperforms current SOTA methods and alleviates the performance imbalance issue. Besides, we visualize the learned features to support our claims and enhance the interpretability of STARS., Comment: The first two authors contribute equally
Published: 2024

18. VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation

Author: Hung, Kuo-Han, Lo, Pang-Chi, Yeh, Jia-Fong, Hsu, Han-Yuan, Chen, Yi-Ting, and Hsu, Winston H.
Subjects: Computer Science - Robotics
Abstract: We study reward models for long-horizon manipulation tasks by learning from action-free videos and language instructions, which we term the visual-instruction correlation (VIC) problem. Recent advancements in cross-modality modeling have highlighted the potential of reward modeling through visual and language correlations. However, existing VIC methods face challenges in learning rewards for long-horizon tasks due to their lack of sub-stage awareness, difficulty in modeling task complexities, and inadequate object state estimation. To address these challenges, we introduce VICtoR, a novel hierarchical VIC reward model capable of providing effective reward signals for long-horizon manipulation tasks. VICtoR precisely assesses task progress at various levels through a novel stage detector and motion progress evaluator, offering insightful guidance for agents learning the task effectively. To validate the effectiveness of VICtoR, we conducted extensive experiments in both simulated and real-world environments. The results suggest that VICtoR outperformed the best existing VIC methods, achieving a 43% improvement in success rates for long-horizon tasks.
Published: 2024

19. Enhancing Sustainable Urban Mobility Prediction with Telecom Data: A Spatio-Temporal Framework Approach

Author: Lin, ChungYi, Tung, Shen-Lung, Su, Hung-Ting, and Hsu, Winston H.
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Networking and Internet Architecture
Abstract: Traditional traffic prediction, limited by the scope of sensor data, falls short in comprehensive traffic management. Mobile networks offer a promising alternative using network activity counts, but these lack crucial directionality. Thus, we present the TeltoMob dataset, featuring undirected telecom counts and corresponding directional flows, to predict directional mobility flows on roadways. To address this, we propose a two-stage spatio-temporal graph neural network (STGNN) framework. The first stage uses a pre-trained STGNN to process telecom data, while the second stage integrates directional and geographic insights for accurate prediction. Our experiments demonstrate the framework's compatibility with various STGNN models and confirm its effectiveness. We also show how to incorporate the framework into real-world transportation systems, enhancing sustainable urban mobility., Comment: 8 Figures, 5 Tables. Just accepted by IJCAI (to appear)
Published: 2024

20. Unsupervised Image Prior via Prompt Learning and CLIP Semantic Guidance for Low-Light Image Enhancement

Author: Morawski, Igor, He, Kai, Dangi, Shusil, and Hsu, Winston H.
Subjects: Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: Currently, low-light conditions present a significant challenge for machine cognition. In this paper, rather than optimizing models by assuming that human and machine cognition are correlated, we use zero-reference low-light enhancement to improve the performance of downstream task models. We propose to improve the zero-reference low-light enhancement method by leveraging the rich visual-linguistic CLIP prior without any need for paired or unpaired normal-light data, which is laborious and difficult to collect. We propose a simple but effective strategy to learn prompts that help guide the enhancement method and experimentally show that the prompts learned without any need for normal-light data improve image contrast, reduce over-enhancement, and reduce noise over-amplification. Next, we propose to reuse the CLIP model for semantic guidance via zero-shot open vocabulary classification to optimize low-light enhancement for task-based performance rather than human visual perception. We conduct extensive experimental results showing that the proposed method leads to consistent improvements across various datasets regarding task-based performance and compare our method against state-of-the-art methods, showing favorable results across various low-light datasets., Comment: Accepted to CVPR 2024 Workshop NTIRE: New Trends in Image Restoration and Enhancement workshop and Challenges
Published: 2024

21. The fourth moment of the Hurwitz zeta function

Author: Heap, Winston and Sahay, Anurag
Subjects: Mathematics - Number Theory
Abstract: We prove a sharp upper bound for the fourth moment of the Hurwitz zeta function $\zeta(s,\alpha)$ on the critical line when the shift parameter $\alpha$ is irrational and of irrationality exponent strictly less than 3. As a consequence, we determine the order of magnitude of the $2k$th moment for all $0 \leqslant k \leqslant 2$ in this case. In contrast to the Riemann zeta function and other $L$-functions from arithmetic, these grow like $T (\log T)^k$. This suggests, and we conjecture, that the value distribution of $\zeta(s,\alpha)$ on the critical line is Gaussian., Comment: 34 pages
Published: 2024

22. Unmanned Vehicles in 6G Networks: A Unifying Treatment of Problems, Formulations, and Tools

Author: Hurst, Winston, Evmorfos, Spilios, Petropulu, Athina, and Mostofi, Yasamin
Subjects: Electrical Engineering and Systems Science - Systems and Control, Computer Science - Networking and Internet Architecture
Abstract: Unmanned Vehicles (UVs) functioning as autonomous agents are anticipated to play a crucial role in the 6th Generation of wireless networks. Their seamless integration, cost-effectiveness, and the additional controllability through motion planning make them an attractive deployment option for a wide range of applications, both as assets in the network (e.g., mobile base stations) and as consumers of network services (e.g., autonomous delivery systems). However, despite their potential, the convergence of UVs and wireless systems brings forth numerous challenges that require attention from both academia and industry. This paper then aims to offer a comprehensive overview encompassing the transformative possibilities as well as the significant challenges associated with UV-assisted next-generation wireless communications. Considering the diverse landscape of possible application scenarios, problem formulations, and mathematical tools related to UV-assisted wireless systems, the underlying core theme of this paper is the unification of the problem space, providing a structured framework to understand the use cases, problem formulations, and necessary mathematical tools. Overall, the paper sets forth a clear understanding of how unmanned vehicles can be integrated in the 6G ecosystem, paving the way towards harnessing the full potential at this intersection.
Published: 2024

23. Emergent Cooperation for Energy-efficient Connectivity via Wireless Power Transfer

Author: Hurst, Winston, Pallaprolu, Anurag, and Mostofi, Yasamin
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: This paper addresses the challenge of incentivizing energy-constrained, non-cooperative user equipment (UE) to serve as cooperative relays. We consider a source UE with a non-line-of-sight channel to an access point (AP), where direct communication may be infeasible or may necessitate a substantial transmit power. Other UEs in the vicinity are viewed as relay candidates, and our aim is to enable energy-efficient connectivity for the source, while accounting for the self-interested behavior and private channel state information of these candidates, by allowing the source to ``pay" the candidates via wireless power transfer (WPT). We propose a cooperation-inducing protocol, inspired by Myerson auction theory, which ensures that candidates truthfully report power requirements while minimizing the expected power used by the source. Through rigorous analysis, we establish the regularity of valuations for lognormal fading channels, which allows for the efficient determination of the optimal source transmit power. Extensive simulation experiments, employing real-world communication and WPT parameters, validate our theoretical framework. Our results demonstrate over 71% reduction in outage probability with as few as 4 relay candidates, compared to the non-cooperative scenario, and as much as 70% source power savings compared to a baseline approach, highlighting the efficacy of our proposed methodology.
Published: 2024

24. The Ethical Landscape of Prodromal Parkinson Disease: Considerations for Shared Decision-Making and Health Equity.

Author: Hoy, Colin W and Chiong, Winston
Subjects: Biomedical and Clinical Sciences, Neurosciences, Clinical Sciences, Parkinson's Disease, Neurodegenerative, Brain Disorders, Aging, Good Health and Well Being, Humans, Parkinson Disease, Health Equity, Prodromal Symptoms, Decision Making, Shared, Cognitive Sciences, Neurology & Neurosurgery, Clinical sciences
Published: 2024

25. Interstitial macrophages are a focus of viral takeover and inflammation in COVID-19 initiation in human lung

Author: Wu, Timothy Ting-Hsuan, Travaglini, Kyle J, Rustagi, Arjun, Xu, Duo, Zhang, Yue, Andronov, Leonid, Jang, SoRi, Gillich, Astrid, Dehghannasiri, Roozbeh, Martínez-Colón, Giovanny J, Beck, Aimee, Liu, Daniel Dan, Wilk, Aaron J, Morri, Maurizio, Trope, Winston L, Bierman, Rob, Weissman, Irving L, Shrager, Joseph B, Quake, Stephen R, Kuo, Christin S, Salzman, Julia, Moerner, WE, Kim, Peter S, Blish, Catherine A, and Krasnow, Mark A
Subjects: Medical Microbiology, Biomedical and Clinical Sciences, Lung, Vaccine Related, Emerging Infectious Diseases, Infectious Diseases, Pneumonia, Prevention, Pneumonia & Influenza, 2.2 Factors relating to the physical environment, 2.1 Biological and endogenous factors, Aetiology, Infection, Respiratory, Good Health and Well Being, Humans, COVID-19, SARS-CoV-2, Macrophages, Inflammation, RNA, Viral, Medical and Health Sciences, Immunology, Biomedical and clinical sciences, Health sciences
Abstract: Early stages of deadly respiratory diseases including COVID-19 are challenging to elucidate in humans. Here, we define cellular tropism and transcriptomic effects of SARS-CoV-2 virus by productively infecting healthy human lung tissue and using scRNA-seq to reconstruct the transcriptional program in "infection pseudotime" for individual lung cell types. SARS-CoV-2 predominantly infected activated interstitial macrophages (IMs), which can accumulate thousands of viral RNA molecules, taking over 60% of the cell transcriptome and forming dense viral RNA bodies while inducing host profibrotic (TGFB1, SPP1) and inflammatory (early interferon response, CCL2/7/8/13, CXCL10, and IL6/10) programs and destroying host cell architecture. Infected alveolar macrophages (AMs) showed none of these extreme responses. Spike-dependent viral entry into AMs used ACE2 and Sialoadhesin/CD169, whereas IM entry used DC-SIGN/CD209. These results identify activated IMs as a prominent site of viral takeover, the focus of inflammation and fibrosis, and suggest targeting CD209 to prevent early pathology in COVID-19 pneumonia. This approach can be generalized to any human lung infection and to evaluate therapeutics.
Published: 2024

26. A Health-Related Quality of Life Measure for Patients Who Undergo Minimally Invasive Glaucoma Surgery

Author: Hays, Ron D, Tarver, Michelle E, Eydelman, Malvina, Spaeth, George L, Parke, David W, Singh, Kuldev, Nguyen, Don, Saltzmann, Robert M, Smith, Oluwatosin, Shaw, My Le, Rosenberg, Lisa, Seibold, Leo, Teymoorian, Savak, Provencher, Lorraine M, Bicket, Amanda K, Arora, Nitika, Junk, Anna K, Chaya, Craig, Salim, Sarwat, Kuo, Debbie, Weiner, Asher, Zhang, Ze, Rhee, Brian Francis Douglas, McMillan, Brian, Choo, Clara, Garris, Winston, Noecker, Rob, Fellman, Ronald, Caprioli, Joseph, Vold, Steven, Pasquale, Louis, Cui, Qi, and Mbagwu, Michael
Subjects: Biomedical and Clinical Sciences, Ophthalmology and Optometry, Neurodegenerative, Aging, Eye Disease and Disorders of Vision, Neurosciences, Clinical Research, Patient Safety, Eye, Good Health and Well Being, Glaucoma Outcomes Survey Collaborative Study Group, Clinical Sciences, Opthalmology and Optometry, Public Health and Health Services, Ophthalmology & Optometry, Ophthalmology and optometry
Abstract: PurposeTo develop a patient-reported outcome measure to assess the impact of glaucoma and treatment, including minimally invasive glaucoma surgery (MIGS).DesignObservational study before and after concomitant cataract and Food and Drug Administration-approved implantable MIGS device surgery.SettingSurvey administration was on a computer, iPad, or similar device.Patient population184 adults completed the baseline survey, 124 a survey 3 months after surgery, and 106 the 1-month test-retest reliability survey. The age range was 37 to 89 (average age = 72). Most were female (57%), non-Hispanic White (81%), and had a college degree (56%).Main outcome measuresThe Glaucoma Outcomes Survey (GOS) assesses functional limitations (27 items), vision-related symptoms (7 items), psychosocial issues (7 items), and satisfaction with microinvasive glaucoma surgery (1 item). These multiple-item scales were scored on a 0 to 100 range, with a higher score indicating worse health.ResultsInternal consistency reliability estimates ranged from 0.75 to 0.93, and 1-month test-retest intraclass correlations ranged from 0.83 to 0.92 for the GOS scales. Product-moment correlations among the scales ranged from 0.56 to 0.60. Improvement in visual acuity in the study eye from baseline to the 3-month follow-up was significantly related to improvements in GOS functional limitations (r = 0.18, P = .0485), vision-related symptoms (r = 0.19, P = .0386), and psychosocial concerns (r = 0.18, P = .0503). Responders to treatment ranged from 17% for vision-related symptoms to 48% for functional limitations.ConclusionsThis study supports using the GOS for ophthalmic procedures such as MIGS. Further evaluation of the GOS in different patient subgroups and clinical settings is needed.
Published: 2024

27. A genome sequence for the threatened whitebark pine.

Author: Neale, David, Zimin, Aleksey, Meltzer, Amy, Bhattarai, Akriti, Amee, Maurice, Figueroa Corona, Laura, Puiu, Daniela, Wright, Jessica, De La Torre, Amanda, McGuire, Patrick, Timp, Winston, Salzberg, Steven, Wegrzyn, Jill, and Allen, Brian
Subjects: Pinus albicaulis, annotation, conifer, genome assembly, gymnosperm, whitebark pine, Pinus, Molecular Sequence Annotation, Genome, Plant, Genomics, Endangered Species, High-Throughput Nucleotide Sequencing
Abstract: Whitebark pine (WBP, Pinus albicaulis) is a white pine of subalpine regions in the Western contiguous United States and Canada. WBP has become critically threatened throughout a significant part of its natural range due to mortality from the introduced fungal pathogen white pine blister rust (WPBR, Cronartium ribicola) and additional threats from mountain pine beetle (Dendroctonus ponderosae), wildfire, and maladaptation due to changing climate. Vast acreages of WBP have suffered nearly complete mortality. Genomic technologies can contribute to a faster, more cost-effective approach to the traditional practices of identifying disease-resistant, climate-adapted seed sources for restoration. With deep-coverage Illumina short reads of haploid megagametophyte tissue and Oxford Nanopore long reads of diploid needle tissue, followed by a hybrid, multistep assembly approach, we produced a final assembly containing 27.6 Gb of sequence in 92,740 contigs (N50 537,007 bp) and 34,716 scaffolds (N50 2.0 Gb). Approximately 87.2% (24.0 Gb) of total sequence was placed on the 12 WBP chromosomes. Annotation yielded 25,362 protein-coding genes, and over 77% of the genome was characterized as repeats. WBP has demonstrated the greatest variation in resistance to WPBR among the North American white pines. Candidate genes for quantitative resistance include disease resistance genes known as nucleotide-binding leucine-rich repeat receptors (NLRs). A combination of protein domain alignments and direct genome scanning was employed to fully describe the 3 subclasses of NLRs. Our high-quality reference sequence and annotation provide a marked improvement in NLR identification compared to previous assessments that leveraged de novo-assembled transcriptomes.
Published: 2024

28. Patient experiences of resection versus responsive neurostimulation for drug-resistant epilepsy.

Author: Haeusermann, Tobias, Liu, Emily, Fong, Kristina, Dohan, Daniel, and Chiong, Winston
Subjects: Drug-resistant epilepsy, Patient experience, Responsive neurostimulation, Surgical resection, Humans, Drug Resistant Epilepsy, Epilepsy, Deep Brain Stimulation, Electrocorticography, Patient Outcome Assessment
Abstract: This study explored illness experiences and decision-making among patients with epilepsy who underwent two different types of surgical interventions: resection versus implantation of the NeuroPace Responsive Neurostimulation System (RNS). We recruited 31 participants from a level four epilepsy center in an academic medical institution. We observed 22 patient clinic visits (resection: n = 10, RNS: n = 12) and conducted 18 in-depth patient interviews (resection: n = seven, RNS: n = 11); most visits and interviews included patient caregivers. Using an applied ethnographic approach, we identified three major themes in the experiences of resection versus RNS patients. First, for patients in both cohorts, the therapeutic journey was circuitous in ways that defied standardized first-, second-, and third- line of care models. Second, in conceptualizing risk, resection patients emphasized the permanent loss of taking out brain tissue whereas RNS patients highlighted the reversibility of putting in a device. Lastly, in considering benefit, resection patients perceived their surgery as potentially curative while RNS patients understood implantation as primarily palliative with possible additional diagnostic benefit from chronic electrocorticography. Insight into the perspectives of patients and caregivers may help identify key topics for counseling and exploration by clinicians.
Published: 2024

29. Tracking-Assisted Object Detection with Event Cameras

Author: Yen, Ting-Kang, Morawski, Igor, Dangi, Shusil, He, Kai, Lin, Chung-Yi, Yeh, Jia-Fong, Su, Hung-Ting, and Hsu, Winston
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Event-based object detection has recently garnered attention in the computer vision community due to the exceptional properties of event cameras, such as high dynamic range and no motion blur. However, feature asynchronism and sparsity cause invisible objects due to no relative motion to the camera, posing a significant challenge in the task. Prior works have studied various implicit-learned memories to retain as many temporal cues as possible. However, implicit memories still struggle to preserve long-term features effectively. In this paper, we consider those invisible objects as pseudo-occluded objects and aim to detect them by tracking through occlusions. Firstly, we introduce the visibility attribute of objects and contribute an auto-labeling algorithm to not only clean the existing event camera dataset but also append additional visibility labels to it. Secondly, we exploit tracking strategies for pseudo-occluded objects to maintain their permanence and retain their bounding boxes, even when features have not been available for a very long time. These strategies can be treated as an explicit-learned memory guided by the tracking objective to record the displacements of objects across frames. Lastly, we propose a spatio-temporal feature aggregation module to enrich the latent features and a consistency loss to increase the robustness of the overall pipeline. We conduct comprehensive experiments to verify our method's effectiveness where still objects are retained, but real occluded objects are discarded. The results demonstrate that (1) the additional visibility labels can assist in supervised training, and (2) our method outperforms state-of-the-art approaches with a significant improvement of 7.9% absolute mAP.
Published: 2024

30. Chemical homogenization for non-mixing reactive interfaces in porous media

Author: Lindqwister, Winston, Veveakis, Manolis, and Lesueur, Martin
Subjects: Condensed Matter - Soft Condensed Matter, Physics - Chemical Physics, Physics - Fluid Dynamics
Abstract: Porous media, while ubiquitous across many engineering disciplines, is inherently difficult to characterize due to their innate stochasticity and heterogeneity. The key for predicting porous material behavior comes down to the structuring of its microstructure, where the linkages of microstructural properties to mesoscale effects remain as one of the key questions in unlocking understanding of this class of materials. One proposed method of linking scales comes down to using Minkowski functionals -- geometric morphometers that describe the spatial and topological features of a convex space -- to draw connections from microstructural form to mesoscale features. In this work, chemical equilibrium and kinetics on a microstructure surface were explored, with Minkowski functionals used as the basis for relating microstructural geometry to chemical performance. Using surface CRNs to model chemical behavior -- a novel asynchronous cellular automaton -- linkages were found between the Minkowski functionals and equilibrium equilibrium constant, as well as properties related to the dynamics of the system's reaction quotient.
Published: 2024

31. Diffusion-based Aesthetic QR Code Generation via Scanning-Robust Perceptual Guidance

Author: Liao, Jia-Wei, Wang, Winston, Wang, Tzu-Sian, Peng, Li-Xuan, Chou, Cheng-Fu, and Chen, Jun-Cheng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: QR codes, prevalent in daily applications, lack visual appeal due to their conventional black-and-white design. Integrating aesthetics while maintaining scannability poses a challenge. In this paper, we introduce a novel diffusion-model-based aesthetic QR code generation pipeline, utilizing pre-trained ControlNet and guided iterative refinement via a novel classifier guidance (SRG) based on the proposed Scanning-Robust Loss (SRL) tailored with QR code mechanisms, which ensures both aesthetics and scannability. To further improve the scannability while preserving aesthetics, we propose a two-stage pipeline with Scanning-Robust Perceptual Guidance (SRPG). Moreover, we can further enhance the scannability of the generated QR code by post-processing it through the proposed Scanning-Robust Projected Gradient Descent (SRPGD) post-processing technique based on SRL with proven convergence. With extensive quantitative, qualitative, and subjective experiments, the results demonstrate that the proposed approach can generate diverse aesthetic QR codes with flexibility in detail. In addition, our pipelines outperforming existing models in terms of Scanning Success Rate (SSR) 86.67% (+40%) with comparable aesthetic scores. The pipeline combined with SRPGD further achieves 96.67% (+50%). Our code will be available https://github.com/jwliao1209/DiffQRCode.
Published: 2024

32. Role of Locality and Weight Sharing in Image-Based Tasks: A Sample Complexity Separation between CNNs, LCNs, and FCNs

Author: Lahoti, Aakash, Karp, Stefani, Winston, Ezra, Singh, Aarti, and Li, Yuanzhi
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Statistics - Machine Learning
Abstract: Vision tasks are characterized by the properties of locality and translation invariance. The superior performance of convolutional neural networks (CNNs) on these tasks is widely attributed to the inductive bias of locality and weight sharing baked into their architecture. Existing attempts to quantify the statistical benefits of these biases in CNNs over locally connected convolutional neural networks (LCNs) and fully connected neural networks (FCNs) fall into one of the following categories: either they disregard the optimizer and only provide uniform convergence upper bounds with no separating lower bounds, or they consider simplistic tasks that do not truly mirror the locality and translation invariance as found in real-world vision tasks. To address these deficiencies, we introduce the Dynamic Signal Distribution (DSD) classification task that models an image as consisting of $k$ patches, each of dimension $d$, and the label is determined by a $d$-sparse signal vector that can freely appear in any one of the $k$ patches. On this task, for any orthogonally equivariant algorithm like gradient descent, we prove that CNNs require $\tilde{O}(k+d)$ samples, whereas LCNs require $\Omega(kd)$ samples, establishing the statistical advantages of weight sharing in translation invariant tasks. Furthermore, LCNs need $\tilde{O}(k(k+d))$ samples, compared to $\Omega(k^2d)$ samples for FCNs, showcasing the benefits of locality in local tasks. Additionally, we develop information theoretic tools for analyzing randomized algorithms, which may be of interest for statistical research., Comment: 40 pages, 4 figures, Accepted to ICLR 2024, Spotlight
Published: 2024

33. ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation

Author: Tran, Minh, Bounsavy, Winston, Vo, Khoa, Nguyen, Anh, Nguyen, Tri, and Le, Ngan
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Amodal Instance Segmentation (AIS) presents a challenging task as it involves predicting both visible and occluded parts of objects within images. Existing AIS methods rely on a bidirectional approach, encompassing both the transition from amodal features to visible features (amodal-to-visible) and from visible features to amodal features (visible-to-amodal). Our observation shows that the utilization of amodal features through the amodal-to-visible can confuse the visible features due to the extra information of occluded/hidden segments not presented in visible display. Consequently, this compromised quality of visible features during the subsequent visible-to-amodal transition. To tackle this issue, we introduce ShapeFormer, a decoupled Transformer-based model with a visible-to-amodal transition. It facilitates the explicit relationship between output segmentations and avoids the need for amodal-to-visible transitions. ShapeFormer comprises three key modules: (i) Visible-Occluding Mask Head for predicting visible segmentation with occlusion awareness, (ii) Shape-Prior Amodal Mask Head for predicting amodal and occluded masks, and (iii) Category-Specific Shape Prior Retriever aims to provide shape prior knowledge. Comprehensive experiments and extensive ablation studies across various AIS benchmarks demonstrate the effectiveness of our ShapeFormer. The code is available at: \url{https://github.com/UARK-AICV/ShapeFormer}, Comment: Accepted to IJCNN2024
Published: 2024

34. Customizable Avatars with Dynamic Facial Action Coded Expressions (CADyFACE) for Improved User Engagement

Author: Witherow, Megan A., Butler, Crystal, Shields, Winston J., Ilgin, Furkan, Diawara, Norou, Keener, Janice, Harrington, John W., and Iftekharuddin, Khan M.
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Customizable 3D avatar-based facial expression stimuli may improve user engagement in behavioral biomarker discovery and therapeutic intervention for autism, Alzheimer's disease, facial palsy, and more. However, there is a lack of customizable avatar-based stimuli with Facial Action Coding System (FACS) action unit (AU) labels. Therefore, this study focuses on (1) FACS-labeled, customizable avatar-based expression stimuli for maintaining subjects' engagement, (2) learning-based measurements that quantify subjects' facial responses to such stimuli, and (3) validation of constructs represented by stimulus-measurement pairs. We propose Customizable Avatars with Dynamic Facial Action Coded Expressions (CADyFACE) labeled with AUs by a certified FACS expert. To measure subjects' AUs in response to CADyFACE, we propose a novel Beta-guided Correlation and Multi-task Expression learning neural network (BeCoME-Net) for multi-label AU detection. The beta-guided correlation loss encourages feature correlation with AUs while discouraging correlation with subject identities for improved generalization. We train BeCoME-Net for unilateral and bilateral AU detection and compare with state-of-the-art approaches. To assess construct validity of CADyFACE and BeCoME-Net, twenty healthy adult volunteers complete expression recognition and mimicry tasks in an online feasibility study while webcam-based eye-tracking and video are collected. We test validity of multiple constructs, including face preference during recognition and AUs during mimicry., Comment: 12 pages, 8 figures
Published: 2024

35. Tel2Veh: Fusion of Telecom Data and Vehicle Flow to Predict Camera-Free Traffic via a Spatio-Temporal Framework

Author: Lin, ChungYi, Tung, Shen-Lung, Su, Hung-Ting, and Hsu, Winston H.
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Vehicle flow, a crucial indicator for transportation, is often limited by detector coverage. With the advent of extensive mobile network coverage, we can leverage mobile user activities, or cellular traffic, on roadways as a proxy for vehicle flow. However, as counts of cellular traffic may not directly align with vehicle flow due to data from various user types, we present a new task: predicting vehicle flow in camera-free areas using cellular traffic. To uncover correlations within multi-source data, we deployed cameras on selected roadways to establish the Tel2Veh dataset, consisting of extensive cellular traffic and sparse vehicle flows. Addressing this challenge, we propose a framework that independently extracts features and integrates them with a graph neural network (GNN)-based fusion to discern disparities, thereby enabling the prediction of unseen vehicle flows using cellular traffic. This work advances the use of telecom data in transportation and pioneers the fusion of telecom and vision-based data, offering solutions for traffic management., Comment: 4 pages, 5 figures, 4 tables. Accepted by WWW'24, to appear
Published: 2024

36. A New Probe of Cosmic Birefringence Using Galaxy Polarization and Shapes

Author: Yin, Weichen Winston, Dai, Liang, Huang, Junwu, Ji, Lingyuan, and Ferraro, Simone
Subjects: Astrophysics - Cosmology and Nongalactic Astrophysics, Astrophysics - Astrophysics of Galaxies, High Energy Physics - Phenomenology
Abstract: We propose a new method to search for parity-violating new physics via measurements of cosmic birefringence and demonstrate its power in detecting the topological effect originating from an axion string network with an axion-photon coupling as a motivated source of cosmic birefringence. The method, using large galaxy samples, exploits an empirical correlation between the polarization direction of the integrated radio emission from a spiral galaxy and its apparent shape. We devise unbiased minimum-variance quadratic estimators for discrete samples of galaxies with both integrated radio polarization and shape measurements. Assuming a synergy with overlapping optical imaging surveys, we forecast the sensitivity to polarization rotation of the forthcoming SKA radio continuum surveys of spiral galaxies out to $z \sim 1.5$. The angular noise power spectrum of polarization rotation using our method can be lower than that expected from CMB Stage-IV experiments, when assuming a wide survey covering $\sim 1000\,{\rm deg}^2$ and reaching an RMS flux of $\sim 1\,\mu{\rm Jy}$. Our method will be complementary to CMB-based methods as it will be subject to different systematics. It can be generalized to probe time-varying or redshift-varying birefringence signals.
Published: 2024

37. Learning the Covariance of Treatment Effects Across Many Weak Experiments

Author: Bibaut, Aurélien, Chou, Winston, Ejdemyr, Simon, and Kallus, Nathan
Subjects: Statistics - Methodology
Abstract: When primary objectives are insensitive or delayed, experimenters may instead focus on proxy metrics derived from secondary outcomes. For example, technology companies often infer the long-term impacts of product interventions from their effects on short-term user engagement signals. We consider the meta-analysis of many historical experiments to learn the covariance of treatment effects on these outcomes, which can support the construction of such proxies. Even when experiments are plentiful, if treatment effects are weak, the covariance of estimated treatment effects across experiments can be highly biased. We overcome this with techniques inspired by weak instrumental variable analysis. We show that Limited Information Maximum Likelihood (LIML) learns a parameter equivalent to fitting total least squares to a transformation of the scatterplot of treatment effects, and that Jackknife Instrumental Variables Estimation (JIVE) learns another parameter computable from the average of Jackknifed covariance matrices across experiments. We also present a total covariance estimator for the latter estimand under homoskedasticity, which is equivalent to a $k$-class estimator. We show how these parameters can be used to construct unbiased proxy metrics under various structural models. Lastly, we discuss the real-world application of our methods at Netflix., Comment: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
Published: 2024

38. AED: Adaptable Error Detection for Few-shot Imitation Policy

Author: Yeh, Jia-Fong, Hung, Kuo-Han, Lo, Pang-Chi, Chung, Chi-Ming, Wu, Tsung-Han, Su, Hung-Ting, Chen, Yi-Ting, and Hsu, Winston H.
Subjects: Computer Science - Robotics
Abstract: We introduce a new task called Adaptable Error Detection (AED), which aims to identify behavior errors in few-shot imitation (FSI) policies based on visual observations in novel environments. The potential to cause serious damage to surrounding areas limits the application of FSI policies in real-world scenarios. Thus, a robust system is necessary to notify operators when FSI policies are inconsistent with the intent of demonstrations. This task introduces three challenges: (1) detecting behavior errors in novel environments, (2) identifying behavior errors that occur without revealing notable changes, and (3) lacking complete temporal information of the rollout due to the necessity of online detection. However, the existing benchmarks cannot support the development of AED because their tasks do not present all these challenges. To this end, we develop a cross-domain AED benchmark, consisting of 322 base and 153 novel environments. Additionally, we propose Pattern Observer (PrObe) to address these challenges. PrObe is equipped with a powerful pattern extractor and guided by novel learning objectives to parse discernible patterns in the policy feature representations of normal or error states. Through our comprehensive evaluation, PrObe demonstrates superior capability to detect errors arising from a wide range of FSI policies, consistently surpassing strong baselines. Moreover, we conduct detailed ablations and a pilot study on error correction to validate the effectiveness of the proposed architecture design and the practicality of the AED task, respectively.
Published: 2024

39. Developments and applications of the OPTIMADE API for materials discovery, design, and data exchange

Author: Evans, Matthew L., Bergsma, Johan, Merkys, Andrius, Andersen, Casper W., Andersson, Oskar B., Beltrán, Daniel, Blokhin, Evgeny, Boland, Tara M., Balderas, Rubén Castañeda, Choudhary, Kamal, Díaz, Alberto Díaz, García, Rodrigo Domínguez, Eckert, Hagen, Eimre, Kristjan, Montero, María Elena Fuentes, Krajewski, Adam M., Mortensen, Jens Jørgen, Duarte, José Manuel Nápoles, Pietryga, Jacob, Qi, Ji, Carrillo, Felipe de Jesús Trejo, Vaitkus, Antanas, Yu, Jusong, Zettel, Adam, de Castro, Pedro Baptista, Carlsson, Johan, Cerqueira, Tiago F. T., Divilov, Simon, Hajiyani, Hamidreza, Hanke, Felix, Jose, Kevin, Oses, Corey, Riebesell, Janosh, Schmidt, Jonathan, Winston, Donald, Xie, Christen, Yang, Xiaoyu, Bonella, Sara, Botti, Silvana, Curtarolo, Stefano, Draxl, Claudia, Cobas, Luis Edmundo Fuentes, Hospital, Adam, Liu, Zi-Kui, Marques, Miguel A. L., Marzari, Nicola, Morris, Andrew J., Ong, Shyue Ping, Orozco, Modesto, Persson, Kristin A., Thygesen, Kristian S., Wolverton, Chris, Scheidgen, Markus, Toher, Cormac, Conduit, Gareth J., Pizzi, Giovanni, Gražulis, Saulius, Rignanese, Gian-Marco, and Armiento, Rickard
Subjects: Condensed Matter - Materials Science
Abstract: The Open Databases Integration for Materials Design (OPTIMADE) application programming interface (API) empowers users with holistic access to a growing federation of databases, enhancing the accessibility and discoverability of materials and chemical data. Since the first release of the OPTIMADE specification (v1.0), the API has undergone significant development, leading to the upcoming v1.2 release, and has underpinned multiple scientific studies. In this work, we highlight the latest features of the API format, accompanying software tools, and provide an update on the implementation of OPTIMADE in contributing materials databases. We end by providing several use cases that demonstrate the utility of the OPTIMADE API in materials research that continue to drive its ongoing development.
Published: 2024
Full Text: View/download PDF

40. TelTrans: Applying Multi-Type Telecom Data to Transportation Evaluation and Prediction via Multifaceted Graph Modeling

Author: Lin, ChungYi, Tung, Shen-Lung, Su, Hung-Ting, and Hsu, Winston H.
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: To address the limitations of traffic prediction from location-bound detectors, we present Geographical Cellular Traffic (GCT) flow, a novel data source that leverages the extensive coverage of cellular traffic to capture mobility patterns. Our extensive analysis validates its potential for transportation. Focusing on vehicle-related GCT flow prediction, we propose a graph neural network that integrates multivariate, temporal, and spatial facets for improved accuracy. Experiments reveal our model's superiority over baselines, especially in long-term predictions. We also highlight the potential for GCT flow integration into transportation systems., Comment: 7 pages, 7 figures, 4 tables. Accepted by AAAI-24-IAAI, to appear
Published: 2024

41. Excitons and polaritons in singlet fission materials: Photophysics, photochemistry, and optoelectronics

Author: Ostroverkhova, Oksana, Goldthwaite, Winston, and Lamug, Roshell
Published: 2024
Full Text: View/download PDF

42. Mind the gap in kidney care: translating what we know into what we do

Author: Luyckx, Valerie A., Tuttle, Katherine R., Abdellatif, Dina, Correa-Rotter, Ricardo, Fung, Winston W. S., Haris, Agnès, Hsiao, Li-Li, Khalife, Makram, Kumaraswami, Latha A., Loud, Fiona, Raghavan, Vasundhara, Roumeliotis, Stefanos, Sierra, Marianella, Ulasi, Ifeoma, Wang, Bill, Lui, Siu-Fai, Liakopoulos, Vassilios, and Balducci, Alessandro
Published: 2024
Full Text: View/download PDF

43. Neonatal intracranial haemorrhage secondary to vein of Galen aneurysmal dilatation

Author: Wu, Yilong, Chia, Ghim Song, Tan, Timothy S. E., Lim, Winston E. H., Toh, Luke H. W., and Low, Sharon Y. Y.
Published: 2024
Full Text: View/download PDF

44. Diagnostic efficacy of image-guided core needle biopsy of suspected malignant osseous lesions: a retrospective cohort study from a single academic institution

Author: Winkler, Winston L., Baker, Jonathan C., Tomasian, Anderanik, Vander Velde, Theodore L., Hillen, Travis J., Luo, Chongliang, Imaoka, Resten, Dettorre, Gino M., and Jennings, Jack W.
Published: 2024
Full Text: View/download PDF

45. Exploring the efficacy of computer games as a pedagogical tool for teaching and learning programming: A systematic review

Author: Wilson, Kuuku Nyameye, Ghansah, Benjamin, Ananga, Patricia, Oppong, Stephen Opoku, Essibu, Winston Kwamina, and Essibu, Einstein Kow
Published: 2024
Full Text: View/download PDF

46. Thermal and hydrodynamic CFD evaluation of throat sizing effect on biomass gasification performance in downdraft fixed-bed reactor

Author: Kuttin, Kannie Winston, Ding, Lu, and Yu, Guangsuo
Published: 2024
Full Text: View/download PDF

47. Trans-axillary sonography in the ABER (ABduction and External Rotation) position: a window to the subscapularis, teres major and latissimus dorsi

Author: Selvarajah, Logeswaran, Cresswell, Mark, David, Romain, Winston, Paul, and Murray, Timothy
Published: 2024
Full Text: View/download PDF

48. Regional irrigation expansion can support climate-resilient crop production in post-invasion Ukraine

Author: Rosa, Lorenzo, Ragettli, Silvan, Sinha, Ranu, Zhovtonog, Olga, Yu, Winston, and Karimi, Poolad
Published: 2024
Full Text: View/download PDF

49. Effective refractive error coverage and spectacle coverage among school children in Telangana, South India

Author: Prakash, Winston D, Marmamula, Srinivas, Keeffe, Jill, and Khanna, Rohit C.
Published: 2024
Full Text: View/download PDF

50. The excitability of ipsilateral motor evoked potentials is not task-specific and spatially distinct from the contralateral motor hotspot

Author: Seusing, Nelly, Strauss, Sebastian, Fleischmann, Robert, Nafz, Christina, Groppa, Sergiu, Muthuraman, Muthuraman, Ding, Hao, Byblow, Winston D., Lotze, Martin, and Grothe, Matthias
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Category

Publication Type

Journal

Region

Database

Publisher

97,763 results on '"Winston, A."'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources