Author: "A. Koo" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"A. Koo"' showing total 185,630 results

Start Over Author "A. Koo"

185,630 results on '"A. Koo"'

1. Speech Recognition-based Feature Extraction for Enhanced Automatic Severity Classification in Dysarthric Speech

Author: Choi, Yerin, Lee, Jeehyun, and Koo, Myoung-Wan
Subjects: Computer Science - Sound, Computer Science - Artificial Intelligence, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Due to the subjective nature of current clinical evaluation, the need for automatic severity evaluation in dysarthric speech has emerged. DNN models outperform ML models but lack user-friendly explainability. ML models offer explainable results at a feature level, but their performance is comparatively lower. Current ML models extract various features from raw waveforms to predict severity. However, existing methods do not encompass all dysarthric features used in clinical evaluation. To address this gap, we propose a feature extraction method that minimizes information loss. We introduce an ASR transcription as a novel feature extraction source. We finetune the ASR model for dysarthric speech, then use this model to transcribe dysarthric speech and extract word segment boundary information. It enables capturing finer pronunciation and broader prosodic features. These features demonstrated an improved severity prediction performance to existing features: balanced accuracy of 83.72%., Comment: Accepted to SLT 2024
Published: 2024

2. Structured Object Language Modeling (SoLM): Native Structured Objects Generation Conforming to Complex Schemas with Self-Supervised Denoising

Author: Tavanaei, Amir, Koo, Kee Kiat, Ceker, Hayreddin, Jiang, Shaobai, Li, Qi, Han, Julien, and Bouyarmane, Karim
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence
Abstract: In this paper, we study the problem of generating structured objects that conform to a complex schema, with intricate dependencies between the different components (facets) of the object. The facets of the object (attributes, fields, columns, properties) can be a mix of short, structured, type-constrained facts, or long natural-language descriptions. The object has to be self-consistent between the different facets in the redundant information it carries (relative consistency), while being grounded with respect to world knowledge (absolute consistency). We frame the problem as a Language Modeling problem (Structured Object Language Modeling) and train an LLM to perform the task natively, without requiring instructions or prompt-engineering. We propose a self-supervised denoising method to train the model from an existing dataset of such objects. The input query can be the existing object itself, in which case the model acts as a regenerator, completing, correcting, normalizing the input, or any unstructured blurb to be structured. We show that the self-supervised denoising training provides a strong baseline, and that additional supervised fine-tuning with small amount of human demonstrations leads to further improvement. Experimental results show that the proposed method matches or outperforms prompt-engineered general-purpose state-of-the-art LLMs (Claude 3, Mixtral-8x7B), while being order-of-magnitude more cost-efficient.
Published: 2024

3. Jovis: A Visualization Tool for PostgreSQL Query Optimizer

Author: Choi, Yoojin, Han, Juhee, Koo, Kyoseung, and Moon, Bongki
Subjects: Computer Science - Databases, Computer Science - Human-Computer Interaction
Abstract: In the world of relational database management, the query optimizer is a critical component that significantly impacts query performance. To address the challenge of optimizing query performance due to the complexity of optimizers -- especially with join operations -- we introduce Jovis. This novel visualization tool provides a window into the often intricate process of query optimization in PostgreSQL, making it more accessible and understandable. PostgreSQL employs two different query optimization strategies: the Dynamic Programming (DP) Optimizer for most scenarios and the Genetic Query Optimizer (GEQO) for more complex queries with numerous joins, both of which are supported in Jovis. Our tool visualizes the optimizer's decision-making process, from evaluating access paths for each relation to determining join orderings, all using data derived from the optimizer's logs. Jovis not only clarifies the query optimization process through visualizations but also serves as an invaluable learning tool for learners and a practical resource for experienced database professionals looking to optimize their query performance or even the query optimizer itself. The source code has been made available at https://github.com/snu-jovis.
Published: 2024

4. Initial Mass Functions of Young Stellar Clusters from the Gemini Spectroscopic Survey of Nearby Galaxies I. Young Massive Clusters in the Antennae galaxies

Author: Koo, Jae-Rim, Kim, Hyun-Jeong, and Lim, Beomdu
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: The stellar initial mass function (IMF) is a key parameter to understand the star formation process and the integrated properties of stellar populations in remote galaxies. We present a spectroscopic study of young massive clusters (YMCs) in the starburst galaxies NGC 4038/39. The integrated spectra of seven YMCs obtained with GMOS-S attached to the 8.2-m Gemini South telescope reveal the spectral features associated with stellar ages and the underlying IMFs. We constrain the ages of the YMCs using the absorption lines and strong emission bands from Wolf-Rayet stars. The internal reddening is also estimated from the strength of the Na I D absorption lines. Based on these constraints, the observed spectra are matched with the synthetic spectra generated from a simple stellar population model. Several parameters of the clusters including age, reddening, cluster mass, and the underlying IMF are derived from the spectral matching. The ages of the YMCs range from 2.5 to 6.5 Myr, and these clusters contain stellar masses ranging from 1.6 X 10^5 M_sun to 7.9 X 10^7 M_sun. The underlying IMFs appear to differ from the universal form of the Salpeter/Kroupa IMF. Interestingly, massive clusters tend to have the bottom-heavy IMFs, although the masses of some clusters are overestimated due to the crowding effect. Based on this, our results suggest that the universal form of the IMF is not always valid when analyzing integrated light from unresolved stellar systems. However, further study with a larger sample size is required to reach a definite conclusion., Comment: 18 pages, 9 figures, accepted for publication in AJ
Published: 2024

5. TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation

Author: Yoon, Sunjae, Koo, Gwanhyeong, Lee, Younghwan, and Yoo, Chang D.
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Human image animation aims to generate a human motion video from the inputs of a reference human image and a target motion video. Current diffusion-based image animation systems exhibit high precision in transferring human identity into targeted motion, yet they still exhibit irregular quality in their outputs. Their optimal precision is achieved only when the physical compositions (i.e., scale and rotation) of the human shapes in the reference image and target pose frame are aligned. In the absence of such alignment, there is a noticeable decline in fidelity and consistency. Especially, in real-world environments, this compositional misalignment commonly occurs, posing significant challenges to the practical usage of current systems. To this end, we propose Test-time Procrustes Calibration (TPC), which enhances the robustness of diffusion-based image animation systems by maintaining optimal performance even when faced with compositional misalignment, effectively addressing real-world scenarios. The TPC provides a calibrated reference image for the diffusion model, enhancing its capability to understand the correspondence between human shapes in the reference and target images. Our method is simple and can be applied to any diffusion-based image animation system in a model-agnostic manner, improving the effectiveness at test time without additional training., Comment: 24 pages, 16 figures, NeurIPS 2024
Published: 2024

6. Impact of High-Brightness Entangled Photon Pairs on CHSH Inequality Experiment

Author: Kim, Jin-Woo, Lim, Suseong, Kim, Heonoh, and Rhee, June Koo Kevin
Subjects: Quantum Physics
Abstract: Verifying the violation of Bell's inequality is one of the most representative methods to demonstrate that entangled photon pairs prepared in a quantum optics-based system exhibit quantum properties. While experiments on Bell inequality violations have been theoretically well-established and extensively conducted to implement various quantum information technologies in laboratory settings, mathematical modeling for accurately predicting the distribution of high-intensity entangled photon pairs in high-loss environments remains an issue that requires further research. As the brightness of the entangled photon pairs increases, the influence of multi-photon effects becomes more significant, leading to a decrease in the CHSH value $S$ and also a reduction in the standard deviation of the CHSH value $\Delta S$. Therefore, a new analysis of the $(S-2)/\Delta S$ value is required to more precisely confirm the degree of CHSH inequality violation including the reliability of $S$. In this paper, we propose a mathematical model to predict the $(S-2)/\Delta S$ value as a function of the brightness of the entangled photon pair source, and we also suggest the need to optimize the brightness of this source. Additionally, we provide experimental evidence supporting this model. The experiment confirms that when the mean photon number is $\mu=0.026$ in an entanglement distribution setup with a total loss of $-19.03$ dB, the CHSH value drops to 2.69, while the $(S-2)/\Delta S$ value increases to 60.95., Comment: 11 pages, 1 table, and 4 figures
Published: 2024

7. Towards Robust and Efficient Federated Low-Rank Adaptation with Heterogeneous Clients

Author: Koo, Jabin, Jang, Minwoo, and Ok, Jungseul
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: Federated fine-tuning for Large Language Models (LLMs) has recently gained attention due to the heavy communication overhead of transmitting large model updates. Low Rank Adaptation (LoRA) has been proposed as a solution, yet its application in federated learning is complicated by discordance in aggregation. Existing methods addressing this discordance often suffer from performance degradation at low ranks in heterogeneous data settings. In response, we introduce LoRA-A2 (Low Rank Adaptation with Alternating freeze and Adaptive rank selection), which demonstrates robustness in challenging settings with low ranks and high data heterogeneity. Our experimental findings reveal that LoRA-A2 maintains performance even under extreme heterogeneity and low rank conditions, achieving up to a 99.8% reduction in uploaded parameters compared to full fine-tuning without compromising performance. This adaptive mechanism boosts robustness and communication efficiency in federated fine-tuning, enabling the practical deployment of LLMs in resource-constrained environments.
Published: 2024

8. Optimizing Keyphrase Ranking for Relevance and Diversity Using Submodular Function Optimization (SFO)

Author: Umair, Muhammad, Hashmi, Syed Jalaluddin, and Lee, Young-Koo
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence
Abstract: Keyphrase ranking plays a crucial role in information retrieval and summarization by indexing and retrieving relevant information efficiently. Advances in natural language processing, especially large language models (LLMs), have improved keyphrase extraction and ranking. However, traditional methods often overlook diversity, resulting in redundant keyphrases. We propose a novel approach using Submodular Function Optimization (SFO) to balance relevance and diversity in keyphrase ranking. By framing the task as submodular maximization, our method selects diverse and representative keyphrases. Experiments on benchmark datasets show that our approach outperforms existing methods in both relevance and diversity metrics, achieving SOTA performance in execution time. Our code is available online.
Published: 2024

9. SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models

Author: Koo, Jahyun, Hwang, Yerin, Kim, Yongil, Kang, Taegwan, Bae, Hyunkyung, and Jung, Kyomin
Subjects: Computer Science - Computation and Language
Abstract: Despite the success of Large Language Models (LLMs), they still face challenges related to high inference costs and memory requirements. To address these issues, Knowledge Distillation (KD) has emerged as a popular method for model compression, with student-generated outputs (SGOs) being particularly notable for reducing the mismatch between training and inference. However, SGOs often produce noisy and biased sequences, which can lead to misguidance from the teacher model, especially in long sequences. To mitigate these challenges, we propose SWITCH (Studying WIth TeaCHer for Knowledge Distillation), a novel approach that strategically incorporates the teacher model during the student's sequence generation. SWITCH identifies discrepancies between the token probabilities of the teacher and student models, allowing the teacher to intervene selectively, particularly in long sequences that are more prone to teacher misguidance. Extensive experimental results across three model families and five instruction-following datasets show that SWITCH surpasses traditional KD methods, particularly excelling in the generation of long sequential data.
Published: 2024

10. Parameter-Efficient Fine-Tuning of State Space Models

Author: Galim, Kevin, Kang, Wonjun, Zeng, Yuchen, Koo, Hyung Il, and Lee, Kangwook
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language
Abstract: Deep State Space Models (SSMs), such as Mamba (Gu & Dao, 2024), have emerged as powerful tools for language modeling, offering high performance with efficient inference and linear scaling in sequence length. However, the application of parameter-efficient fine-tuning (PEFT) methods to SSM-based models remains largely unexplored. This paper aims to systematically study two key questions: (i) How do existing PEFT methods perform on SSM-based models? (ii) Which modules are most effective for fine-tuning? We conduct an empirical benchmark of four basic PEFT methods on SSM-based models. Our findings reveal that prompt-based methods (e.g., prefix-tuning) are no longer effective, an empirical result further supported by theoretical analysis. In contrast, LoRA remains effective for SSM-based models. We further investigate the optimal application of LoRA within these models, demonstrating both theoretically and experimentally that applying LoRA to linear projection matrices without modifying SSM modules yields the best results, as LoRA is not effective at tuning SSM modules. To further improve performance, we introduce LoRA with Selective Dimension tuning (SDLoRA), which selectively updates certain channels and states on SSM modules while applying LoRA to linear projection matrices. Extensive experimental results show that this approach outperforms standard LoRA., Comment: Code is available at https://github.com/furiosa-ai/ssm-peft
Published: 2024

11. VRVQ: Variable Bitrate Residual Vector Quantization for Audio Compression

Author: Chae, Yunkee, Choi, Woosung, Takida, Yuhta, Koo, Junghyun, Ikemiya, Yukara, Zhong, Zhi, Cheuk, Kin Wai, Martínez-Ramírez, Marco A., Lee, Kyogu, Liao, Wei-Hsiang, and Mitsufuji, Yuki
Subjects: Computer Science - Sound, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Recent state-of-the-art neural audio compression models have progressively adopted residual vector quantization (RVQ). Despite this success, these models employ a fixed number of codebooks per frame, which can be suboptimal in terms of rate-distortion tradeoff, particularly in scenarios with simple input audio, such as silence. To address this limitation, we propose variable bitrate RVQ (VRVQ) for audio codecs, which allows for more efficient coding by adapting the number of codebooks used per frame. Furthermore, we propose a gradient estimation method for the non-differentiable masking operation that transforms from the importance map to the binary importance mask, improving model training via a straight-through estimator. We demonstrate that the proposed training framework achieves superior results compared to the baseline method and shows further improvement when applied to the current state-of-the-art codec., Comment: Accepted at NeurIPS 2024 Workshop on Machine Learning and Compression
Published: 2024

12. The Green Monster hiding in front of Cas A: JWST reveals a dense and dusty circumstellar structure pockmarked by ejecta interactions

Author: De Looze, Ilse, Milisavljevic, Dan, Temim, Tea, Dickinson, Danielle, Fesen, Robert, Arendt, Richard G., Chastenet, Jeremy, Orlando, Salvatore, Vink, Jacco, Barlow, Michael J., Kirchschlager, Florian, Priestley, Felix D., Raymond, John C., Rho, Jeonghee, Sartorio, Nina S., Scheffler, Tassilo, Schmidt, Franziska, Blair, William P., Fox, Ori, Fryer, Christopher, Janka, Hans-Thomas, Koo, Bon-Chul, Laming, J. Martin, Matsuura, Mikako, Patnaude, Dan, Relano, Monica, Rest, Armin, Schmidt, Judy, Smith, Nathan, and Sravan, Niharika
Subjects: Astrophysics - High Energy Astrophysical Phenomena, Astrophysics - Solar and Stellar Astrophysics
Abstract: JWST observations of the young Galactic supernova remnant Cassiopeia A revealed an unexpected structure seen as a green emission feature in colored composite MIRI F1130W and F1280W images - hence dubbed the Green Monster - that stretches across the central parts of the remnant in projection. Combining the kinematic information from NIRSpec and MIRI MRS with the multi-wavelength imaging from NIRCam and MIRI, we associate the Green Monster with circumstellar material that was lost during an asymmetric mass-loss phase. MIRI images are dominated by dust emission but its spectra show emission lines from Ne, H and Fe with low radial velocities indicative of a CSM nature. An X-ray analysis of this feature in a companion paper (Vink et al. 2024) supports its CSM nature and detects significant blue shifting, thereby placing the Green Monster on the near side, in front of the Cas A SN remnant. The most striking features of the Green Monster are dozens of almost perfectly circular 1" - 3" sized holes, most likely created by interaction between high-velocity SN ejecta material and the CSM. Further investigation is needed to understand whether these holes were formed by small 8000-10500 km/s N-rich ejecta knots that penetrated and advanced out ahead of the remnant's 5000 - 6000 km/s outer blastwave, or by narrow ejecta fingers that protrude into the forward-shocked CSM. The detection of the Green Monster provides further evidence of the highly asymmetric mass-loss that Cas A's progenitor star underwent prior to explosion., Comment: 28 pages, 12 figures, resubmitted to ApJL after minor revision, comments welcome
Published: 2024

13. Consistent and Repeatable Testing of mMIMO O-RU across labs: A Japan-Singapore Experience

Author: Nguyen, Thanh-Tam, Ngo, Mao V., Chen, Binbin, Kuchitsu, Mitsuhiro, Wai, Serena, Kawai, Seitaro, Suzuki, Kenya, Koo, Eng Wei, and Quek, Tony
Subjects: Computer Science - Networking and Internet Architecture
Abstract: Open Radio Access Networks (RAN) aim to bring a paradigm shift to telecommunications industry, by enabling an open, intelligent, virtualized, and multi-vendor interoperable RAN ecosystem. At the center of this movement, O-RAN ALLIANCE defines the O-RAN architecture and standards, so that companies around the globe can use these specifications to create innovative and interoperable solutions. To accelerate the adoption of O-RAN products, rigorous testing of O-RAN Radio Unit (O-RU) and other O-RAN products plays a key role. O-RAN ALLIANCE has approved around 20 Open Testing and Integration Centres (OTICs) globally. OTICs serve as vendor-neutral platforms for providing the testing and integration services, with the vision that an O-RAN product certified in any OTIC is accepted in other parts of the world. To demonstrate the viability of such a certified-once-and-use-everywhere approach, one theme in the O-RAN Global PlugFest Spring 2024 is to demonstrate consistent and repeatable testing for the open fronthaul interface across multiple labs. Towards this, Japan OTIC and Asia Pacific OTIC in Singapore have teamed up together with an O-RU vendor and Keysight Technology. Our international team successfully completed all test cases defined by O-RAN ALLIANCE for O-RU conformance testing. In this paper, we share our journey in achieving this outcome, focusing on the challenges we have overcome and the lessons we have learned through this process., Comment: Published version at RitiRAN Workshop - co-located with IEEE VTC Fall 2024
Published: 2024

14. Pomeranchuk Instability Induced by an Emergent Higher-Order van Hove Singularity on the Distorted Kagome Surface of Co$_3$Sn$_2$S$_2$

Author: Nag, Pranab Kumar, Batabyal, Rajib, Ingham, Julian, Morali, Noam, Tan, Hengxin, Koo, Jahyun, Consiglio, Armando, Liu, Enke, Avraham, Nurit, Queiroz, Raquel, Thomale, Ronny, Yan, Binghai, Felser, Claudia, and Beidenkopf, Haim
Subjects: Condensed Matter - Strongly Correlated Electrons
Abstract: Materials hosting flat bands at the vicinity of the Fermi level promote exotic symmetry broken states. Common to many of these are van Hove singularities at saddle points of the dispersion or even higher-order van Hove singularities where the dispersion is flattened further. The band structure of kagome metals hosts both a flat band and two regular saddle points flanking a Dirac node. We investigate the kagome ferromagnetic metal Co$_3$Sn$_2$S$_2$ using scanning tunneling spectroscopy. We identify a new mechanism by which a triangular distortion on its kagome Co$_3$Sn surface termination considerably flattens the saddle point dispersion, and induces an isolated higher-order van Hove singularity (HOvHS) with algebraically divergent density of states pinned to the Fermi energy. The distortion-induced HOvHS precipitates a Pomeranchuk instability of the Fermi surface, resulting in the formation of a series of nematic electronic states. We visualize the nematic order across an energy shell of about 100 meV in both real-, reciprocal-, and momentum-spaces, as a cascade of wavefunction distributions which spontaneously break the remaining rotational symmetry of the underlying distorted kagome lattice, without generating any additional translational symmetry breaking. It signifies the spontaneous removal of a subset of saddle points from the Fermi energy to lower energies. By tracking the electronic wavefunction structure across the deformed Fermi surface we further identify a charge pumping-like evolution of the wavefunction center of mass. The mechanism we find for the generation of higher-order saddle points under a kagome distortion may be common to other kagome materials, and potentially other lattice structures, suggesting a generic new avenue for inducing unconventional electronic instabilities towards exotic states of matter.
Published: 2024

15. How Can It Really Be Effective? Experiences of Asynchronous and Synchronous Learning in Online Counseling Graduate Programs

Author: Katie Koo and Mei Jiang
Abstract: By interviewing eleven graduate students in online counseling programs, this interpretive qualitative study explored students' perceptions and practical experiences of synchronous and asynchronous learning methods. The results suggested that counseling students preferred in-person counseling classes to online courses, found synchronous methods more effective than asynchronous methods, considered asynchronous methods less stressful compared to synchronous methods, and felt that micro-counseling skills and emotional reactions were not fully attainable in their online counseling training programs. Based on these findings, we recommend further systematic investigation of counseling graduate students' diverse experiences and perceptions of synchronous and asynchronous learning methods.
Published: 2024

16. The Development and Validation of English Communicative Competence Model for High School Students in Korea

Author: Whyun Young Choi and Mun-Koo Kang
Abstract: This study develops and validates an English communicative competence model for Korean high school students, in response to the need to redefine the relevant concepts and components of competence that are demanded by the rapidly evolving future society. Drawing on Celce-Murcia's (2008) theoretical model on communicative competence, this research conceptualized a model that could assess high school students' English communicative competence by examining relevant domestic and international studies as well as theoretical reflections. Expert opinions from a two-stage Delphi survey were compiled and incorporated to revise, supplement, and validate the English communicative competence among high school students reflecting Korea's English education environment. Following this process, the conceptual model for English communicative competence was reorganized into five sub-competences (sociolinguistic, discourse, linguistic, interactional, and strategic competence) and 15 corresponding subfactors. The content validity ratio values for the conceptual definition and factor structure of this model were all above 0.64, thus affirming the validity of the conceptual definition and factor structure.
Published: 2024

17. Analog reservoir computing via ferroelectric mixed phase boundary transistors.

Author: Kim, Jangsaeng, Park, Eun, Shin, Wonjun, Koo, Ryun-Han, Han, Chang-Hyeon, Kang, He, Yang, Tae, Goh, Youngin, Lee, Kilho, Ha, Daewon, Cheema, Suraj, Jeong, Jae, and Kwon, Daewoong
Abstract: Analog reservoir computing (ARC) systems have attracted attention owing to their efficiency in processing temporal information. However, the distinct functionalities of the system components pose challenges for hardware implementation. Herein, we report a fully integrated ARC system that leverages material versatility of the ferroelectric-to-mixed phase boundary (MPB) hafnium zirconium oxides integrated onto indium-gallium-zinc oxide thin-film transistors (TFTs). MPB-based TFTs (MPBTFTs) with nonlinear short-term memory characteristics are utilized for physical reservoirs and artificial neuron, while nonvolatile ferroelectric TFTs mimic synaptic behavior for readout networks. Furthermore, double-gate configuration of MPBTFTs enhances reservoir state differentiation and state expansion for physical reservoir and processes both excitatory and inhibitory pulses for neuronal functionality with minimal hardware burden. The seamless integration of ARC components on a single wafer executes complex real-world time-series predictions with a low normalized root mean squared error of 0.28. The material-device co-optimization proposed in this study paves the way for the development of area- and energy-efficient ARC systems.
Published: 2024

18. Improvement in Patient-reported Symptoms and Satisfaction with Tildrakizumab in a Real-world Study in Patients with Moderate-to-severe Plaque Psoriasis.

Author: Vasquez, Juan, Heim, Jayme, Bhutani, Tina, Koo, John, Mathew, Jacob, Ferro, Thomas, and Bhatia, Neal
Subjects: Phase 4 clinical trial, patient satisfaction, patient-reported outcomes, psoriasis, tildrakizumab
Abstract: OBJECTIVE: Tildrakizumab, an anti-interleukin-23 p19 monoclonal antibody, is approved for the treatment of adults with moderate-to-severe plaque psoriasis. Limited evidence is available regarding the effects of tildrakizumab on patient-reported symptoms and satisfaction. This report describes the secondary endpoints of patient-reported symptoms and treatment satisfaction over 64 weeks in patients with moderate-to-severe plaque psoriasis treated with tildrakizumab in a Phase IV, real-world study. METHODS: In this uncontrolled, open-label study (NCT03718299), patients received tildrakizumab 100 mg at baseline, Week (W)4, and every 12 weeks thereafter to W52, with the final assessment at W64. Patient-reported secondary endpoints included numerical rating scale (NRS) scores for itch, pain, and scaling, and treatment satisfaction measured by 3 rating scales (Treatment Satisfaction Questionnaire for Medication [TSQM], Tildrakizumab Overall Satisfaction, and Patient Happiness with Psoriasis Control instrument) through W64. RESULTS: Of the 55 patients enrolled, 45 were assessed at W64. Mean NRS scores for itch, pain, and scaling all decreased from baseline beginning as early as W4 with maintenance through W64 (P≤0.001). Treatment satisfaction was positive throughout treatment based on all 3 measures. Mean±SD TSQM domain scores increased from 59.5±17.0 at W4 to 79.5±20.1 at W64 for Effectiveness and from 72.7±18.6 to 81.9±20.5 for Global Satisfaction. LIMITATIONS: The study is small and lacks a comparator arm. CONCLUSION: Tildrakizumab treatment improved patient-reported symptoms in patients with moderate-to-severe plaque psoriasis in a real-world setting and was associated with high levels of treatment satisfaction over 64 weeks.
Published: 2024

19. Descemet Endothelial Thickness Comparison Trial II (DETECT II): multicentre, outcome assessor-masked, placebo-controlled trial comparing Descemet membrane endothelial keratoplasty (DMEK) to Descemet stripping only (DSO) with adjunctive ripasudil for Fuchs dystrophy.

Author: Lin, Charles, Chamberlain, Winston, Benetz, Beth, Gensheimer, William, Li, Jennifer, Jeng, Bennie, Clover, Jameson, Varnado, Nicole, Abdelrahman, Sarah, Srinivasan, Amrita, Syed, Zeba, Koo, Ellen, Arnold, Benjamin, Lietman, Thomas, Lass, Jonathan, and Rose-Nussbaumer, Jennifer
Subjects: clinical trial, cornea, Humans, Fuchs Endothelial Dystrophy, Descemet Stripping Endothelial Keratoplasty, Visual Acuity, Male, Sulfonamides, Female, Isoquinolines, Endothelium, Corneal, Descemet Membrane, Treatment Outcome, Middle Aged, Aged, Ophthalmic Solutions, Multicenter Studies as Topic
Abstract: INTRODUCTION: It remains uncertain whether Descemet membrane endothelial keratoplasty (DMEK) or Descemet stripping only (DSO) yields better outcomes in patients with symptomatic Fuchs endothelial corneal dystrophy (FECD). This paper presents the protocol for the Descemet Endothelial Thickness Comparison Trial II (DETECT II), a multicentre, outcome-masked, randomised, placebo-controlled, clinical trial comparing DMEK to DSO with ripasudil (DSO-R) for this patient population. METHODS AND ANALYSIS: A total of 60 patients with endothelial dysfunction due to symptomatic FECD will be enrolled from seven participating sites in the USA. The patients will be randomly assigned in a 1:1 ratio to one of the following treatment groups: group 1-DMEK plus topical placebo and group 2-DSO plus topical ripasudil 0.4%. The enrolment period is 24 months. The primary outcome is best spectacle-corrected visual acuity at 12 months. Secondary outcomes include peripheral and central endothelial cell density, visual acuity, vision-related quality of life and Pentacam Scheimpflug tomography. Study outcomes will be analysed using mixed effects linear regression. Adverse events, including rebubble procedures, endothelial failure and graft rejection, will be documented and analysed using appropriate statistical methods. DETECT II aims to provide evidence on the comparative effectiveness of DMEK and DSO-R. The results of this trial will contribute to optimising the treatment of FECD, while also exploring the cost-effectiveness of these interventions. Dissemination of findings through peer-reviewed publications and national/international meetings will facilitate knowledge translation and guide clinical practice in the field of corneal transplantation. ETHICS AND DISSEMINATION: A data and safety monitoring committee has been empanelled by the National Eye Institute. All study protocols will be subject to review and approval by WCG IRB as the single IRB of record. This study will comply with the National Institute of Health (NIH) Data Sharing Policy and Policy on the Dissemination of NIH-Funded Clinical Trial Information and the Clinical Trials Registration and Results Information Submission rule. Data from the trial will be made available on reasonable request. TRIAL REGISTRATION NUMBER: NCT05275972.
Published: 2024

20. Magnetization Plateaus by the Field-Induced Partitioning of Spin Lattices

Author: Whangbo, Myung-Hwan, Koo, Hyun-Joo, Kremer, Reinhard K., and Vasiliev, Alexander N.
Subjects: Condensed Matter - Materials Science, Condensed Matter - Strongly Correlated Electrons, 81, A.1, B.0, C.0
Abstract: To search for a conceptual picture describing the magnetization plateau phenomenon, we surveyed the crystal structures and the spin lattices of those magnets exhibiting plateaus in their magnetization vs. magnetic field curves by probing the three questions: (a) why only certain magnets exhibit magnetization plateaus, (b) why there occur several different types of magnetization plateaus, and (c) what controls the widths of magnetization plateaus. We show that the answers to these questions lie in how the magnets under field absorb Zeeman energy hence changing their magnetic structures. The magnetic structure of a magnet insulator is commonly described in terms of its spin lattice, which requires the determination of the spin exchanges nonnegligible strengths between the magnetic ions. Our work strongly suggests that a magnet under magnetic field partitions its spin lattice into antiferromagnetic (AFM) or ferrimagnetic fragments by breaking its weak magnetic bonds. Our supposition of the field-induced partitioning of spin lattice into magnetic fragments is supported by the anisotropic magnetization plateaus of Ising magnets and by the highly anisotropic width of the 1/3-magnetization plateau in azurite. The answers to the three questions (a) - (c) emerge naturally by analyzing how these fragments are formed under magnetic field., Comment: A survey of the magnetization plateau phenomenon; 128 pages including the Supporting Information
Published: 2024

21. DNI: Dilutional Noise Initialization for Diffusion Video Editing

Author: Yoon, Sunjae, Koo, Gwanhyeong, Hong, Ji Woo, and Yoo, Chang D.
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Text-based diffusion video editing systems have been successful in performing edits with high fidelity and textual alignment. However, this success is limited to rigid-type editing such as style transfer and object overlay, while preserving the original structure of the input video. This limitation stems from an initial latent noise employed in diffusion video editing systems. The diffusion video editing systems prepare initial latent noise to edit by gradually infusing Gaussian noise onto the input video. However, we observed that the visual structure of the input video still persists within this initial latent noise, thereby restricting non-rigid editing such as motion change necessitating structural modifications. To this end, this paper proposes Dilutional Noise Initialization (DNI) framework which enables editing systems to perform precise and dynamic modification including non-rigid editing. DNI introduces a concept of `noise dilution' which adds further noise to the latent noise in the region to be edited to soften the structural rigidity imposed by input video, resulting in more effective edits closer to the target prompt. Extensive experiments demonstrate the effectiveness of the DNI framework., Comment: 17 pages, 11 figures, ECCV 2024
Published: 2024

22. A new critical growth parameter and mechanistic model for SiC nanowire synthesis via Si substrate carbonization: the role of H$_2$/CH$_4$ gas flow ratio

Author: Koo, Junghyun and Kim, Chinkyo
Subjects: Physics - Applied Physics, Condensed Matter - Materials Science
Abstract: SiC structures, including nanowires and films, can be effectively grown on Si substrates through carbonization. However, growth parameters other than temperature, which influence the preferential formation of SiC nanowires or films, have not yet been identified. In this work, we investigate SiC synthesis via Si carbonization using methane (CH$_4$) by varying the growth temperature and the hydrogen to methane gas flow ratio (H$_2$/CH$_4$). We demonstrate that adjusting these parameters allows for the preferential growth of SiC nanowires or films. Specifically, SiC nanowires are preferentially grown when the H$_2$/CH$_4$ ratio exceeds a specific threshold, which varies with the growth temperature, ranging between 1200$^\circ$C and 1310$^\circ$C. Establishing this precise growth window for SiC nanowires in terms of the H$_2$/CH$_4$ ratio and growth temperature provides new insights into the parameter-driven morphology of SiC. Furthermore, we propose a mechanistic model to explain the preferential growth of either SiC nanowires or films, based on the kinetics of gas-phase reactions and surface processes. These findings not only advance our understanding of SiC growth mechanisms but also pave the way for optimized fabrication strategies for SiC-based nanostructures., Comment: 6 pages, 5 figures
Published: 2024

23. Latent Diffusion Bridges for Unsupervised Musical Audio Timbre Transfer

Author: Mancusi, Michele, Halychanskyi, Yurii, Cheuk, Kin Wai, Lai, Chieh-Hsin, Uhlich, Stefan, Koo, Junghyun, Martínez-Ramírez, Marco A., Liao, Wei-Hsiang, Fabbro, Giorgio, and Mitsufuji, Yuki
Subjects: Computer Science - Sound, Computer Science - Artificial Intelligence, Computer Science - Information Retrieval, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Music timbre transfer is a challenging task that involves modifying the timbral characteristics of an audio signal while preserving its melodic structure. In this paper, we propose a novel method based on dual diffusion bridges, trained using the CocoChorales Dataset, which consists of unpaired monophonic single-instrument audio data. Each diffusion model is trained on a specific instrument with a Gaussian prior. During inference, a model is designated as the source model to map the input audio to its corresponding Gaussian prior, and another model is designated as the target model to reconstruct the target audio from this Gaussian prior, thereby facilitating timbre transfer. We compare our approach against existing unsupervised timbre transfer models such as VAEGAN and Gaussian Flow Bridges (GFB). Experimental results demonstrate that our method achieves both better Fr\'echet Audio Distance (FAD) and melody preservation, as reflected by lower pitch distances (DPD) compared to VAEGAN and GFB. Additionally, we discover that the noise level from the Gaussian prior, $\sigma$, can be adjusted to control the degree of melody preservation and amount of timbre transferred.
Published: 2024

24. Global Mild Solutions to a BGK Model for Barotropic Gas Dynamics

Author: Koo, Dowan and Song, Sihyun
Subjects: Mathematics - Analysis of PDEs, 82C40, 76N15, 35F25
Abstract: We establish global existence of mild solutions to the BGK model proposed by Bouchut [J. Stat. Phys., 95, (1999), 113--170] under the minimal assumption of finite kinetic entropy initial data. Moreover we rigorously derive a kinetic entropy inequality, which combined with the theory developed by Berthelin and Vasseur [SIAM J. Math. Anal., 36, (2005), 1807--1835] leads to the hydrodynamic limit to the barotropic Euler equations. The main tools employed in the analysis are stability estimates for the Maxwellian and a velocity averaging lemma., Comment: 24 pages, 1 figure
Published: 2024

25. OPAL: Outlier-Preserved Microscaling Quantization Accelerator for Generative Large Language Models

Author: Koo, Jahyun, Park, Dahoon, Jung, Sangwoo, and Kung, Jaeha
Subjects: Computer Science - Machine Learning, Computer Science - Hardware Architecture, Computer Science - Computation and Language
Abstract: To overcome the burden on the memory size and bandwidth due to ever-increasing size of large language models (LLMs), aggressive weight quantization has been recently studied, while lacking research on quantizing activations. In this paper, we present a hardware-software co-design method that results in an energy-efficient LLM accelerator, named OPAL, for generation tasks. First of all, a novel activation quantization method that leverages the microscaling data format while preserving several outliers per sub-tensor block (e.g., four out of 128 elements) is proposed. Second, on top of preserving outliers, mixed precision is utilized that sets 5-bit for inputs to sensitive layers in the decoder block of an LLM, while keeping inputs to less sensitive layers to 3-bit. Finally, we present the OPAL hardware architecture that consists of FP units for handling outliers and vectorized INT multipliers for dominant non-outlier related operations. In addition, OPAL uses log2-based approximation on softmax operations that only requires shift and subtraction to maximize power efficiency. As a result, we are able to improve the energy efficiency by 1.6~2.2x, and reduce the area by 2.4~3.1x with negligible accuracy loss, i.e., <1 perplexity increase., Comment: 7 pages, 8 figures, DAC2024 accepted
Published: 2024

26. Understanding Online Discussion Across Difference: Insights from Gun Discourse on Reddit

Author: Magu, Rijul, Kumar, Nivedhitha Mathan, Liu, Yihe, Koo, Xander, Yang, Diyi, and Bruckman, Amy
Subjects: Computer Science - Social and Information Networks, Computer Science - Computers and Society, J.4, K.4
Abstract: When discussing difficult topics online, is it common to meaningfully engage with people from diverse perspectives? Why or why not? Could features of the online environment be redesigned to encourage civil conversation across difference? In this paper, we study discussions of gun policy on Reddit, with the overarching goal of developing insights into the potential of the internet to support understanding across difference. We use two methods: a clustering analysis of Reddit posts to contribute insights about what people discuss, and an interview study of twenty Reddit users to help us understand why certain kinds of conversation take place and others don't. We find that the discussion of gun politics falls into three groups: conservative pro-gun, liberal pro-gun, and liberal anti-gun. Each type of group has its own characteristic topics. While our subjects state that they would be willing to engage with others across the ideological divide, in practice they rarely do. Subjects are siloed into like-minded subreddits through a two-pronged effect, where they are simultaneously pushed away from opposing-view communities while actively seeking belonging in like-minded ones. Another contributing factor is Reddit's "karma" mechanism: fear of being downvoted and losing karma points and social approval of peers causes our subjects to hesitate to say anything in conflict with group norms. The pseudonymous nature of discussion on Reddit plays a complex role, with some subjects finding it freeing and others fearing reprisal from others not bound by face-to-face norms of politeness. Our subjects believe that content moderation can help ameliorate these issues; however, our findings suggest that moderators need different tools to do so effectively. We conclude by suggesting platform design changes that might increase discussion across difference., Comment: CSCW 2024
Published: 2024

27. Explainable AI for computational pathology identifies model limitations and tissue biomarkers

Author: Kaczmarzyk, Jakub R., Saltz, Joel H., and Koo, Peter K.
Subjects: Quantitative Biology - Tissues and Organs
Abstract: Introduction: Deep learning models hold great promise for digital pathology, but their opaque decision-making processes undermine trust and hinder clinical adoption. Explainable AI methods are essential to enhance model transparency and reliability. Methods: We developed HIPPO, an explainable AI framework that systematically modifies tissue regions in whole slide images to generate image counterfactuals, enabling quantitative hypothesis testing, bias detection, and model evaluation beyond traditional performance metrics. HIPPO was applied to a variety of clinically important tasks, including breast metastasis detection in axillary lymph nodes, prognostication in breast cancer and melanoma, and IDH mutation classification in gliomas. In computational experiments, HIPPO was compared against traditional metrics and attention-based approaches to assess its ability to identify key tissue elements driving model predictions. Results: In metastasis detection, HIPPO uncovered critical model limitations that were undetectable by standard performance metrics or attention-based methods. For prognostic prediction, HIPPO outperformed attention by providing more nuanced insights into tissue elements influencing outcomes. In a proof-of-concept study, HIPPO facilitated hypothesis generation for identifying melanoma patients who may benefit from immunotherapy. In IDH mutation classification, HIPPO more robustly identified the pathology regions responsible for false negatives compared to attention, suggesting its potential to outperform attention in explaining model decisions. Conclusions: HIPPO expands the explainable AI toolkit for computational pathology by enabling deeper insights into model behavior. This framework supports the trustworthy development, deployment, and regulation of weakly-supervised models in clinical and research settings, promoting their broader adoption in digital pathology.
Published: 2024

28. Pre-Trained Language Models for Keyphrase Prediction: A Review

Author: Umair, Muhammad, Sultana, Tangina, and Lee, Young-Koo
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Keyphrase Prediction (KP) is essential for identifying keyphrases in a document that can summarize its content. However, recent Natural Language Processing (NLP) advances have developed more efficient KP models using deep learning techniques. The limitation of a comprehensive exploration jointly both keyphrase extraction and generation using pre-trained language models spotlights a critical gap in the literature, compelling our survey paper to bridge this deficiency and offer a unified and in-depth analysis to address limitations in previous surveys. This paper extensively examines the topic of pre-trained language models for keyphrase prediction (PLM-KP), which are trained on large text corpora via different learning (supervisor, unsupervised, semi-supervised, and self-supervised) techniques, to provide respective insights into these two types of tasks in NLP, precisely, Keyphrase Extraction (KPE) and Keyphrase Generation (KPG). We introduce appropriate taxonomies for PLM-KPE and KPG to highlight these two main tasks of NLP. Moreover, we point out some promising future directions for predicting keyphrases.
Published: 2024
Full Text: View/download PDF

29. Strategy of satellite QKD with passive high brightness entangled photon pair source

Author: Kim, Jin-Woo, Lim, Suseong, Kim, Heonoh, and Rhee, June-Koo Kevin
Subjects: Quantum Physics, 81V80
Abstract: A high-brightness entangled photon pair (HBEPP) source is essential for conducting entanglement-based quantum key distribution (QKD) between a satellite and a ground station. While an ultrabright source can overcome significant losses in satellite-based QKD (SQKD) and increase the sifted key rate, it also induces the multi-photon effect, raising the system's error rate. To accurately estimate system performance, we first present an analytical model for calculating the measurement probabilities of HBEPP distribution through an asymmetric loss channel. Based on this model, we propose the use of a passive-intensity HBEPP source for SQKD systems, assuming a polarization-independent channel and threshold detectors for measurement. We confirm that fixing the mean photon number at $\bar{\mu}=0.1$ achieves a performance of $99.7\%$ compared to the ideal one-way communication entanglement-based SQKD protocol, which is effectively optimizing the HBEPP source brightness in accordance with system losses., Comment: 9 pages, 5 figures, 3 tables
Published: 2024

30. Thermoelectric signature of quantum criticality in the heavy-fermion superconductor CeRhIn$_5$

Author: Cao, Zi-Yu, Wang, Honghong, Park, Chan-Koo, Park, Tae Beom, Jang, Harim, Seo, Soonbeom, Kim, Sung-Il, and Park, Tuson
Subjects: Condensed Matter - Superconductivity, Condensed Matter - Strongly Correlated Electrons
Abstract: The evolution of the Fermi surface across the quantum critical point (QCP), which is relevant for characterizing the quantum criticality and understanding its relation with unconventional superconductivity, is an intriguing subject in the study of strongly correlated electron systems. In this study, we report the thermopower measurements to investigate a change in Fermi surface across the QCP in pure and 4.4% Sn-doped CeRhIn$_5$. Results show that their thermopower behavior differs significantly in the vicinity of their respective pressure-induced QCP. In pure CeRhIn$_5$, a drastic collapse of the thermopower takes place at the Kondo breakdown QCP, where the Fermi surface reconstructs concurrently with the development of the magnetic order. By contrast, the thermopower exhibits a broadly symmetric behavior around the QCP in 4.4% Sn-doped CeRhIn$_5$, which is a characteristic of the spin-density-wave QCP. These observations are consistent with the theoretical expectations and suggest the effectiveness of thermopower measurement in discriminating the nature of quantum criticality in heavy-fermion systems., Comment: 12 pages, 4 figures
Published: 2024

31. SDSS-IV MaNGA: Stellar rotational support in disk galaxies vs. central surface density and stellar population age

Author: Wang, Xiaohan, Luo, Yifei, Faber, S. M., Koo, David C., Mao, Shude, Westfall, Kyle B., Lu, Shengdong, Wang, Weichen, Bundy, Kevin, Boardman, N., Avila-Reese, Vladimir, Fernández-Trincado, José G., and Lane, Richard R.
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We investigate how the stellar rotational support changes as a function of spatially resolved stellar population age ($\rm D_n4000$) and relative central stellar surface density ($\Delta \Sigma_1$) for MaNGA isolated/central disk galaxies. We find that the galaxy rotational support $\lambda_{R_\mathrm{e}}$ varies smoothly as a function of $\Delta \Sigma_1$ and $\rm D_n4000$. $\rm D_n4000$ vs. $\Delta \Sigma_1$ follows a "J-shape", with $\lambda_{R_\mathrm{e}}$ contributing to the scatters. In this "J-shaped" pattern rotational support increases with central $\rm D_n4000$ when $\Delta \Sigma_1$ is low but decreases with $\Delta \Sigma_1$ when $\Delta \Sigma_1$ is high. Restricting attention to low-$\Delta \Sigma_1$ (i.e, large-radius) galaxies, we suggest that the trend of increasing rotational support with $\rm D_n4000$ for these objects is produced by a mix of two different processes, a primary trend characterized by growth in $\lambda_{R_\mathrm{e}}$ along with mass through gas accretion, on top of which disturbance episodes are overlaid, which reduce rotational support and trigger increased star formation. An additional finding is that star forming galaxies with low $\Delta \Sigma_1$ have relatively larger radii than galaxies with higher $\Delta \Sigma_1$ at fixed stellar mass. Assuming that these relative radii rankings are preserved while galaxies are star forming then implies clear evolutionary paths in central $\rm D_n4000$ vs. $\Delta \Sigma_1$. The paper closes with comments on the implications that these paths have for the evolution of pseudo-bulges vs. classical-bulges. The utility of using $\rm D_n4000$-$\Delta \Sigma_1$ to study $\lambda_{R_\mathrm{e}}$ reinforces the notion that galaxy kinematics correlate both with structure and with stellar-population state, and indicates the importance of a multi-dimensional description for understanding bulge and galaxy evolution., Comment: 24 pages, 22 figures (including Appendix), accepted for publication in MNRAS
Published: 2024
Full Text: View/download PDF

32. SPECtrophotometer for TRansmission spectroscopy of exoplanets (SPECTR)

Author: Choi, Yeon-Ho, Park, Myeong-Gu, Kim, Kang-Min, Koo, Jae-Rim, Bang, Tae-Yang, Park, Chan, Jang, Jeong-Gyun, Han, Inwoo, Jang, Bi-Ho, Lee, Jong Ung, Jeong, Ueejeong, and Lee, Byeong-Cheol
Subjects: Astrophysics - Earth and Planetary Astrophysics, Astrophysics - Instrumentation and Methods for Astrophysics
Abstract: The SPECtrophotometer for TRansmission spectroscopy of exoplanets (SPECTR) is a new low-resolution optical (3800 {\AA} - 6850 {\AA}) spectrophotometer installed at the Bohyunsan Optical Astronomy Observatory (BOAO) 1.8 m telescope. SPECTR is designed for observing the transmission spectra of transiting exoplanets. Unique features of SPECTR are its long slit length of 10 arcminutes which facilitates observing the target and the comparison star simultaneously, and its wide slit width to minimize slit losses. SPECTR will be used to survey exoplanets, such as those identified by the Transiting Exoplanet Survey Satellite (TESS), providing information about their radii across the wavelength range. In this paper, we present the design of SPECTR and the observational results of the partial transit of HD 189733 b and a full transit of Qatar-8 b. Analyses show the SPECTR's capability on the white light curves with an accuracy of one ppt. The transmission spectrum of HD 189733 b shows general agreement with previous studies., Comment: 12 pages, 11 figures, 5 tables, accepted for PASP
Published: 2024

33. Overcoming Uncertain Incompleteness for Robust Multimodal Sequential Diagnosis Prediction via Knowledge Distillation and Random Data Erasing

Author: Koo, Heejoon
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: In this paper, we present NECHO v2, a novel framework designed to enhance the predictive accuracy of multimodal sequential patient diagnoses under uncertain missing visit sequences, a common challenge in real clinical settings. Firstly, we modify NECHO, designed in a diagnosis code-centric fashion, to handle uncertain modality representation dominance under the imperfect data. Secondly, we develop a systematic knowledge distillation by employing the modified NECHO as both teacher and student. It encompasses a modality-wise contrastive and hierarchical distillation, transformer representation random distillation, along with other distillations to align representations between teacher and student tightly and effectively. We also utilise random erasing on individual data points within sequences during both training and distillation of the teacher to lightly simulate scenario with missing visit information, thereby fostering effective knowledge transfer. As a result, NECHO v2 verifies itself by showing robust superiority in multimodal sequential diagnosis prediction under both balanced and imbalanced incomplete settings on multimodal healthcare data., Comment: 5 pages, 1 figure, and 4 tables
Published: 2024

34. FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing

Author: Koo, Gwanhyeong, Yoon, Sunjae, Hong, Ji Woo, and Yoo, Chang D.
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Current image editing methods primarily utilize DDIM Inversion, employing a two-branch diffusion approach to preserve the attributes and layout of the original image. However, these methods encounter challenges with non-rigid edits, which involve altering the image's layout or structure. Our comprehensive analysis reveals that the high-frequency components of DDIM latent, crucial for retaining the original image's key features and layout, significantly contribute to these limitations. Addressing this, we introduce FlexiEdit, which enhances fidelity to input text prompts by refining DDIM latent, by reducing high-frequency components in targeted editing areas. FlexiEdit comprises two key components: (1) Latent Refinement, which modifies DDIM latent to better accommodate layout adjustments, and (2) Edit Fidelity Enhancement via Re-inversion, aimed at ensuring the edits more accurately reflect the input text prompts. Our approach represents notable progress in image editing, particularly in performing complex non-rigid edits, showcasing its enhanced capability through comparative experiments., Comment: ECCV 2024
Published: 2024

35. Adversarial Databases Improve Success in Retrieval-based Large Language Models

Author: Wu, Sean, Koo, Michael, Kao, Li Yo, Black, Andy, Blum, Lesley, Scalzo, Fabien, and Kurtz, Ira
Subjects: Computer Science - Computation and Language
Abstract: Open-source LLMs have shown great potential as fine-tuned chatbots, and demonstrate robust abilities in reasoning and surpass many existing benchmarks. Retrieval-Augmented Generation (RAG) is a technique for improving the performance of LLMs on tasks that the models weren't explicitly trained on, by leveraging external knowledge databases. Numerous studies have demonstrated the effectiveness of RAG to more successfully accomplish downstream tasks when using vector datasets that consist of relevant background information. It has been implicitly assumed by those in the field that if adversarial background information is utilized in this context, that the success of using a RAG-based approach would be nonexistent or even negatively impact the results. To address this assumption, we tested several open-source LLMs on the ability of RAG to improve their success in answering multiple-choice questions (MCQ) in the medical subspecialty field of Nephrology. Unlike previous studies, we examined the effect of RAG in utilizing both relevant and adversarial background databases. We set up several open-source LLMs, including Llama 3, Phi-3, Mixtral 8x7b, Zephyr$\beta$, and Gemma 7B Instruct, in a zero-shot RAG pipeline. As adversarial sources of information, text from the Bible and a Random Words generated database were used for comparison. Our data show that most of the open-source LLMs improve their multiple-choice test-taking success as expected when incorporating relevant information vector databases. Surprisingly however, adversarial Bible text significantly improved the success of many LLMs and even random word text improved test taking ability of some of the models. In summary, our results demonstrate for the first time the countertintuitive ability of adversarial information datasets to improve the RAG-based LLM success., Comment: 24 pages, 3 figures, 11 tables
Published: 2024

36. Optimizing Query Generation for Enhanced Document Retrieval in RAG

Author: Koo, Hamin, Kim, Minseon, and Hwang, Sung Ju
Subjects: Computer Science - Information Retrieval
Abstract: Large Language Models (LLMs) excel in various language tasks but they often generate incorrect information, a phenomenon known as "hallucinations". Retrieval-Augmented Generation (RAG) aims to mitigate this by using document retrieval for accurate responses. However, RAG still faces hallucinations due to vague queries. This study aims to improve RAG by optimizing query generation with a query-document alignment score, refining queries using LLMs for better precision and efficiency of document retrieval. Experiments have shown that our approach improves document retrieval, resulting in an average accuracy gain of 1.6%.
Published: 2024

37. Litmus tests of the flat $\Lambda$CDM model and model-independent measurement of $H_0r_\mathrm{d}$ with LSST and DESI

Author: L'Huillier, Benjamin, Mitra, Ayan, Shafieloo, Arman, Keeley, Ryan E., and Koo, Hanwool
Subjects: Astrophysics - Cosmology and Nongalactic Astrophysics, General Relativity and Quantum Cosmology
Abstract: In this analysis we apply a model-independent framework to test the flat $\Lambda$CDM cosmology using simulated SNIa data from the upcoming Legacy Survey of Space and Time (LSST) and combined with simulated Dark Energy Spectroscopic Instrument (DESI) five-years Baryon Acoustic Oscillations (BAO) data. We adopt an iterative smoothing technique to reconstruct the expansion history from SNIa data, which, when combined with BAO measurements, facilitates a comprehensive test of the Universe's curvature and the nature of dark energy. The analysis is conducted under three different mock true cosmologies: a flat $\Lambda$CDM universe, a universe with a notable curvature ($\Omega_{k,0} = 0.1$), and one with dynamically evolving dark energy. Each cosmology demonstrates different kinds and varying degrees of deviation from the standard model predictions. We forecast that our reconstruction technique can constrain cosmological parameters, such as the curvature ($\Omega_{k,0}$) and $c/H_0 r_\mathrm{d}$, with a precision of approximately 0.5\% for $c/H_0r_\mathrm{d}$ and 0.04 for $\Omega_{k,0}$, competitive with current cosmic microwave background constraints, without assuming any form of dark energy.
Published: 2024

38. Automata-based constraints for language model decoding

Author: Koo, Terry, Liu, Frederick, and He, Luheng
Subjects: Computer Science - Computation and Language, Computer Science - Formal Languages and Automata Theory
Abstract: Language models (LMs) are often expected to generate strings in some formal language; for example, structured data, API calls, or code snippets. Although LMs can be tuned to improve their adherence to formal syntax, this does not guarantee conformance, especially with smaller LMs suitable for large-scale deployment. In addition, tuning requires significant resources, making it impractical for uncommon or task-specific formats. To prevent downstream parsing errors we would ideally constrain the LM to only produce valid output, but this is severely complicated by tokenization, which is typically both ambiguous and misaligned with the formal grammar. We solve these issues through the application of automata theory, deriving an efficient closed-form solution for the regular languages, a broad class of formal languages with many practical applications, including API calls or schema-guided JSON and YAML. We also discuss pragmatic extensions for coping with the issue of high branching factor, and extend our techniques to deterministic context-free languages, which similarly admit an efficient closed-form solution. Previous work on this topic (Willard and Louf, 2023) layers bespoke solutions onto automata, leading to problems with speed, correctness, and extensibility. Instead, we reformulate the entire task in terms of automata so we can leverage well-studied and well-optimized algorithms. Our system compiles constraints ~7,000x faster, is provably correct, and can be extended in a modular fashion., Comment: COLM 2024 Camera-ready version, responding to feedback from reviewers
Published: 2024

39. Balancing Operator's Risk Averseness in Model Predictive Control of a Reservoir System

Author: Koo, Ja-Ho, Abraham, Edo, Jonoski, Andreja, and Solomatine, Dimitri P.
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: Model Predictive Control (MPC) is an optimal control strategy suited for flood control of water resources infrastructure. Despite many studies on reservoir flood control and their theoretical contribution, optimisation methodologies have not been widely applied in real-time operation due to disparities between research assumptions and practical requirements. First, tacit objectives such as minimising the magnitude and frequency of changes in the existing outflow schedule are considered important in practice, but these are nonlinear and challenging to formulate to suit all conditions. Incorporating these objectives transforms the problem into a multi-objective nonlinear optimisation problem that is difficult to solve online. Second, it is reasonable to assume that the weights and parameters are not stationary because the preference varies depending on the state of the system. To overcome these limitations, we propose a framework that converts the original intractable problem into parameterized linear MPC problems with dynamic optimisation of weights and parameters. This is done by introducing a model-based learning concept under the assumption of the dynamic nature of the operator's preference. We refer to this framework as Parameterised Dynamic MPC (PD-MPC). The effectiveness of this framework is demonstrated through a numerical experiment for the Daecheong multipurpose reservoir in South Korea. We find that PD-MPC outperforms `standard' MPC-based designs without a dynamic optimisation process under the same uncertain inflows.
Published: 2024

40. Microreserves are an important tool for amphibian conservation.

Author: Steigerwald, Emma, Chen, Julia, Oshiro, Julianne, Vredenburg, Vance, Catenazzi, Alessandro, and Koo, Michelle
Subjects: Animals, Amphibians, Conservation of Natural Resources, Biodiversity, Endangered Species
Abstract: Initiatives to protect 30% of Earth by 2030 prompt evaluation of how to efficiently target shortcomings in the global protected area (PA) network. Focusing on amphibians, the most vulnerable vertebrate class, we illustrate the conservation value of microreserves, a term we employ here to refer to reserves of
Published: 2024

41. Treating Chronic Pruritus: Are We at the Threshold of a Breakthrough?

Author: Smith, Payton, Kranyak, Allison, Johnson, Chandler, Haran, Kathryn, Liao, Wilson, Bhutani, Tina, and Koo, John
Abstract: Chronic pruritis, characterized by persistent itchiness lasting more than six weeks, affects up to 15% of the population, significantly impairing quality of life. Despite its prevalence and impact, there is an absence of FDA-approved medications specifically for the treatment of chronic pruritus, highlighting a significant unmet need in dermatology. Advancements in dermatologic medications, however, including the development of biologics and Janus kinase (JAK) inhibitors, signal potential breakthroughs in pruritus management through a radically different mechanism of action that focuses on their effect on the nervous system. Currently, the most commonly utilized treatments for pruritis are sedating antihistamines, which have been largely ineffective for non-histamine-induced itch, underscoring the necessity for novel approaches. This editorial reviews key studies and clinical trials with a particular focus on cases of prurigo nodularis, where itch serves as the primary pathology rather than just a symptom. The effectiveness of dupilumab in phase III trials for treating prurigo nodularis, independent of its effects on dermatitis or atopic background, alongside the success of JAK inhibitors in managing chronic idiopathic pruritus, indicates a shift towards therapies that directly and specifically target itch nerve pathways instead of indirectly via immune system modulation or sedation. These developments suggest that significant progress may be on the horizon for treating chronic itch, providing hope for those suffering from pruritis, the number one cause of misery in dermatology.
Published: 2024

42. Graph Neural Network as Computationally Efficient Emulator of Ice-sheet and Sea-level System Model (ISSM)

Author: Koo, Younghyun and Rahnemoonfar, Maryam
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Mathematics - Numerical Analysis
Abstract: The Ice-sheet and Sea-level System Model (ISSM) provides solutions for Stokes equations relevant to ice sheet dynamics by employing finite element and fine mesh adaption. However, since its finite element method is compatible only with Central Processing Units (CPU), the ISSM has limits on further economizing computational time. Thus, by taking advantage of Graphics Processing Units (GPUs), we design a graph convolutional network (GCN) as a fast emulator for ISSM. The GCN is trained and tested using the 20-year transient ISSM simulations in the Pine Island Glacier (PIG). The GCN reproduces ice thickness and velocity with a correlation coefficient greater than 0.998, outperforming the traditional convolutional neural network (CNN). Additionally, GCN shows 34 times faster computational speed than the CPU-based ISSM modeling. The GPU-based GCN emulator allows us to predict how the PIG will change in the future under different melting rate scenarios with high fidelity and much faster computational time., Comment: 5 pages, 4 figures, submitted to the 2024 IEEE International Geoscience and Remote Sensing Symposium. arXiv admin note: text overlap with arXiv:2402.05291
Published: 2024

43. Graph Neural Networks for Emulation of Finite-Element Ice Dynamics in Greenland and Antarctic Ice Sheets

Author: Koo, Younghyun and Rahnemoonfar, Maryam
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Mathematics - Numerical Analysis
Abstract: Although numerical models provide accurate solutions for ice sheet dynamics based on physics laws, they accompany intensified computational demands to solve partial differential equations. In recent years, convolutional neural networks (CNNs) have been widely used as statistical emulators for those numerical models. However, since CNNs operate on regular grids, they cannot represent the refined meshes and computational efficiency of finite-element numerical models. Therefore, instead of CNNs, this study adopts an equivariant graph convolutional network (EGCN) as an emulator for the ice sheet dynamics modeling. EGCN reproduces ice thickness and velocity changes in the Helheim Glacier, Greenland, and Pine Island Glacier, Antarctica, with 260 times and 44 times faster computation time, respectively. Compared to the traditional CNN and graph convolutional network, EGCN shows outstanding accuracy in thickness prediction near fast ice streams by preserving the equivariance to the translation and rotation of graphs., Comment: 6 pages, 2 figures, submitted to the ICML 2024 Workshop on Machine Learning for Earth System Modeling
Published: 2024

44. Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object Poses

Author: Yoo, Seungwoo, Koo, Juil, Yeo, Kyeongmin, and Sung, Minhyuk
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics
Abstract: We propose a novel method for learning representations of poses for 3D deformable objects, which specializes in 1) disentangling pose information from the object's identity, 2) facilitating the learning of pose variations, and 3) transferring pose information to other object identities. Based on these properties, our method enables the generation of 3D deformable objects with diversity in both identities and poses, using variations of a single object. It does not require explicit shape parameterization such as skeletons or joints, point-level or shape-level correspondence supervision, or variations of the target object for pose transfer. To achieve pose disentanglement, compactness for generative models, and transferability, we first design the pose extractor to represent the pose as a keypoint-based hybrid representation and the pose applier to learn an implicit deformation field. To better distill pose information from the object's geometry, we propose the implicit pose applier to output an intrinsic mesh property, the face Jacobian. Once the extracted pose information is transferred to the target object, the pose applier is fine-tuned in a self-supervised manner to better describe the target object's shapes with pose variations. The extracted poses are also used to train a cascaded diffusion model to enable the generation of novel poses. Our experiments with the DeformThings4D and Human datasets demonstrate state-of-the-art performance in pose transfer and the ability to generate diverse deformed shapes with various objects and poses., Comment: NeurIPS 2024
Published: 2024

45. Understanding Pedestrian Movement Using Urban Sensing Technologies: The Promise of Audio-based Sensors

Author: Han, Chaeyeon, Seshadri, Pavan, Ding, Yiwei, Posner, Noah, Koo, Bon Woo, Agrawal, Animesh, Lerch, Alexander, and Guhathakurta, Subhrajit
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Multimedia, Computer Science - Sound
Abstract: While various sensors have been deployed to monitor vehicular flows, sensing pedestrian movement is still nascent. Yet walking is a significant mode of travel in many cities, especially those in Europe, Africa, and Asia. Understanding pedestrian volumes and flows is essential for designing safer and more attractive pedestrian infrastructure and for controlling periodic overcrowding. This study discusses a new approach to scale up urban sensing of people with the help of novel audio-based technology. It assesses the benefits and limitations of microphone-based sensors as compared to other forms of pedestrian sensing. A large-scale dataset called ASPED is presented, which includes high-quality audio recordings along with video recordings used for labeling the pedestrian count data. The baseline analyses highlight the promise of using audio sensors for pedestrian tracking, although algorithmic and technological improvements to make the sensors practically usable continue. This study also demonstrates how the data can be leveraged to predict pedestrian trajectories. Finally, it discusses the use cases and scenarios where audio-based pedestrian sensing can support better urban and transportation planning., Comment: submitted to Urban Informatics
Published: 2024

46. Functional voxel hierarchy and afferent capacity revealed mental state transition on dynamic correlation resting-state fMRI

Author: Lee, Dong Soo, Kim, Hyun Joo, Huh, Youngmin, Kang, Yeon Koo, Whi, Wonseok, Lee, Hyekyoung, and Kang, Hyejin
Subjects: Quantitative Biology - Neurons and Cognition
Abstract: Voxel hierarchy on dynamic brain graphs is produced by k core percolation on functional dynamic amplitude correlation of resting-state fMRI. Directed graphs and their afferent/efferent capacities are produced by Markov modeling of the universal cover of undirected graphs simultaneously with the calculation of volume entropy. Positive and unsigned negative brain graphs were analyzed separately on sliding-window representation to underpin the visualization and quantitation of mental dynamic states with their transitions. Voxel hierarchy animation maps of positive graphs revealed abrupt changes in coreness k and kmaxcore, which we called mental state transitions. Afferent voxel capacities of the positive graphs also revealed transient modules composed of dominating voxels/independent components and their exchanges representing mental state transitions. Animation and quantification plots of voxel hierarchy and afferent capacity corroborated each other in underpinning mental state transitions and afferent module exchange on the positive directed functional connectivity graphs. We propose the use of spatiotemporal trajectories of voxels on positive dynamic graphs to construct hierarchical structures by k core percolation and quantified in- and out-flows of information of voxels by volume entropy/directed graphs to subserve diverse resting mental state transitions on resting-state fMRI graphs in normal human individuals.
Published: 2024

47. MultiPragEval: Multilingual Pragmatic Evaluation of Large Language Models

Author: Park, Dojun, Lee, Jiwoo, Park, Seohyun, Jeong, Hyeyun, Koo, Youngeun, Hwang, Soonha, Park, Seonwoo, and Lee, Sungeun
Subjects: Computer Science - Computation and Language
Abstract: As the capabilities of Large Language Models (LLMs) expand, it becomes increasingly important to evaluate them beyond basic knowledge assessment, focusing on higher-level language understanding. This study introduces MultiPragEval, the first multilingual pragmatic evaluation of LLMs, designed for English, German, Korean, and Chinese. Comprising 1200 question units categorized according to Grice's Cooperative Principle and its four conversational maxims, MultiPragEval enables an in-depth assessment of LLMs' contextual awareness and their ability to infer implied meanings. Our findings demonstrate that Claude3-Opus significantly outperforms other models in all tested languages, establishing a state-of-the-art in the field. Among open-source models, Solar-10.7B and Qwen1.5-14B emerge as strong competitors. By analyzing pragmatic inference, we provide valuable insights into the capabilities essential for advanced language comprehension in AI systems., Comment: The 2nd GenBench workshop on generalisation (benchmarking) in NLP
Published: 2024

48. MR-RawNet: Speaker verification system with multiple temporal resolutions for variable duration utterances using raw waveforms

Author: Kim, Seung-bin, Lim, Chan-yeong, Heo, Jungwoo, Kim, Ju-ho, Shin, Hyun-seo, Koo, Kyo-Won, and Yu, Ha-Jin
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Artificial Intelligence
Abstract: In speaker verification systems, the utilization of short utterances presents a persistent challenge, leading to performance degradation primarily due to insufficient phonetic information to characterize the speakers. To overcome this obstacle, we propose a novel structure, MR-RawNet, designed to enhance the robustness of speaker verification systems against variable duration utterances using raw waveforms. The MR-RawNet extracts time-frequency representations from raw waveforms via a multi-resolution feature extractor that optimally adjusts both temporal and spectral resolutions simultaneously. Furthermore, we apply a multi-resolution attention block that focuses on diverse and extensive temporal contexts, ensuring robustness against changes in utterance length. The experimental results, conducted on VoxCeleb1 dataset, demonstrate that the MR-RawNet exhibits superior performance in handling utterances of variable duration compared to other raw waveform-based systems., Comment: 5 pages, accepted by Interspeech 2024
Published: 2024

49. FRAG: Frequency Adapting Group for Diffusion Video Editing

Author: Yoon, Sunjae, Koo, Gwanhyeong, Kim, Geonwoo, and Yoo, Chang D.
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In video editing, the hallmark of a quality edit lies in its consistent and unobtrusive adjustment. Modification, when integrated, must be smooth and subtle, preserving the natural flow and aligning seamlessly with the original vision. Therefore, our primary focus is on overcoming the current challenges in high quality edit to ensure that each edit enhances the final product without disrupting its intended essence. However, quality deterioration such as blurring and flickering is routinely observed in recent diffusion video editing systems. We confirm that this deterioration often stems from high-frequency leak: the diffusion model fails to accurately synthesize high-frequency components during denoising process. To this end, we devise Frequency Adapting Group (FRAG) which enhances the video quality in terms of consistency and fidelity by introducing a novel receptive field branch to preserve high-frequency components during the denoising process. FRAG is performed in a model-agnostic manner without additional training and validates the effectiveness on video editing benchmarks (i.e., TGVE, DAVIS)., Comment: 16 pages, 16 figures, ICML 2024
Published: 2024

50. Shockingly Bright Warm Carbon Monoxide Molecular Features in the Supernova Remnant Cassiopeia A Revealed by JWST

Author: Rho, J., Park, S. -H., Arendt, R., Matsuura, M., Milisavljevic, D., Temim, T., De Looze, I., Blair, W. P., Rest, A., Fox, O., Ravi, A. P., Koo, B. -C., Barlow, M., Burrows, A., Chevalier, R., Clayton, G., Fesen, R., Fransson, C., Fryer, C., Gomez, H. L., Janka, H. -T., Kirchschlarger, F., Laming, J. M., Orlando, S., Patnaude, D., Pavlov, G., Plucinsky, P., Posselt, B., Priestley, F., Raymond, J., Sartorio, N., Schmidt, F., Slane, P., Smith, N., Sravan, N., Vink, J., Weil, K., Wheeler, J., and Yoon, S. C.
Subjects: Astrophysics - Astrophysics of Galaxies, Astrophysics - High Energy Astrophysical Phenomena, Astrophysics - Solar and Stellar Astrophysics
Abstract: We present JWST NIRCam (F356W and F444W filters) and MIRI (F770W) images and NIRSpec- IFU spectroscopy of the young supernova remnant Cassiopeia A (Cas A). We obtained the data as part of a JWST survey of Cas A. The NIRCam and MIRI images map the spatial distributions of synchrotron radiation, Ar-rich ejecta, and CO on both large and small scales, revealing remarkably complex structures. The CO emission is stronger at the outer layers than the Ar ejecta, which indicates the reformation of CO molecules behind the reverse shock. NIRSpec-IFU spectra (3 - 5.5 microns) were obtained toward two representative knots in the NE and S fields. Both regions are dominated by the bright fundamental rovibrational band of CO in the two R and P branches, with strong [Ar VI] and relatively weaker, variable strength ejecta lines of [Si IX], [Ca IV], [Ca V] and [Mg IV]. The NIRSpec-IFU data resolve individual ejecta knots and filaments spatially and in velocity space. The fundamental CO band in the JWST spectra reveals unique shapes of CO, showing a few tens of sinusoidal patterns of rovibrational lines with pseudo-continuum underneath, which is attributed to the high-velocity widths of CO lines. The CO also shows high J lines at different vibrational transitions. Our results with LTE modeling of CO emission indicate a temperature of 1080 K and provide unique insight into the correlations between dust, molecules, and highly ionized ejecta in supernovae, and have strong ramifications for modeling dust formation that is led by CO cooling in the early Universe., Comment: accepted for the ApJ letter (17 pages and 10 figures)
Published: 2024

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

185,630 results on '"A. Koo"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources