Author: "Huang, Yukun" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Huang, Yukun"' showing total 738 results

1. Enhancing Large Language Models' Situated Faithfulness to External Contexts

Author: Huang, Yukun, Chen, Sanxing, Cai, Hongyi, and Dhingra, Bhuwan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Large Language Models (LLMs) are often augmented with external information as contexts, but this external information can sometimes be inaccurate or even intentionally misleading. We argue that robust LLMs should demonstrate situated faithfulness, dynamically calibrating their trust in external information based on their confidence in the internal knowledge and the external context. To benchmark this capability, we evaluate LLMs across several QA datasets, including a newly created dataset called RedditQA featuring in-the-wild incorrect contexts sourced from Reddit posts. We show that when provided with both correct and incorrect contexts, both open-source and proprietary models tend to overly rely on external information, regardless of its factual accuracy. To enhance situated faithfulness, we propose two approaches: Self-Guided Confidence Reasoning (SCR) and Rule-Based Confidence Reasoning (RCR). SCR enables models to self-assess the confidence of external information relative to their own internal knowledge to produce the most accurate answer. RCR, in contrast, extracts explicit confidence signals from the LLM and determines the final answer using predefined rules. Our results show that for LLMs with strong reasoning capabilities, such as GPT-4o and GPT-4o mini, SCR outperforms RCR, achieving improvements of up to 24.2% over a direct input augmentation baseline. Conversely, for a smaller model like Llama-3-8B, RCR outperforms SCR. Fine-tuning SCR with our proposed Confidence Reasoning Direct Preference Optimization (CR-DPO) method improves performance on both seen and unseen datasets, yielding an average improvement of 8.9% on Llama-3-8B. In addition to quantitative results, we offer insights into the relative strengths of SCR and RCR. Our findings highlight promising avenues for improving situated faithfulness in LLMs. The data and code are released.
Published: 2024

2. Real-time Fake News from Adversarial Feedback

Author: Chen, Sanxing, Huang, Yukun, and Dhingra, Bhuwan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: We show that existing evaluations for fake news detection based on conventional sources, such as claims on fact-checking websites, result in an increasing accuracy over time for LLM-based detectors -- even after their knowledge cutoffs. This suggests that recent popular political claims, which form the majority of fake news on such sources, are easily classified using surface-level shallow patterns. Instead, we argue that a proper fake news detection dataset should test a model's ability to reason factually about the current world by retrieving and reading related evidence. To this end, we develop a novel pipeline that leverages natural language feedback from a RAG-based detector to iteratively modify real-time news into deceptive fake news that challenges LLMs. Our iterative rewrite decreases the binary classification AUC by an absolute 17.5 percent for a strong RAG GPT-4o detector. Our experiments reveal the important role of RAG in both detecting and generating fake news, as retrieval-free LLM detectors are vulnerable to unseen events and adversarial attacks, while feedback from RAG detection helps discover more deceitful patterns in fake news.
Published: 2024

3. DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion

Author: Huang, Yukun, Wang, Jianan, Zeng, Ailing, Zha, Zheng-Jun, Zhang, Lei, and Liu, Xihui
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, Computer Science - Machine Learning
Abstract: Leveraging pretrained 2D diffusion models and score distillation sampling (SDS), recent methods have shown promising results for text-to-3D avatar generation. However, generating high-quality 3D avatars capable of expressive animation remains challenging. In this work, we present DreamWaltz-G, a novel learning framework for animatable 3D avatar generation from text. The core of this framework lies in Skeleton-guided Score Distillation and Hybrid 3D Gaussian Avatar representation. Specifically, the proposed skeleton-guided score distillation integrates skeleton controls from 3D human templates into 2D diffusion models, enhancing the consistency of SDS supervision in terms of view and human pose. This facilitates the generation of high-quality avatars, mitigating issues such as multiple faces, extra limbs, and blurring. The proposed hybrid 3D Gaussian avatar representation builds on the efficient 3D Gaussians, combining neural implicit fields and parameterized 3D meshes to enable real-time rendering, stable SDS optimization, and expressive animation. Extensive experiments demonstrate that DreamWaltz-G is highly effective in generating and animating 3D avatars, outperforming existing methods in both visual quality and animation expressiveness. Our framework further supports diverse applications, including human video reenactment and multi-subject scene composition.
Comment: Project page: https://yukun-huang.github.io/DreamWaltz-G/
Published: 2024

4. Dynamics of Binary Planets within Star Clusters

Author: Huang, Yukun, Zhu, Wei, and Kokubo, Eiichiro
Subjects: Astrophysics - Earth and Planetary Astrophysics, Astrophysics - Solar and Stellar Astrophysics
Abstract: We develop analytical tools and perform three-body simulations to investigate the orbital evolution and dynamical stability of binary planets within star clusters. Our analytical results show that the orbital stability of a planetary-mass binary against passing stars is mainly related to its orbital period. Critical flybys, defined as stellar encounters with energy kicks comparable to the binary binding energy, can efficiently produce a wide range of semimajor axes ($a$) and eccentricities ($e$) from a dominant population of primordially tight JuMBOs. The critical flyby criterion we derived offers an improvement over the commonly used tidal radius criterion, particularly in high-speed stellar encounters. Applying our results to the recently discovered Jupiter-Mass Binary Objects (JuMBOs) by the James Webb Space Telescope (JWST), our simulations suggest that to match the observed $\sim$9% wide binary fraction, an initial semimajor axis of $a_0 \sim$ 10-20 au and a density-weighted residence time of $\chi \gtrsim 10^4$ Myr pc$^{-3}$ are favored. These results imply that the JWST JuMBOs probably formed as tight binaries near the cluster core.
Comment: 11 pages, 4 figures, published in ApJL
Published: 2024

5. Asteroid Kamo`oalewa's journey from the lunar Giordano Bruno crater to Earth 1:1 resonance

Author: Jiao, Yifei, Cheng, Bin, Huang, Yukun, Asphaug, Erik, Gladman, Brett, Malhotra, Renu, Michel, Patrick, Yu, Yang, and Baoyin, Hexi
Subjects: Astrophysics - Earth and Planetary Astrophysics
Abstract: Among the nearly 30,000 known near-Earth asteroids (NEAs), only tens of them possess Earth co-orbital characteristics with semi-major axes $\sim$1 au. In particular, 469219 Kamo`oalewa (2016 HO3), upcoming target of China's Tianwen-2 asteroid sampling mission, exhibits a meta-stable 1:1 mean-motion resonance with Earth. Intriguingly, recent ground-based observations show that Kamo`oalewa has spectroscopic characteristics similar to space-weathered lunar silicates, hinting at a lunar origin instead of an asteroidal one like the vast majority of NEAs. Here we use numerical simulations to demonstrate that Kamo`oalewa's physical and orbital properties are compatible with a fragment from a crater larger than 10--20 km formed on the Moon in the last few million years. The impact could have ejected sufficiently large fragments into heliocentric orbits, some of which could be transferred to Earth 1:1 resonance and persist today. This leads us to suggest the young lunar crater Giordano Bruno (22 km diameter, 1--10 Ma age) as the most likely source, linking a specific asteroid in space to its source crater on the Moon. The hypothesis will be tested by the Tianwen-2 mission when it returns a sample of Kamo`oalewa. And the upcoming NEO Surveyor mission will possibly help us to identify such a lunar-derived NEA population.
Comment: 29 pages, 4 figures. Published in Nature Astronomy, 19 April 2024
Published: 2024

6. Atomic Self-Consistency for Better Long Form Generations

Author: Thirukovalluru, Raghuveer, Huang, Yukun, and Dhingra, Bhuwan
Subjects: Computer Science - Computation and Language
Abstract: Recent work has aimed to improve LLM generations by filtering out hallucinations, thereby improving the precision of the information in responses. Correctness of a long-form response, however, also depends on the recall of multiple pieces of information relevant to the question. In this paper, we introduce Atomic Self-Consistency (ASC), a technique for improving the recall of relevant information in an LLM response. ASC follows recent work, Universal Self-Consistency (USC), in using multiple stochastic samples from an LLM to improve the long-form response. Unlike USC, which only focuses on selecting the best single generation, ASC picks authentic subparts from the samples and merges them into a superior composite answer. Through extensive experiments and ablations, we show that merging relevant subparts of multiple samples performs significantly better than picking a single sample. ASC demonstrates significant gains over USC on multiple factoids and open-ended QA datasets - ASQA, QAMPARI, QUEST, ELI5 with ChatGPT and Llama2. Our analysis also reveals untapped potential for enhancing long-form generations using the approach of merging multiple samples.
Comment: 12 pages
Published: 2024

7. Results from the autoPET challenge on fully automated lesion segmentation in oncologic PET/CT imaging

Author: Gatidis, Sergios, Früh, Marcel, Fabritius, Matthias P., Gu, Sijing, Nikolaou, Konstantin, Fougère, Christian La, Ye, Jin, He, Junjun, Peng, Yige, Bi, Lei, Ma, Jun, Wang, Bo, Zhang, Jia, Huang, Yukun, Heiliger, Lars, Marinov, Zdravko, Stiefelhagen, Rainer, Egger, Jan, Kleesiek, Jens, Sibille, Ludovic, Xiang, Lei, Bendazzoli, Simone, Astaraki, Mehdi, Ingrisch, Michael, Cyran, Clemens C., and Küstner, Thomas
Published: 2024

8. Comparison of the Accuracy of a Deep Learning Method for Lesion Detection in PET/CT and PET/MRI Images

Author: Pang, Lifang, Zhang, Zheng, Liu, Guobing, Hu, Pengcheng, Chen, Shuguang, Gu, Yushen, Huang, Yukun, Zhang, Jia, Shi, Yuhang, Cao, Tuoyu, Zhang, Yiqiu, and Shi, Hongcheng
Published: 2024

9. Calibrating Long-form Generations from Large Language Models

Author: Huang, Yukun, Liu, Yixin, Thirukovalluru, Raghuveer, Cohan, Arman, and Dhingra, Bhuwan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: To enhance Large Language Models' (LLMs) reliability, calibration is essential -- the model's assessed confidence scores should align with the actual likelihood of its responses being correct. However, current confidence elicitation methods and calibration metrics typically rely on a binary true/false assessment of response correctness. This approach does not apply to long-form generation, where an answer can be partially correct. Addressing this gap, we introduce a unified calibration framework, in which both the correctness of the LLMs' responses and their associated confidence levels are treated as distributions across a range of scores. Within this framework, we develop three metrics to precisely evaluate LLM calibration and further propose two confidence elicitation methods based on self-consistency and self-evaluation. Our experiments, which include long-form QA and summarization tasks, demonstrate that larger models don't necessarily guarantee better calibration, that calibration performance is found to be metric-dependent, and that self-consistency methods excel in factoid datasets. We also find that calibration can be enhanced through techniques such as fine-tuning, integrating relevant source documents, scaling the temperature, and combining self-consistency with self-evaluation. Lastly, we showcase a practical application of our system: selecting and cascading open-source models and ChatGPT to optimize correctness given a limited API budget. This research not only challenges existing notions of LLM calibration but also offers practical methodologies for improving trustworthiness in long-form generation.
Published: 2024

10. DreamComposer: Controllable 3D Object Generation via Multi-View Conditions

Author: Yang, Yunhan, Huang, Yukun, Wu, Xiaoyang, Guo, Yuan-Chen, Zhang, Song-Hai, Zhao, Hengshuang, He, Tong, and Liu, Xihui
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Utilizing pre-trained 2D large-scale generative models, recent works are capable of generating high-quality novel views from a single in-the-wild image. However, due to the lack of information from multiple views, these works encounter difficulties in generating controllable novel views. In this paper, we present DreamComposer, a flexible and scalable framework that can enhance existing view-aware diffusion models by injecting multi-view conditions. Specifically, DreamComposer first uses a view-aware 3D lifting module to obtain 3D representations of an object from multiple views. Then, it renders the latent features of the target view from 3D representations with the multi-view feature fusion module. Finally, the target view features extracted from multi-view inputs are injected into a pre-trained diffusion model. Experiments show that DreamComposer is compatible with state-of-the-art diffusion models for zero-shot novel view synthesis, further enhancing them to generate high-fidelity novel view images with multi-view conditions, ready for controllable 3D object reconstruction and various other applications.
Comment: Project Page: https://yhyang-myron.github.io/DreamComposer/
Published: 2023

11. Preparation and Adsorption Performance of Polyvinyl Chloride/Polyacrylonitrile Blended Lithium Ion-Sieve Membrane

Author: Cao, Yijun, Cao, Zan, Liu, Jiang, Huang, Yukun, and Wang, Long
Published: 2024

12. Primordial Orbital Alignment of Sednoids

Author: Huang, Yukun and Gladman, Brett
Subjects: Astrophysics - Earth and Planetary Astrophysics
Abstract: We examined the past history of the three most detached TransNeptunian Objects (TNOs) -- Sedna, 2012 VP113, and Leleakuhonua (2015 TG387) -- the three clearest members of the dynamical class known as sednoids, with high perihelia distances $q$. By integrating backward their nominal (and a set of cloned) orbits for the Solar System's age, we surprisingly find that the only time all their apsidal lines tightly cluster was 4.5 Gyr ago, at perihelion longitude $\varpi$ of $200^\circ$. This "primordial alignment" is independent of the observational biases that contribute to the current on-sky clustering in the large-semimajor axis Kuiper Belt. If future sednoid discoveries confirm these findings, this strongly argues for an initial event during the planet formation epoch which imprinted this particular apsidal orientation on the early detached TNO population. Their apsidal orientations were then subsequently modified only by the simple precession from the 4 giant planets (and weakly by the galactic tide). If other sednoids also cluster around the same primordial value, various models suggesting a still present planet in the outer Solar System would be incompatible with this alignment. We inspected two scenarios that could potentially explain the primordial alignment. First, a rogue planet model (where another massive planet raises perihelia near its own longitude until ejection) naturally produces this signature. Alternatively, a close stellar passage early in Solar System history raises perihelia, but it is poor at creating strong apsidal clustering. We show that all other known $35
Published: 2023

13. Alexpaca: Learning Factual Clarification Question Generation Without Examples

Author: Toles, Matthew, Huang, Yukun, Yu, Zhou, and Gravano, Luis
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Real-life tasks such as giving legal or technical advice often lack complete context at the outset and can have disparate answers depending thereon. The ability to derive missing factual information by asking clarifying questions (ACQ) is an important element of real-life collaboration on such reasoning tasks. Existing factual clarification question challenges evaluate generations based on word overlap or human evaluations. Recent work explores generating a response to the clarifying question and then evaluating its utility directly. So far, these tasks are limited to disambiguating the user's intent rather than concrete facts about the situation. The factual domain presents unique challenges since responses to clarification questions must be factually true for accurate evaluation. To enable evaluation of factual domain clarification question generation, we present a new task that focuses on the ability to elicit missing information in multi-hop reasoning tasks. The task, HotpotQA-FLM, can be evaluated automatically, making it convenient for benchmarking language models. We observe that humans outperform GPT-4 by a large margin, while Llama 3 8B Instruct does not even beat the dummy baseline in some metrics. Finally, we find that by fine-tuning Llama 3 8B Instruct on its own generations, filtered via rejection sampling, we can improve information recovery by 27.6 percent.
Published: 2023

14. TOSS: High-quality Text-guided Novel View Synthesis from a Single Image

Author: Shi, Yukai, Wang, Jianan, Cao, He, Tang, Boshi, Qi, Xianbiao, Yang, Tianyu, Huang, Yukun, Liu, Shilong, Zhang, Lei, and Shum, Heung-Yeung
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we present TOSS, which introduces text to the task of novel view synthesis (NVS) from just a single RGB image. While Zero-1-to-3 has demonstrated impressive zero-shot open-set NVS capability, it treats NVS as a pure image-to-image translation problem. This approach suffers from the challengingly under-constrained nature of single-view NVS: the process lacks means of explicit user control and often results in implausible NVS generations. To address this limitation, TOSS uses text as high-level semantic information to constrain the NVS solution space. TOSS fine-tunes text-to-image Stable Diffusion pre-trained on large-scale text-image pairs and introduces modules specifically tailored to image and camera pose conditioning, as well as dedicated training for pose correctness and preservation of fine details. Comprehensive experiments are conducted with results showing that our proposed TOSS outperforms Zero-1-to-3 with more plausible, controllable and multiview-consistent NVS results. We further support these results with comprehensive ablations that underscore the effectiveness and potential of the introduced semantic guidance and architecture design.
Published: 2023

15. OSSOS. XXIX. The Population and Perihelion Distribution of the Detached Kuiper Belt

Author: Beaudoin, Matthew, Gladman, Brett, Huang, Yukun, Bannister, Michele, Kavelaars, J. J., Petit, Jean-Marc, and Volk, Kathryn
Subjects: Astrophysics - Earth and Planetary Astrophysics
Abstract: The detached transneptunian objects (TNOs) are those with semimajor axes beyond the 2:1 resonance with Neptune, which are neither resonant nor scattering. Using the detached sample from the OSSOS telescopic survey, we produce the first studies of their orbital distribution based on matching the orbits and numbers of the known TNOs after accounting for survey biases. We show that the detached TNO perihelion ($q$) distribution cannot be uniform, but is instead better matched by two uniform components with a break near $q\approx40$ au. We produce parametric two-component models that are not rejectable by the OSSOS data set, and estimate that there are $36,\!000^{+12,000}_{-9,000}$ detached TNOs with absolute magnitudes $H_r < 8.66$ ($D \gtrsim 100$ km) and semimajor axes $48 < a < 250$ au (95% confidence limits). Although we believe these heuristic two-parameter models yield a correct population estimate, we then use the same methods to show that the perihelion distribution of a detached disk created by a simulated rogue planet matches the $q$ distribution even better, suggesting that the temporary presence of other planets in the early Solar System is a promising model to create today's large semimajor axis TNO population. This numerical model results in a detached TNO population estimate of $48,\!000^{+15,000}_{-12,000}$. Because this illustrates how difficult-to-detect $q>50$ au objects are likely present, we conclude that there are $(5 \pm 2)\times10^4$ dynamically detached TNOs, which are thus roughly twice as numerous as the entire transneptunian hot main belt.
Comment: Accepted for publication in The Planetary Science Journal. 16 pages, 8 figures
Published: 2023

16. DreamTime: An Improved Optimization Strategy for Diffusion-Guided 3D Generation

Author: Huang, Yukun, Wang, Jianan, Shi, Yukai, Tang, Boshi, Qi, Xianbiao, and Zhang, Lei
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, Computer Science - Machine Learning
Abstract: Text-to-image diffusion models pre-trained on billions of image-text pairs have recently enabled 3D content creation by optimizing a randomly initialized differentiable 3D representation with score distillation. However, the optimization process suffers from slow convergence and the resultant 3D models often exhibit two limitations: (a) quality concerns such as missing attributes and distorted shape and texture; (b) extremely low diversity compared to text-guided image synthesis. In this paper, we show that the conflict between the 3D optimization process and uniform timestep sampling in score distillation is the main reason for these limitations. To resolve this conflict, we propose to prioritize timestep sampling with monotonically non-increasing functions, which aligns the 3D optimization process with the sampling process of the diffusion model. Extensive experiments show that our simple redesign significantly improves 3D content creation with faster convergence, better quality and diversity.
Comment: ICLR 2024
Published: 2023

17. Extraction and recycling technologies of cobalt from primary and secondary resources: A comprehensive review

Author: Huang, Yukun, Chen, Pengxu, Shu, Xuanzhao, Fu, Biao, Peng, Weijun, Liu, Jiang, Cao, Yijun, and Zhu, Xiaofeng
Published: 2024

18. Intracerebral fate of organic and inorganic nanoparticles is dependent on microglial extracellular vesicle function

Author: Gao, Jinchao, Song, Qingxiang, Gu, Xiao, Jiang, Gan, Huang, Jialin, Tang, Yuyun, Yu, Renhe, Wang, Antian, Huang, Yukun, Zheng, Gang, Chen, Hongzhuan, and Gao, Xiaoling
Published: 2024

19. DreamWaltz: Make a Scene with Complex 3D Animatable Avatars

Author: Huang, Yukun, Wang, Jianan, Zeng, Ailing, Cao, He, Qi, Xianbiao, Shi, Yukai, Zha, Zheng-Jun, and Zhang, Lei
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: We present DreamWaltz, a novel framework for generating and animating complex 3D avatars given text guidance and a parametric human body prior. While recent methods have shown encouraging results for text-to-3D generation of common objects, creating high-quality and animatable 3D avatars remains challenging. To create high-quality 3D avatars, DreamWaltz proposes 3D-consistent occlusion-aware Score Distillation Sampling (SDS) to optimize implicit neural representations with canonical poses. It provides view-aligned supervision via 3D-aware skeleton conditioning, which enables complex avatar generation without artifacts and multiple faces. For animation, our method learns an animatable 3D avatar representation from abundant image priors of a diffusion model conditioned on various poses, which can animate complex non-rigged avatars given arbitrary poses without retraining. Extensive evaluations demonstrate that DreamWaltz is an effective and robust approach for creating 3D avatars that can take on complex shapes and appearances as well as novel poses for animation. The proposed framework further enables the creation of complex scenes with diverse compositions, including avatar-avatar, avatar-object and avatar-scene interactions. See https://dreamwaltz3d.github.io/ for more vivid 3D avatar and animation results.
Comment: To appear in NeurIPS 2023; Project page: https://dreamwaltz3d.github.io/
Published: 2023

20. In-context Learning Distillation: Transferring Few-shot Learning Ability of Pre-trained Language Models

Author: Huang, Yukun, Chen, Yanda, Yu, Zhou, and McKeown, Kathleen
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Given the success with in-context learning of large pre-trained language models, we introduce in-context learning distillation to transfer in-context few-shot learning ability from large models to smaller models. We propose to combine in-context learning objectives with language modeling objectives to distill both the ability to read in-context examples and task knowledge to the smaller models. We perform in-context learning distillation under two different few-shot learning paradigms: Meta In-context Tuning (Meta-ICT) and Multitask In-context Tuning (Multitask-ICT). Multitask-ICT performs better on multitask few-shot learning but also requires more computation than Meta-ICT. Our method shows consistent improvements for both Meta-ICT and Multitask-ICT on two benchmarks: LAMA and CrossFit. Our extensive experiments and analysis reveal that in-context learning objectives and language modeling objectives are complementary under the Multitask-ICT paradigm. In-context learning objectives achieve the best performance when combined with language modeling objectives.
Published: 2022

21. Neural Dependencies Emerging from Learning Massive Categories

Author: Feng, Ruili, Zheng, Kecheng, Zhu, Kai, Shen, Yujun, Zhao, Jian, Huang, Yukun, Zhao, Deli, Zhou, Jingren, Jordan, Michael, and Zha, Zheng-Jun
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: This work presents two astonishing findings on neural networks learned for large-scale image classification. 1) Given a well-trained model, the logits predicted for some category can be directly obtained by linearly combining the predictions of a few other categories, which we call neural dependency. 2) Neural dependencies exist not only within a single model, but even between two independently learned models, regardless of their architectures. Towards a theoretical analysis of such phenomena, we demonstrate that identifying neural dependencies is equivalent to solving the Covariance Lasso (CovLasso) regression problem proposed in this paper. Through investigating the properties of the problem solution, we confirm that neural dependency is guaranteed by a redundant logit covariance matrix, which condition is easily met given massive categories, and that neural dependency is highly sparse, implying that one category correlates to only a few others. We further empirically show the potential of neural dependencies in understanding internal data correlations, generalizing models to unseen categories, and improving model robustness with a dependency-derived regularizer. Code for this work will be made publicly available.
Published: 2022

22. A Rogue Planet Helps Populate the Distant Kuiper Belt

Author: Huang, Yukun, Gladman, Brett, Beaudoin, Matthew, and Zhang, Kevin
Subjects: Astrophysics - Earth and Planetary Astrophysics
Abstract: The orbital distribution of transneptunian objects (TNOs) in the distant Kuiper Belt (with semimajor axes beyond the 2:1 resonance, roughly $a$=50-100 au) provides constraints on the dynamical history of the outer solar system. Recent studies show two striking features of this region: 1) a very large population of objects in distant mean-motion resonances with Neptune, and 2) the existence of a substantial detached population (non-resonant objects largely decoupled from Neptune). Neptune migration models are able to implant some resonant and detached objects during the planet migration era, but many fail to match a variety of aspects of the orbital distribution. In this work, we report simulations carried out using an improved version of the GPU-based code GLISSE, following 100,000 test particles per simulation in parallel while handling their planetary close encounters. We demonstrate for the first time that a 2 Earth-mass rogue planet temporarily present during planet formation can abundantly populate both the distant resonances and the detached populations, surprisingly even without planetary migration. We show how weak encounters with the rogue greatly increase the efficiency of filling the resonances, while also dislodging TNOs out of resonance once they reach high perihelia. The rogue's secular gravitational influence simultaneously generates numerous detached objects observed at all semimajor axes. These results suggest that the early presence of additional planet(s) reproduces the observed TNO orbital structure in the distant Kuiper Belt.
Comment: 14 pages, 5 figures. Accepted for publication in ApJ Letters. For associated animated movies, see https://yukunhuang.com/
Published: 2022

23. Whole-Body Lesion Segmentation in 18F-FDG PET/CT

Author: Zhang, Jia, Huang, Yukun, Zhang, Zheng, and Shi, Yuhang
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: There has been growing research interest in using deep-learning-based methods to achieve fully automated segmentation of lesions in positron emission tomography/computed tomography (PET/CT) scans for the prognosis of various cancers. Recent advances in medical image segmentation show that nnUNet is feasible for diverse tasks. However, lesion segmentation in PET images is not straightforward, because lesions and physiological uptake have similar distribution patterns. Distinguishing them requires extra structural information from the CT images. The present paper introduces an nnUNet-based method for the lesion segmentation task. The proposed model is designed on the basis of the joint 2D and 3D nnUNet architecture to predict lesions across the whole body. It allows for automated segmentation of potential lesions. We evaluate the proposed method in the context of the AutoPET Challenge, which measures lesion segmentation performance in terms of Dice score, false-positive volume and false-negative volume.
Published: 2022

24. Rank Diminishing in Deep Neural Networks

Author: Feng, Ruili, Zheng, Kecheng, Huang, Yukun, Zhao, Deli, Jordan, Michael, and Zha, Zheng-Jun
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: The rank of neural networks measures information flowing across layers. It is an instance of a key structural condition that applies across broad domains of machine learning. In particular, the assumption of low-rank feature representations leads to algorithmic developments in many architectures. For neural networks, however, the intrinsic mechanism that yields low-rank structures remains vague and unclear. To fill this gap, we perform a rigorous study on the behavior of network rank, focusing particularly on the notion of rank deficiency. We theoretically establish a universal monotonic decreasing property of network rank from the basic rules of differential and algebraic composition, and uncover rank deficiency of network blocks and deep function coupling. By virtue of our numerical tools, we provide the first empirical analysis of the per-layer behavior of network rank in practical settings, i.e., ResNets, deep MLPs, and Transformers on ImageNet. These empirical results are in direct accord with our theory. Furthermore, we reveal a novel phenomenon of independence deficit caused by the rank deficiency of deep networks, where classification confidence of a given category can be linearly decided by the confidence of a handful of other categories. The theoretical results of this work, together with the empirical findings, may advance understanding of the inherent principles of deep neural networks.
Comment: 31 pages, 12 figures
Published: 2022

25. Learning a Better Initialization for Soft Prompts via Meta-Learning

Author: Huang, Yukun, Qian, Kun, and Yu, Zhou
Subjects: Computer Science - Computation and Language
Abstract: Prompt tuning (PT) is an effective approach to adapting pre-trained language models to downstream tasks. Without a good initialization, prompt tuning doesn't perform well under few-shot settings. So pre-trained prompt tuning (PPT) is proposed to initialize prompts by leveraging pre-training data. We propose MetaPT (Meta-learned Prompt Tuning) to further improve PPT's initialization by considering latent structure within the pre-training data. Specifically, we introduce the structure by first clustering pre-training data into different auxiliary tasks with unsupervised methods. Then we use these tasks to pre-train prompts with a meta-learning algorithm. Such a process can make prompts learn a better initialization by discovering commonalities among these auxiliary tasks. We evaluate our method on seven downstream tasks. Our MetaPT achieves better and more stable performance than the state-of-the-art method.
Published: 2022

26. Efficient selective flotation separation of talc and molybdenite with a novel molybdenite depressant: Remarkable performance and mechanism

Author: Qi, Mengyao, Luo, Zhenkai, Peng, Weijun, Wang, Wei, Cao, Yijun, Zhang, Longyu, and Huang, Yukun
Published: 2024
Full Text: View/download PDF

27. Efficient and clean treatment of indium-bearing zinc ferrite: A new approach using a water-regulated deep eutectic solvent

Author: Liu, Jiang, Chen, Bingxue, Huang, Yukun, Cao, Yijun, Chen, Jingbo, Wang, Liqiang, Liu, Yan, and Fan, Yangyang
Published: 2024
Full Text: View/download PDF

28. A novel theranostic strategy for myocardial infarction through neutralization of endogenous SO2 using an endoplasmic reticulum-targeted fluorescent probe

Author: Jiang, Yunhan, Liu, Pingxian, Li, Huidong, Fan, Dongmei, Huang, Yukun, Zhou, Meng, and Yang, Tao
Published: 2024
Full Text: View/download PDF

29. Comparison of sea buckthorn fruit oil nanoemulsions stabilized by protein-polysaccharide conjugates prepared using β-glucan from various sources

Author: Shen, Ziyi, Dai, Juan, Yang, Xinyue, Liu, Yao, Liu, Lei, Huang, YuKun, Wang, Lijun, Chen, Pengfei, Chen, Xianggui, Zhang, Chisong, Zhao, Juan, Yang, Xiao, and Wang, Qin
Published: 2024
Full Text: View/download PDF

30. Exploring organophosphate ester contamination and distribution in food: A meta-analysis

Author: Li, Wenjun, Chen, Junlong, Bie, Qianqian, Chen, Xianggui, Huang, Yukun, Zhang, Kaihui, and Qian, Shan
Published: 2024
Full Text: View/download PDF

31. A comprehensive review of the electrochemical advanced oxidation processes: Detection of free radical, electrode materials and application

Author: Zhang, Longyu, Peng, Weijun, Wang, Wei, Cao, Yijun, Fan, Guixia, Huang, Yukun, and Qi, Mengyao
Published: 2024
Full Text: View/download PDF

32. Synchronous averaging with sliding narrowband filtering for low-speed bearing fault diagnosis

Author: Huang, Yukun, Wang, Kun, Deng, Zhenhong, Xue, Zhengkun, Zhang, Baoqiang, and Luo, Huageng
Published: 2024
Full Text: View/download PDF

33. Revealing the flavor compositions, microbial diversity, and biological functions of Huangshui from different production workshops

Author: Guo, Qingyan, Zhao, Jingjing, Peng, Jiabao, Huang, Yukun, and Shao, Bing
Published: 2024
Full Text: View/download PDF

34. Marigold-like cobalt-rich slag for highly efficient degradation of organic pollutants via peroxymonosulfate activation: Process factors, catalytic mechanism, and economic evaluation

Author: Wang, Chongqing, Li, Xingyang, Liu, Hongwen, Peng, Weijun, Cao, Yijun, and Huang, Yukun
Published: 2024
Full Text: View/download PDF

35. Rapid determination of germanium in lignite coal and coal-related solid byproducts by graphite furnace digestion inductively coupled plasma emission spectroscopy

Author: Huang, Yukun, Chen, Guangyu, Fu, Biao, Si, Yingfu, Li, Peng, Cao, Yijun, Rong, Lingkun, and Zhao, Chunjie
Published: 2024
Full Text: View/download PDF

36. Free Inclinations for Transneptunian Objects in the Main Kuiper Belt

Author: Huang, Yukun, Gladman, Brett, and Volk, Kathryn
Subjects: Astrophysics - Earth and Planetary Astrophysics
Abstract: There is a complex inclination structure present in the transneptunian object (TNO) orbital distribution in the main classical belt region (between orbital semimajor axes of 39 and 48 au). The long-term gravitational effects of the giant planets make TNO orbits precess, but non-resonant objects maintain a nearly constant 'free' inclination ($I_\text{free}$) with respect to a local forced precession pole. Because of the likely cosmogonic importance of the distribution of this quantity, we tabulate free inclinations for all main-belt TNOs, each individually computed using barycentric orbital elements with respect to each object's local forcing pole. We show that the simplest method, based on the Laplace-Lagrange secular theory, is unable to give correct forcing poles for objects near the $\nu_{18}$ secular resonance, resulting in poorly conserved $I_\text{free}$ values in much of the main belt. We thus instead implemented an averaged Hamiltonian to obtain the expected nodal precession for each TNO, yielding significantly more accurate free inclinations for non-resonant objects. For the vast majority (96\%) of classical belt TNOs, these $I_\text{free}$ values are conserved to $<1^\circ$ over 4 Gyr numerical simulations, demonstrating the advantage of using this well-conserved quantity in studies of the TNO population and its primordial inclination profile; our computed distributions only reinforce the idea of a very co-planar surviving 'cold' primordial population, overlain by a large $I$-width implanted 'hot' population.
Comment: 15 pages, 6 figures, accepted for publication in ApJS. Data downloadable at https://yukunhuang.com
Published: 2022
Full Text: View/download PDF

37. Separation of silicon and germanium from the chlorination distillation residue based on co-precipitation of sodium-aluminum–silicon

Author: Huang, Yukun, Chen, Guangyu, Zhang, Yifan, Fu, Biao, Cao, Yijun, Liu, Jiang, and Peng, Weijun
Published: 2024
Full Text: View/download PDF

38. Preparation and characterization of a selective clarifier targeting chitinase: Effect on inhibiting turbidity formation and retaining aroma components in mulberry wine

Author: Liu, Zurui, Dai, Juan, Zhang, Kaihui, Ding, Yuexuan, Yang, Xinyue, Huang, YuKun, Wang, Lijun, Chen, Pengfei, Zhou, Zheng, Chen, Xianggui, and Yang, Xiao
Published: 2024
Full Text: View/download PDF

39. Re-investigation the phase equilibria and thermodynamic assessment of the Nd-Sn binary system

Author: He, Cuiyun, Huang, Yukun, Liu, Shengyu, Zen, Jianmin, and Jiang, Ruyi
Published: 2024
Full Text: View/download PDF

40. ICDM 2020 Knowledge Graph Contest: Consumer Event-Cause Extraction

Author: He, Congqing, Zhang, Jie, Zhu, Xiangyu, Liu, Huan, and Huang, Yukun
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Consumer Event-Cause Extraction, the task aimed at extracting the potential causes behind certain events in the text, has gained much attention in recent years due to its wide applications. The ICDM 2020 conference set up an evaluation competition that aims to extract events and the causes of the extracted events with a specified subject (a brand or product). In this task, we mainly focus on how to construct an end-to-end model and extract multiple event types and event-causes simultaneously. To this end, we introduce a fresh perspective to revisit the relational event-cause extraction task and propose a novel sequence tagging framework, instead of extracting event types and event-causes separately. Experiments show our framework outperforms baseline methods even when its encoder module uses an initialized pre-trained BERT encoder, showing the power of the new tagging framework. In this competition, our team achieved 1st place in the first stage leaderboard, and 3rd place in the final stage leaderboard.
Published: 2021

41. Improvements in recovery of rare earth elements (REEs) from coal via fluidized-bed combustion: Thermal alteration of REE mineralogy and its impact on element extractability

Author: Fu, Biao, Si, Yingfu, Huang, Yongda, Xu, Guorong, Cao, Yijun, Zhao, Chunjie, Huang, Yukun, Zou, Renjie, Luo, Guangqian, and Yao, Hong
Published: 2024
Full Text: View/download PDF

42. A new strategy for clean utilization of zinc oxide dust: Preparation of spinel zinc ferrite by solid phase reaction and its catalytic degradation of organic wastewater

Author: Huang, Yukun, Chen, Xiaolei, Fan, Yangyang, Wang, Chongqing, Cao, Yijun, Peng, Weijun, Fu, Biao, Liu, Jiang, and Hu, Mingzhen
Published: 2024
Full Text: View/download PDF

43. A novel molybdenite depressant for efficient selective flotation separation of chalcopyrite and molybdenite

Author: Qi, Mengyao, Peng, Weijun, Wang, Wei, Cao, Yijun, Zhang, Longyu, and Huang, Yukun
Published: 2024
Full Text: View/download PDF

44. Green manuring alters reactive N losses and N pools in arable soils: A meta-regression study

Author: Xu, Bing, Gui, Dongyang, Peng, Hongbo, Huang, Yukun, and Sha, Zhipeng
Published: 2024
Full Text: View/download PDF

45. Reprogramming systemic and local immune function to empower immunotherapy against glioblastoma

Author: Zhou, Songlei, Huang, Yukun, Chen, Yu, Liu, Yipu, Xie, Laozhi, You, Yang, Tong, Shiqiang, Xu, Jianpei, Jiang, Gan, Song, Qingxiang, Mei, Ni, Ma, Fenfen, Gao, Xiaoling, Chen, Hongzhuan, and Chen, Jun
Published: 2023
Full Text: View/download PDF

46. Data-physics-driven estimation of battery state of charge and capacity

Author: Tang, Aihua, Huang, Yukun, Xu, Yuchen, Hu, Yuanzhi, Yan, Fuwu, Tan, Yong, Jin, Xin, and Yu, Quanqing
Published: 2024
Full Text: View/download PDF

47. A green method for selective separation of molybdenite and pyrite via electrochemical oxidation pretreatment-flotation and its mechanism

Author: Zhang, Longyu, Peng, Weijun, Wang, Wei, Cao, Yijun, Qi, Mengyao, and Huang, Yukun
Published: 2024
Full Text: View/download PDF

48. Correlation between microbial communities and volatile flavor compounds in the fermentation of Semen Sojae Praeparatum

Author: Guo, Qingyan, Peng, Jiabao, Zhao, Jingjing, Yue, Jiaxin, Huang, Yukun, and Shao, Bing
Published: 2024
Full Text: View/download PDF

49. Intranasal drug delivery: The interaction between nanoparticles and the nose-to-brain pathway

Author: Chen, Yaoxing, Zhang, Chenyun, Huang, Yukun, Ma, Yuxiao, Song, Qingxiang, Chen, Hongzhuan, Jiang, Gan, and Gao, Xiaoling
Published: 2024
Full Text: View/download PDF

50. Research on Synergy Effect of Enterprise Architecture Integration in Power Grid Industry from the Perspective of Value Stream

Author: Pei, Qiugen, Zhang, Li, Peng, Zewu, Chen, Yunzhi, and Huang, Yukun
Editors: Li, Kan (Editor-in-Chief); Li, Qingyong (Associate Editor); Fournier-Viger, Philippe, Hong, Wei-Chiang, Liang, Xun, Wang, Long, and Xu, Xuesong (Series Editors); Kandel, Bijay Kumar, Misra, Anuranjan, Liao, Junfeng, and Valmohammadi, Changiz (editors)
Published: 2023
Full Text: View/download PDF
