Author: "Sheng, Jun" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Sheng, Jun"' showing total 5,178 results

Start Over Author "Sheng, Jun"

5,178 results on '"Sheng, Jun"'

1. Dual-Head Knowledge Distillation: Enhancing Logits Utilization with an Auxiliary Head

Author: Yang, Penghui, Zong, Chen-Chen, Huang, Sheng-Jun, Feng, Lei, and An, Bo
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Traditional knowledge distillation focuses on aligning the student's predicted probabilities with both ground-truth labels and the teacher's predicted probabilities. However, the transition to predicted probabilities from logits would obscure certain indispensable information. To address this issue, it is intuitive to additionally introduce a logit-level loss function as a supplement to the widely used probability-level loss function, for exploiting the latent information of logits. Unfortunately, we empirically find that the amalgamation of the newly introduced logit-level loss and the previous probability-level loss will lead to performance degeneration, even trailing behind the performance of employing either loss in isolation. We attribute this phenomenon to the collapse of the classification head, which is verified by our theoretical analysis based on the neural collapse theory. Specifically, the gradients of the two loss functions exhibit contradictions in the linear classifier yet display no such conflict within the backbone. Drawing from the theoretical analysis, we propose a novel method called dual-head knowledge distillation, which partitions the linear classifier into two classification heads responsible for different losses, thereby preserving the beneficial effects of both losses on the backbone while eliminating adverse influences on the classification head. Extensive experiments validate that our method can effectively exploit the information inside the logits and achieve superior performance against state-of-the-art counterparts., Comment: Preprint
Published: 2024

2. The JCMT BISTRO Survey: The Magnetic Fields of the IC 348 Star-forming Region

Author: Choi, Youngwoo, Kwon, Woojin, Pattle, Kate, Arzoumanian, Doris, Bourke, Tyler L., Hoang, Thiem, Hwang, Jihye, Koch, Patrick M., Sadavoy, Sarah, Bastien, Pierre, Furuya, Ray, Lai, Shih-Ping, Qiu, Keping, Ward-Thompson, Derek, Berry, David, Byun, Do-Young, Chen, Huei-Ru Vivien, Chen, Wen Ping, Chen, Mike, Chen, Zhiwei, Ching, Tao-Chung, Cho, Jungyeon, Choi, Minho, Choi, Yunhee, Coudé, Simon, Chrysostomou, Antonio, Chung, Eun Jung, Dai, Sophia, Debattista, Victor, Di Francesco, James, Diep, Pham Ngoc, Doi, Yasuo, Duan, Hao-Yuan, Duan, Yan, Eswaraiah, Chakali, Fanciullo, Lapo, Fiege, Jason, Fissel, Laura M., Franzmann, Erica, Friberg, Per, Friesen, Rachel, Fuller, Gary, Gledhill, Tim, Graves, Sarah, Greaves, Jane, Griffin, Matt, Gu, Qilao, Han, Ilseung, Hasegawa, Tetsuo, Houde, Martin, Hull, Charles L. H., Inoue, Tsuyoshi, Inutsuka, Shu-ichiro, Iwasaki, Kazunari, Jeong, Il-Gyo, Johnstone, Doug, Karoly, Janik, Könyves, Vera, Kang, Ji-hyun, Lacaille, Kevin, Law, Chi-Yan, Lee, Chang Won, Lee, Hyeseung, Lee, Chin-Fei, Lee, Jeong-Eun, Lee, Sang-Sung, Li, Dalei, Li, Di, Li, Guangxing, Li, Hua-bai, Lin, Sheng-Jun, Liu, Hong-Li, Liu, Tie, Liu, Sheng-Yuan, Liu, Junhao, Longmore, Steven, Lu, Xing, Lyo, A-Ran, Mairs, Steve, Matsumura, Masafumi, Matthews, Brenda, Moriarty-Schieven, Gerald, Nagata, Tetsuya, Nakamura, Fumitaka, Nakanishi, Hiroyuki, Ngoc, Nguyen Bich, Ohashi, Nagayoshi, Onaka, Takashi, Park, Geumsook, Parsons, Harriet, Peretto, Nicolas, Priestley, Felix, Pyo, Tae-Soo, Qian, Lei, Rao, Ramprasad, Rawlings, Jonathan, Rawlings, Mark, Retter, Brendan, Richer, John, Rigby, Andrew, Saito, Hiro, Savini, Giorgio, Seta, Masumichi, Sharma, Ekta, Shimajiri, Yoshito, Shinnaga, Hiroko, Soam, Archana, Kang, Miju, Kataoka, Akimasa, Kawabata, Koji, Kemper, Francisca, Kim, Jongsoo, Kim, Shinyoung, Kim, Gwanjeong, Kim, Kyoung Hee, Kim, Mi-Ryang, Kim, Kee-Tae, Kim, Hyosung, Kirchschlager, Florian, Kirk, Jason, Kobayashi, Masato I. N., Kusune, Takayoshi, Kwon, Jungmi, Tamura, Motohide, Tang, Ya-Wen, Tang, Xindi, Tomisaka, Kohji, Tsukamoto, Yusuke, Viti, Serena, Wang, Hongchi, Wang, Jia-Wei, Wu, Jintai, Xie, Jinjin, Yang, Meng-Zhe, Yen, Hsi-Wei, Yoo, Hyunju, Yuan, Jinghua, Yun, Hyeong-Sik, Zenko, Tetsuya, Zhang, Guoyin, Zhang, Yapeng, Zhang, Chuan-Peng, Zhou, Jianjun, Zhu, Lei, de Looze, Ilse, André, Philippe, Dowell, C. Darren, Eden, David, Eyres, Stewart, Falle, Sam, Gouellec, Valentin J. M. Le, Poidevin, Frédérick, and van Loo, Sven
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We present 850 $\mu$m polarization observations of the IC 348 star-forming region in the Perseus molecular cloud as part of the B-fields In STar-forming Region Observation (BISTRO) survey. We study the magnetic properties of two cores (HH 211 MMS and IC 348 MMS) and a filamentary structure of IC 348. We find that the overall field tends to be more perpendicular than parallel to the filamentary structure of the region. The polarization fraction decreases with intensity, and we estimate the trend by power-law and the mean of the Rice distribution fittings. The power indices for the cores are much smaller than 1, indicative of possible grain growth to micron size in the cores. We also measure the magnetic field strengths of the two cores and the filamentary area separately by applying the Davis-Chandrasekhar-Fermi method and its alternative version for compressed medium. The estimated mass-to-flux ratios are 0.45-2.20 and 0.63-2.76 for HH 211 MMS and IC 348 MMS, respectively, while the ratios for the filament is 0.33-1.50. This result may suggest that the transition from subcritical to supercritical conditions occurs at the core scale ($\sim$ 0.05 pc) in the region. In addition, we study the energy balance of the cores and find that the relative strength of turbulence to the magnetic field tends to be stronger for IC 348 MMS than HH 211 MMS. The result could potentially explain the different configurations inside the two cores: a single protostellar system in HH 211 MMS and multiple protostars in IC 348 MMS., Comment: Accepted for publication in ApJ. 21 pages, 12 figures
Published: 2024

3. Dirichlet-Based Coarse-to-Fine Example Selection For Open-Set Annotation

Author: Wang, Ye-Wen, Zong, Chen-Chen, Xie, Ming-Kun, and Huang, Sheng-Jun
Subjects: Computer Science - Artificial Intelligence
Abstract: Active learning (AL) has achieved great success by selecting the most valuable examples from unlabeled data. However, they usually deteriorate in real scenarios where open-set noise gets involved, which is studied as open-set annotation (OSA). In this paper, we owe the deterioration to the unreliable predictions arising from softmax-based translation invariance and propose a Dirichlet-based Coarse-to-Fine Example Selection (DCFS) strategy accordingly. Our method introduces simplex-based evidential deep learning (EDL) to break translation invariance and distinguish known and unknown classes by considering evidence-based data and distribution uncertainty simultaneously. Furthermore, hard known-class examples are identified by model discrepancy generated from two classifier heads, where we amplify and alleviate the model discrepancy respectively for unknown and known classes. Finally, we combine the discrepancy with uncertainties to form a two-stage strategy, selecting the most informative examples from known classes. Extensive experiments on various openness ratio datasets demonstrate that DCFS achieves state-of-art performance.
Published: 2024

4. CodeACT: Code Adaptive Compute-efficient Tuning Framework for Code LLMs

Author: Lv, Weijie, Xia, Xuan, and Huang, Sheng-Jun
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Large language models (LLMs) have shown great potential in code-related tasks, yet open-source models lag behind their closed-source counterparts. To bridge this performance gap, existing methods generate vast amounts of synthetic data for fine-tuning, leading to inefficiencies in training. Motivated by the need for more effective and efficient training, we propose the Code Adaptive Compute-efficient Tuning (CodeACT) framework. CodeACT introduces the Complexity and Diversity Aware Sampling (CDAS) method to select high-quality training data based on complexity and diversity, and the Dynamic Pack padding strategy to reduce computational resource usage by minimizing padding tokens during training. Experimental results demonstrate that CodeACT-DeepSeek-Coder-6.7B, fine-tuned on only 40% of the EVOL-Instruct data, achieves an 8.6% performance increase on HumanEval, reduces training time by 78%, and decreases peak GPU memory usage by 27%. These findings underscore CodeACT's ability to enhance the performance and efficiency of open-source models. By optimizing both the data selection and training processes, CodeACT offers a comprehensive approach to improving the capabilities of open-source LLMs while significantly reducing computational requirements, addressing the dual challenges of data quality and training efficiency, and paving the way for more resource-efficient and performant models.
Published: 2024

5. Dual-Decoupling Learning and Metric-Adaptive Thresholding for Semi-Supervised Multi-Label Learning

Author: Xiao, Jia-Hao, Xie, Ming-Kun, Fan, Heng-Bo, Niu, Gang, Sugiyama, Masashi, and Huang, Sheng-Jun
Subjects: Computer Science - Machine Learning
Abstract: Semi-supervised multi-label learning (SSMLL) is a powerful framework for leveraging unlabeled data to reduce the expensive cost of collecting precise multi-label annotations. Unlike semi-supervised learning, one cannot select the most probable label as the pseudo-label in SSMLL due to multiple semantics contained in an instance. To solve this problem, the mainstream method developed an effective thresholding strategy to generate accurate pseudo-labels. Unfortunately, the method neglected the quality of model predictions and its potential impact on pseudo-labeling performance. In this paper, we propose a dual-perspective method to generate high-quality pseudo-labels. To improve the quality of model predictions, we perform dual-decoupling to boost the learning of correlative and discriminative features, while refining the generation and utilization of pseudo-labels. To obtain proper class-wise thresholds, we propose the metric-adaptive thresholding strategy to estimate the thresholds, which maximize the pseudo-label performance for a given metric on labeled data. Experiments on multiple benchmark datasets show the proposed method can achieve the state-of-the-art performance and outperform the comparative methods with a significant margin.
Published: 2024

6. Relative Difficulty Distillation for Semantic Segmentation

Author: Liang, Dong, Sun, Yue, Du, Yun, Chen, Songcan, and Huang, Sheng-Jun
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Current knowledge distillation (KD) methods primarily focus on transferring various structured knowledge and designing corresponding optimization goals to encourage the student network to imitate the output of the teacher network. However, introducing too many additional optimization objectives may lead to unstable training, such as gradient conflicts. Moreover, these methods ignored the guidelines of relative learning difficulty between the teacher and student networks. Inspired by human cognitive science, in this paper, we redefine knowledge from a new perspective -- the student and teacher networks' relative difficulty of samples, and propose a pixel-level KD paradigm for semantic segmentation named Relative Difficulty Distillation (RDD). We propose a two-stage RDD framework: Teacher-Full Evaluated RDD (TFE-RDD) and Teacher-Student Evaluated RDD (TSE-RDD). RDD allows the teacher network to provide effective guidance on learning focus without additional optimization goals, thus avoiding adjusting learning weights for multiple losses. Extensive experimental evaluations using a general distillation loss function on popular datasets such as Cityscapes, CamVid, Pascal VOC, and ADE20k demonstrate the effectiveness of RDD against state-of-the-art KD methods. Additionally, our research showcases that RDD can integrate with existing KD methods to improve their upper performance bound.
Published: 2024

7. One-shot Active Learning Based on Lewis Weight Sampling for Multiple Deep Models

Author: Huang, Sheng-Jun, Li, Yi, Sun, Yiming, and Tang, Ying-Peng
Subjects: Computer Science - Machine Learning
Abstract: Active learning (AL) for multiple target models aims to reduce labeled data querying while effectively training multiple models concurrently. Existing AL algorithms often rely on iterative model training, which can be computationally expensive, particularly for deep models. In this paper, we propose a one-shot AL method to address this challenge, which performs all label queries without repeated model training. Specifically, we extract different representations of the same dataset using distinct network backbones, and actively learn the linear prediction layer on each representation via an $\ell_p$-regression formulation. The regression problems are solved approximately by sampling and reweighting the unlabeled instances based on their maximum Lewis weights across the representations. An upper bound on the number of samples needed is provided with a rigorous analysis for $p\in [1, +\infty)$. Experimental results on 11 benchmarks show that our one-shot approach achieves competitive performances with the state-of-the-art AL methods for multiple target models., Comment: The proof of Lemma 3.11 is fixed
Published: 2024

8. Improving Generalization of Deep Neural Networks by Optimum Shifting

Author: Zhou, Yuyan, Li, Ye, Feng, Lei, and Huang, Sheng-Jun
Subjects: Computer Science - Machine Learning
Abstract: Recent studies showed that the generalization of neural networks is correlated with the sharpness of the loss landscape, and flat minima suggests a better generalization ability than sharp minima. In this paper, we propose a novel method called \emph{optimum shifting}, which changes the parameters of a neural network from a sharp minimum to a flatter one while maintaining the same training loss value. Our method is based on the observation that when the input and output of a neural network are fixed, the matrix multiplications within the network can be treated as systems of under-determined linear equations, enabling adjustment of parameters in the solution space, which can be simply accomplished by solving a constrained optimization problem. Furthermore, we introduce a practical stochastic optimum shifting technique utilizing the Neural Collapse theory to reduce computational costs and provide more degrees of freedom for optimum shifting. Extensive experiments (including classification and detection) with various deep neural network architectures on benchmark datasets demonstrate the effectiveness of our method.
Published: 2024

9. Deuterium fractionation of the starless core L 1498

Author: Lin, Sheng-Jun, Lai, Shih-Ping, Pagani, Laurent, Lefèvre, Charlène, and Thieme, Travis J.
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: Molecular deuteration is commonly seen in starless cores and is expected to occur on a timescale comparable to that of the core contraction. Thus, the deuteration serves as a chemical clock, allowing us to investigate dynamical theories of core formation. We aim to provide a 3D cloud description for the starless core L 1498 located in the nearby low-mass star-forming region Taurus, and explore the possible core formation mechanism of L 1498. We carried out non-local thermal equilibrium radiative transfer with multi-transition observations of the high-density tracer N$_2$H$^+$ to derive the density and temperature profiles of the L 1498 core. Combining with the spectral observations of the deuterated species, ortho-H$_2$D$^+$, N$_2$D$^+$, and DCO$^+$, we derived the abundance profiles for observed species and performed chemical modeling of the deuteration profiles across L 1498 to constrain the contraction timescale. We present the first ortho-H$_2$D$^+$ (1$_{10}$-1$_{11}$) detection toward L 1498. We find a peak molecular hydrogen density of $1.6_{-0.3}^{+3.0}\times10^{5}$~cm$^{-3}$, a temperature of 7.5$_{-0.5}^{+0.7}$~K, and a N$_2$H$^+$ deuteration of 0.27$_{-0.15}^{+0.12}$ in the center. We derive a lower limit of the core age for L 1498 of 0.16~Ma which is compatible with the typical free-fall time, indicating that L 1498 likely formed rapidly., Comment: 21 pages, 12 figures, accepted for publication in A&A
Published: 2024
Full Text: View/download PDF

10. Continual Learning in the Presence of Repetition

Author: Hemati, Hamed, Pellegrini, Lorenzo, Duan, Xiaotian, Zhao, Zixuan, Xia, Fangfang, Masana, Marc, Tscheschner, Benedikt, Veas, Eduardo, Zheng, Yuxiang, Zhao, Shiji, Li, Shao-Yuan, Huang, Sheng-Jun, Lomonaco, Vincenzo, and van de Ven, Gido M.
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Continual learning (CL) provides a framework for training models in ever-evolving environments. Although re-occurrence of previously seen objects or tasks is common in real-world problems, the concept of repetition in the data stream is not often considered in standard benchmarks for CL. Unlike with the rehearsal mechanism in buffer-based strategies, where sample repetition is controlled by the strategy, repetition in the data stream naturally stems from the environment. This report provides a summary of the CLVision challenge at CVPR 2023, which focused on the topic of repetition in class-incremental learning. The report initially outlines the challenge objective and then describes three solutions proposed by finalist teams that aim to effectively exploit the repetition in the stream to learn continually. The experimental results from the challenge highlight the effectiveness of ensemble-based solutions that employ multiple versions of similar modules, each trained on different but overlapping subsets of classes. This report underscores the transformative potential of taking a different perspective in CL by employing repetition in the data stream to foster innovative strategy design., Comment: Preprint; Challenge Report of the 4th Workshop on Continual Learning in Computer Vision at CVPR
Published: 2024

11. The First Estimation of the Ambipolar Diffusivity Coefficient from Multi-Scale Observations of the Class 0/I Protostar, HOPS-370

Author: Thieme, Travis J., Lai, Shih-Ping, Lee, Yueh-Ning, Lin, Sheng-Jun, and Yen, Hsi-Wei
Subjects: Astrophysics - Solar and Stellar Astrophysics, Astrophysics - Astrophysics of Galaxies
Abstract: Protostars are born in magnetized environments. As a consequence, the formation of protostellar disks can be suppressed by the magnetic field efficiently removing angular momentum of the infalling material. Non-ideal MHD effects are proposed to as one way to allow protostellar disks to form. Thus, it is important to understand their contributions in observations of protostellar systems. We derive an analytical equation to estimate the ambipolar diffusivity coefficient at the edge of the protostellar disk in the Class 0/I protostar, HOPS-370, for the first time, under the assumption that the disk radius is set by ambipolar diffusion. Using previous results of the protostellar mass, disk mass, disk radius, density and temperature profiles and magnetic field strength, we estimate the ambipolar diffusivity coefficient to be $1.7^{+1.5}_{-1.4}\times10^{19}\,\mathrm{cm^{2}\,s^{-1}}$. We quantify the contribution of ambipolar diffusion by estimating its dimensionless Els\"{a}sser number to be $\sim1.7^{+1.0}_{-1.0}$, indicating its dynamical importance in this region. We compare to chemical calculations of the ambipolar diffusivity coefficient using the Non-Ideal magnetohydrodynamics Coefficients and Ionisation Library (NICIL), which is consistent with our results. In addition, we compare our derived ambipolar diffusivity coefficient to the diffusivity coefficients for Ohmic dissipation and the Hall effect, and find ambipolar diffusion is dominant in our density regime. These results demonstrate a new methodology to understand non-ideal MHD effects in observations of protostellar disks. More detailed modeling of the magnetic field, envelope and microphysics, along with a larger sample of protostellar systems is needed to further understand the contributions of non-ideal MHD., Comment: 20 pages, 5 figures. Accepted for publication in ApJ
Published: 2024

12. Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training

Author: Xie, Ming-Kun, Xiao, Jia-Hao, Peng, Pei, Niu, Gang, Sugiyama, Masashi, and Huang, Sheng-Jun
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: The key to multi-label image classification (MLC) is to improve model performance by leveraging label correlations. Unfortunately, it has been shown that overemphasizing co-occurrence relationships can cause the overfitting issue of the model, ultimately leading to performance degradation. In this paper, we provide a causal inference framework to show that the correlative features caused by the target object and its co-occurring objects can be regarded as a mediator, which has both positive and negative impacts on model predictions. On the positive side, the mediator enhances the recognition performance of the model by capturing co-occurrence relationships; on the negative side, it has the harmful causal effect that causes the model to make an incorrect prediction for the target object, even when only co-occurring objects are present in an image. To address this problem, we propose a counterfactual reasoning method to measure the total direct effect, achieved by enhancing the direct effect caused only by the target object. Due to the unknown location of the target object, we propose patching-based training and inference to accomplish this goal, which divides an image into multiple patches and identifies the pivot patch that contains the target object. Experimental results on multiple benchmark datasets with diverse configurations validate that the proposed method can achieve state-of-the-art performance.
Published: 2024

13. Lp Solution of Reflected BSDEs with One Continuous Barrier and Quasi-linear Growth Generators

Author: Fan, Sheng-jun
Published: 2024
Full Text: View/download PDF

14. Bidirectional Uncertainty-Based Active Learning for Open Set Annotation

Author: Zong, Chen-Chen, Wang, Ye-Wen, Ning, Kun-Peng, Ye, Hai-Bo, and Huang, Sheng-Jun
Subjects: Computer Science - Machine Learning
Abstract: Active learning (AL) in open set scenarios presents a novel challenge of identifying the most valuable examples in an unlabeled data pool that comprises data from both known and unknown classes. Traditional methods prioritize selecting informative examples with low confidence, with the risk of mistakenly selecting unknown-class examples with similarly low confidence. Recent methods favor the most probable known-class examples, with the risk of picking simple already mastered examples. In this paper, we attempt to query examples that are both likely from known classes and highly informative, and propose a Bidirectional Uncertainty-based Active Learning (BUAL) framework. Specifically, we achieve this by first pushing the unknown class examples toward regions with high-confidence predictions, i.e., the proposed Random Label Negative Learning method. Then, we propose a Bidirectional Uncertainty sampling strategy by jointly estimating uncertainty posed by both positive and negative learning to perform consistent and stable sampling. BUAL successfully extends existing uncertainty-based AL methods to complex open-set scenarios. Extensive experiments on multiple datasets with varying openness demonstrate that BUAL achieves state-of-the-art performance. The code is available at https://github.com/chenchenzong/BUAL., Comment: Accepted to ECCV 2024
Published: 2024
Full Text: View/download PDF

15. Empowering Language Models with Active Inquiry for Deeper Understanding

Author: Pang, Jing-Cheng, Fan, Heng-Bo, Wang, Pengyuan, Xiao, Jia-Hao, Tang, Nan, Yang, Si-Hang, Jia, Chengxing, Huang, Sheng-Jun, and Yu, Yang
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: The rise of large language models (LLMs) has revolutionized the way that we interact with artificial intelligence systems through natural language. However, LLMs often misinterpret user queries because of their uncertain intention, leading to less helpful responses. In natural human interactions, clarification is sought through targeted questioning to uncover obscure information. Thus, in this paper, we introduce LaMAI (Language Model with Active Inquiry), designed to endow LLMs with this same level of interactive engagement. LaMAI leverages active learning techniques to raise the most informative questions, fostering a dynamic bidirectional dialogue. This approach not only narrows the contextual gap but also refines the output of the LLMs, aligning it more closely with user expectations. Our empirical studies, across a variety of complex datasets where LLMs have limited conversational context, demonstrate the effectiveness of LaMAI. The method improves answer accuracy from 31.9% to 50.9%, outperforming other leading question-answering frameworks. Moreover, in scenarios involving human participants, LaMAI consistently generates responses that are superior or comparable to baseline methods in more than 82% of the cases. The applicability of LaMAI is further evidenced by its successful integration with various LLMs, highlighting its potential for the future of interactive language models.
Published: 2024

16. Filamentary Network and Magnetic Field Structures Revealed with BISTRO in the High-Mass Star-Forming Region NGC2264 : Global Properties and Local Magnetogravitational Configurations

Author: Wang, Jia-Wei, Koch, Patrick M., Clarke, Seamus D., Fuller, Gary, Peretto, Nicolas, Tang, Ya-Wen, Yen, Hsi-Wei, Lai, Shih-Ping, Ohashi, Nagayoshi, Arzoumanian, Doris, Johnstone, Doug, Furuya, Ray, Inutsuka, Shu-ichiro, Lee, Chang Won, Ward-Thompson, Derek, Gouellec, Valentin J. M. Le, Liu, Hong-Li, Fanciullo, Lapo, Hwang, Jihye, Pattle, Kate, Poidevin, Frédérick, Tahani, Mehrnoosh, Onaka, Takashi, Rawlings, Mark G., Chung, Eun Jung, Liu, Junhao, Lyo, A-Ran, Priestley, Felix, Hoang, Thiem, Tamura, Motohide, Berry, David, Bastien, Pierre, Ching, Tao-Chung, Coudé, Simon, Kwon, Woojin, Chen, Mike, Eswaraiah, Chakali, Soam, Archana, Hasegawa, Tetsuo, Qiu, Keping, Bourke, Tyler L., Byun, Do-Young, Chen, Zhiwei, Chen, Huei-Ru Vivien, Chen, Wen Ping, Cho, Jungyeon, Choi, Minho, Choi, Yunhee, Choi, Youngwoo, Chrysostomou, Antonio, Dai, Sophia, Di Francesco, James, Diep, Pham Ngoc, Doi, Yasuo, Duan, Yan, Duan, Hao-Yuan, Eden, David, Fiege, Jason, Fissel, Laura M., Franzmann, Erica, Friberg, Per, Friesen, Rachel, Gledhill, Tim, Graves, Sarah, Greaves, Jane, Griffin, Matt, Gu, Qilao, Han, Ilseung, Hayashi, Saeko, Houde, Martin, Inoue, Tsuyoshi, Iwasaki, Kazunari, Jeong, Il-Gyo, Könyves, Vera, Kang, Ji-hyun, Kang, Miju, Karoly, Janik, Kataoka, Akimasa, Kawabata, Koji, Khan, Zacariyya, Kim, Mi-Ryang, Kim, Kee-Tae, Kim, Kyoung Hee, Kim, Shinyoung, Kim, Jongsoo, Kim, Hyosung, Kim, Gwanjeong, Kirchschlager, Florian, Kirk, Jason, Kobayashi, Masato I. N., Kusune, Takayoshi, Kwon, Jungmi, Lacaille, Kevin, Law, Chi-Yan, Lee, Sang-Sung, Lee, Hyeseung, Lee, Jeong-Eun, Lee, Chin-Fei, Li, Dalei, Li, Hua-bai, Li, Guangxing, Li, Di, Lin, Sheng-Jun, Liu, Tie, Liu, Sheng-Yuan, Lu, Xing, Mairs, Steve, Matsumura, Masafumi, Matthews, Brenda, Moriarty-Schieven, Gerald, Nagata, Tetsuya, Nakamura, Fumitaka, Nakanishi, Hiroyuki, Ngoc, Nguyen Bich, Park, Geumsook, Parsons, Harriet, Pyo, Tae-Soo, Qian, Lei, Rao, Ramprasad, Rawlings, Jonathan, Retter, Brendan, Richer, John, Rigby, Andrew, Sadavoy, Sarah, Saito, Hiro, Savini, Giorgio, Seta, Masumichi, Sharma, Ekta, Shimajiri, Yoshito, Shinnaga, Hiroko, Tang, Xindi, Thuong, Hoang Duc, Tomisaka, Kohji, Tram, Le Ngoc, Tsukamoto, Yusuke, Viti, Serena, Wang, Hongchi, Whitworth, Anthony, Wu, Jintai, Xie, Jinjin, Yang, Meng-Zhe, Yoo, Hyunju, Yuan, Jinghua, Yun, Hyeong-Sik, Zenko, Tetsuya, Zhang, Chuan-Peng, Zhang, Yapeng, Zhang, Guoyin, Zhou, Jianjun, Zhu, Lei, de Looze, Ilse, André, Philippe, Dowell, C. Darren, Eyres, Stewart, Falle, Sam, Robitaille, Jean-François, and van Loo, Sven
Subjects: Astrophysics - Solar and Stellar Astrophysics, Astrophysics - Astrophysics of Galaxies
Abstract: We report 850 $\mu$m continuum polarization observations toward the filamentary high-mass star-forming region NGC 2264, taken as part of the B-fields In STar forming Regions Observations (BISTRO) large program on the James Clerk Maxwell Telescope (JCMT). These data reveal a well-structured non-uniform magnetic field in the NGC 2264C and 2264D regions with a prevailing orientation around 30 deg from north to east. Field strengths estimates and a virial analysis for the major clumps indicate that NGC 2264C is globally dominated by gravity while in 2264D magnetic, gravitational, and kinetic energies are roughly balanced. We present an analysis scheme that utilizes the locally resolved magnetic field structures, together with the locally measured gravitational vector field and the extracted filamentary network. From this, we infer statistical trends showing that this network consists of two main groups of filaments oriented approximately perpendicular to one another. Additionally, gravity shows one dominating converging direction that is roughly perpendicular to one of the filament orientations, which is suggestive of mass accretion along this direction. Beyond these statistical trends, we identify two types of filaments. The type-I filament is perpendicular to the magnetic field with local gravity transitioning from parallel to perpendicular to the magnetic field from the outside to the filament ridge. The type-II filament is parallel to the magnetic field and local gravity. We interpret these two types of filaments as originating from the competition between radial collapsing, driven by filament self-gravity, and the longitudinal collapsing, driven by the region's global gravity., Comment: Accepted for publication in the Astrophysical Journal. 43 pages, 32 figures, and 4 tables (including Appendix)
Published: 2024

17. Dirichlet-Based Prediction Calibration for Learning with Noisy Labels

Author: Zong, Chen-Chen, Wang, Ye-Wen, Xie, Ming-Kun, and Huang, Sheng-Jun
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Learning with noisy labels can significantly hinder the generalization performance of deep neural networks (DNNs). Existing approaches address this issue through loss correction or example selection methods. However, these methods often rely on the model's predictions obtained from the softmax function, which can be over-confident and unreliable. In this study, we identify the translation invariance of the softmax function as the underlying cause of this problem and propose the \textit{Dirichlet-based Prediction Calibration} (DPC) method as a solution. Our method introduces a calibrated softmax function that breaks the translation invariance by incorporating a suitable constant in the exponent term, enabling more reliable model predictions. To ensure stable model training, we leverage a Dirichlet distribution to assign probabilities to predicted labels and introduce a novel evidence deep learning (EDL) loss. The proposed loss function encourages positive and sufficiently large logits for the given label, while penalizing negative and small logits for other labels, leading to more distinct logits and facilitating better example selection based on a large-margin criterion. Through extensive experiments on diverse benchmark datasets, we demonstrate that DPC achieves state-of-the-art performance. The code is available at https://github.com/chenchenzong/DPC.
Published: 2024
Full Text: View/download PDF

18. Mechanism, prevention, and control of mining-induced dynamic disasters in underground metal mines in China: Challenges and solutions

Author: Li, Peng, Cai, Mei-feng, Miao, Sheng-jun, Ren, Fen-hua, Gorjian, Mostafa, and Peng, Chao
Published: 2024
Full Text: View/download PDF

19. A Deep Model for Partial Multi-label Image Classification with Curriculum-based Disambiguation

Author: Sun, Feng, Xie, Ming-Kun, and Huang, Sheng-Jun
Published: 2024
Full Text: View/download PDF

20. Heterogeneous engineering of MnSe@NC@ReS2 core–shell nanowires for advanced sodium-/potassium-ion batteries

Author: Lu, Sheng-Jun, Lin, Jin-Yi, Wang, Cai-Hong, Zhang, Yu-Fei, Zhang, Yi, and Fan, Hao-Sen
Published: 2024
Full Text: View/download PDF

21. Rising Tides of Knowledge: Exploring China’s Higher Education Landscape and Human Capital Growth

Author: Xiao, Shumei, Sheng, Jun, and Zhang, Guangtao
Published: 2024
Full Text: View/download PDF

22. Magnetic fields of the starless core L 1512

Author: Lin, Sheng-Jun, Lai, Shih-Ping, Pattle, Kate, Berry, David, Clemens, Dan P., Pagani, Laurent, Ward-Thompson, Derek, Thieme, Travis J., and Ching, Tao-Chung
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We present JCMT POL-2 850 um dust polarization observations and Mimir H band stellar polarization observations toward the starless core L1512. We detect the highly-ordered core-scale magnetic field traced by the POL-2 data, of which the field orientation is consistent with the parsec-scale magnetic fields traced by Planck data, suggesting the large-scale fields thread from the low-density region to the dense core region in this cloud. The surrounding magnetic field traced by the Mimir data shows a wider variation in the field orientation, suggesting there could be a transition of magnetic field morphology at the envelope scale. L1512 was suggested to be presumably older than 1.4 Myr in a previous study via time-dependent chemical analysis, hinting that the magnetic field could be strong enough to slow the collapse of L1512. In this study, we use the Davis-Chandrasekhar-Fermi method to derive a plane-of-sky magnetic field strength ($B_{pos}$) of 18$\pm$7 uG and an observed mass-to-flux ratio ($\lambda_{obs}$) of 3.5$\pm$2.4, suggesting that L1512 is magnetically supercritical. However, the absence of significant infall motion and the presence of an oscillating envelope are inconsistent with the magnetically supercritical condition. Using a Virial analysis, we suggest the presence of a hitherto hidden line-of-sight magnetic field strength of ~27 uG with a mass-to-flux ratio ($\lambda_{tot}$) of ~1.6, in which case both magnetic and kinetic pressures are important in supporting the L1512 core. On the other hand, L1512 may have just reached supercriticality and will collapse at any time., Comment: 25 pages, 10 figures, accepted for publication in ApJ
Published: 2023

23. ALMA Survey of Orion Planck Galactic Cold Clumps (ALMASOP): Discovery of an extremely dense and compact object embedded in the prestellar core G208.68-19.92-N2

Author: Hirano, Naomi, Sahu, Dipen, Liu, Sheng-Yaun, Liu, Tie, Tatematsu, Ken'ichi, Dutta, Somnath, Li, Shanghuo, Lee, Chin-Fei, Li, Pak Shing, Hsu, Shih-Ying, Lin, Sheng-Jun, Johnstone, Doug, Bronfman, Leonardo, Chen, Huei-Ru Vivien, Eden, David J., Kuan, Yi-Jehng, Kwon, Woojin, Lee, Chang Won, Liu, Hong-Li, Rawlings, Mark G., Ristorcelli, Isabelle, and Traficante, Alessio
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: The internal structure of the prestellar core G208.68-19.02-N2 (G208-N2) in the Orion Molecular Cloud 3 (OMC-3) region has been studied with the Atacama Large Millimeter/submillimeter Array (ALMA). The dust continuum emission revealed a filamentary structure with a length of $\sim$5000 au and an average H$_2$ volume density of $\sim$6 $\times$ 10$^7$ cm$^{-3}$. At the tip of this filamentary structure, there is a compact object, which we call a ``nucleus", with a radius of $\sim$150--200 au and a mass of $\sim$0.1 M$_{\odot}$. The nucleus has a central density of $\sim$2 $\times$ 10$^9$ cm$^{-3}$ with a radial density profile of $r^{-1.87{\pm}0.11}$. The density scaling of the nucleus is $\sim$3.7 times higher than that of the singular isothermal sphere. This as well as the very low virial parameter of 0.39 suggest that the gravity is dominant over the pressure everywhere in the nucleus. However, there is no sign of CO outflow localized to this nucleus. The filamentary structure is traced by the N$_2$D$^+$ 3--2 emission, but not by the C$^{18}$O 2--1 emission, implying the significant CO depletion due to high density and cold temperature. Toward the nucleus, the N$_2$D$^+$ also shows the signature of depletion. This could imply either the depletion of the parent molecule, N$_2$, or the presence of the embedded very-low luminosity central source that could sublimate the CO in the very small area. The nucleus in G208-N2 is considered to be a prestellar core on the verge of first hydrostatic core (FHSC) formation or a candidate for the FHSC., Comment: 27 pages, 16 figures
Published: 2023

24. Improving Lens Flare Removal with General Purpose Pipeline and Multiple Light Sources Recovery

Author: Zhou, Yuyan, Liang, Dong, Chen, Songcan, Huang, Sheng-Jun, Yang, Shuo, and Li, Chongyi
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: When taking images against strong light sources, the resulting images often contain heterogeneous flare artifacts. These artifacts can importantly affect image visual quality and downstream computer vision tasks. While collecting real data pairs of flare-corrupted/flare-free images for training flare removal models is challenging, current methods utilize the direct-add approach to synthesize data. However, these methods do not consider automatic exposure and tone mapping in image signal processing pipeline (ISP), leading to the limited generalization capability of deep models training using such data. Besides, existing methods struggle to handle multiple light sources due to the different sizes, shapes and illuminance of various light sources. In this paper, we propose a solution to improve the performance of lens flare removal by revisiting the ISP and remodeling the principle of automatic exposure in the synthesis pipeline and design a more reliable light sources recovery strategy. The new pipeline approaches realistic imaging by discriminating the local and global illumination through convex combination, avoiding global illumination shifting and local over-saturation. Our strategy for recovering multiple light sources convexly averages the input and output of the neural network based on illuminance levels, thereby avoiding the need for a hard threshold in identifying light sources. We also contribute a new flare removal testing dataset containing the flare-corrupted images captured by ten types of consumer electronics. The dataset facilitates the verification of the generalization capability of flare removal methods. Extensive experiments show that our solution can effectively improve the performance of lens flare removal and push the frontier toward more general situations., Comment: ICCV 2023
Published: 2023

25. Multi-Label Knowledge Distillation

Author: Yang, Penghui, Xie, Ming-Kun, Zong, Chen-Chen, Feng, Lei, Niu, Gang, Sugiyama, Masashi, and Huang, Sheng-Jun
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: Existing knowledge distillation methods typically work by imparting the knowledge of output logits or intermediate feature maps from the teacher network to the student network, which is very successful in multi-class single-label learning. However, these methods can hardly be extended to the multi-label learning scenario, where each instance is associated with multiple semantic labels, because the prediction probabilities do not sum to one and feature maps of the whole example may ignore minor classes in such a scenario. In this paper, we propose a novel multi-label knowledge distillation method. On one hand, it exploits the informative semantic knowledge from the logits by dividing the multi-label learning problem into a set of binary classification problems; on the other hand, it enhances the distinctiveness of the learned feature representations by leveraging the structural information of label-wise embeddings. Experimental results on multiple benchmark datasets validate that the proposed method can avoid knowledge counteraction among labels, thus achieving superior performance against diverse comparing methods. Our code is available at: https://github.com/penghui-yang/L2D, Comment: Accepted by ICCV 2023. The first two authors contributed equally to this work
Published: 2023

26. Horseshoe kidney with concurrent mesangial proliferative IgA nephropathy：one case report

Author: Huang Tan, Sheng-jun Liu, Kan Liao, Yuan Gao, Jiang-hua Sun, and Chen-ling Liu
Subjects: horseshoe kidney, iga nephropathy, renal biopsy, Internal medicine, RC31-1245
Published: 2024
Full Text: View/download PDF

27. Inhibitory effect of 1,4,5,6-tetrahydroxy-7,8-diprenylxanthone against NSCLC with L858R/T790M/C797S mutant EGFR

Author: Wang, Jing, Wang, Yuna, Zhang, Shuanggou, Qu, Yana, Zhang, Ruohan, Wang, Xuanjun, Sheng, Jun, and Sun, Peiyuan
Published: 2024
Full Text: View/download PDF

28. AgriGAN: unpaired image dehazing via a cycle-consistent generative adversarial network for the agricultural plant phenotype

Author: Ding, Jin-Ting, Peng, Yong-Yu, Huang, Min, and Zhou, Sheng-Jun
Published: 2024
Full Text: View/download PDF

29. Proteomic analysis of mitochondria associated membranes in renal ischemic reperfusion injury

Author: Li, Yi, Wang, Hua-bin, Cao, Jin-long, Zhang, Wen-jun, Wang, Hai-long, Xu, Chang-hong, Li, Kun-peng, Liu, Yi, Wang, Ji-rong, Ha, Hua-lan, Fu, Sheng-jun, and Yang, Li
Published: 2024
Full Text: View/download PDF

30. ThermomiR-377-3p-induced suppression of Cirbp expression is required for effective elimination of cancer cells and cancer stem-like cells by hyperthermia

Author: Lin, Tao-Yan, Jia, Jun-Shuang, Luo, Wei-Ren, Lin, Xiao-Lin, Xiao, Sheng-Jun, Yang, Jie, Xia, Jia-Wei, Zhou, Chen, Zhou, Zhi-Hao, Lin, Shu-Jun, Li, Qi-Wen, Yang, Zhi-Zhi, Lei, Ye, Yang, Wen-Qing, Shen, Hong-Fen, Huang, Shi-Hao, Wang, Sheng-Chun, Chen, Lin-Bei, Yang, Yu-Lin, Xue, Shu-Wen, Li, Yong-Long, Dai, Guan-Qi, Zhou, Ying, Li, Ying-Chun, Wei, Fang, Rong, Xiao-Xiang, Luo, Xiao-Jun, Zhao, Bing-Xia, Huang, Wen-Hua, Xiao, Dong, and Sun, Yan
Published: 2024
Full Text: View/download PDF

31. Enhancing Efficiency and Decision-Making in Higher Education Through Intelligent Commercial Integration: Leveraging Artificial Intelligence

Author: Han, Xiao, Xiao, Shumei, Sheng, Jun, and Zhang, Guangtao
Published: 2024
Full Text: View/download PDF

32. Rational design of metal selenides nanomaterials for alkali metal ion (Li+/Na+/K+) batteries: current status and perspectives

Author: Sun, Rui, Xu, Feng, Wang, Cai-Hong, Lu, Sheng-Jun, Zhang, Yu-Fei, and Fan, Hao-Sen
Published: 2024
Full Text: View/download PDF

33. The biocontrol roles of cyclic lipopeptide putisolvin produced from Pseudomonas capeferrum HN2-3 on the Phytophthora blight disease in cucumbers

Author: Sheng, Jun, Qin, Xiao, Yang, Xiao, Liu, Qian, and Ma, Zongwang
Published: 2024
Full Text: View/download PDF

34. Cohomology groups of a new class of Kadison-Singer algebras

Author: An, Guangyu, Cheng, Xing, and Sheng, Jun
Published: 2024
Full Text: View/download PDF

35. The JCMT BISTRO Survey: Studying the Complex Magnetic Field of L43

Author: Karoly, Janik, Ward-Thompson, Derek, Pattle, Kate, Berry, David, Whitworth, Anthony, Kirk, Jason, Bastien, Pierre, Ching, Tao-Chung, Coude, Simon, Hwang, Jihye, Kwon, Woojin, Soam, Archana, Wang, Jia-Wei, Hasegawa, Tetsuo, Lai, Shih-Ping, Qiu, Keping, Arzoumanian, Doris, Bourke, Tyler L., Byun, Do-Young, Chen, Huei-Ru Vivien, Chen, Wen Ping, Chen, Mike, Chen, Zhiwei, Cho, Jungyeon, Choi, Minho, Choi, Youngwoo, Choi, Yunhee, Chrysostomou, Antonio, Chung, Eun Jung, Dai, Sophia, Debattista, Victor, Di Francesco, James, Diep, Pham Ngoc, Doi, Yasuo, Duan, Hao-Yuan, Duan, Yan, Eswaraiah, Chakali, Fanciullo, Lapo, Fiege, Jason, Fissel, Laura M., Franzmann, Erica, Friberg, Per, Friesen, Rachel, Fuller, Gary, Furuya, Ray, Gledhill, Tim, Graves, Sarah, Greaves, Jane, Griffin, Matt, Gu, Qilao, Han, Ilseung, Hoang, Thiem, Houde, Martin, Hull, Charles L. H., Inoue, Tsuyoshi, Inutsuka, Shu-ichiro, Iwasaki, Kazunari, Jeong, Il-Gyo, Johnstone, Doug, Konyves, Vera, Kang, Ji-hyun, Kang, Miju, Kataoka, Akimasa, Kawabata, Koji, Kemper, Francisca, Kim, Jongsoo, Kim, Shinyoung, Kim, Gwanjeong, Kim, Kyoung Hee, Kim, Mi-Ryang, Kim, Kee-Tae, Kim, Hyosung, Kirchschlager, Florian, Kobayashi, Masato I. N., Koch, Patrick M., Kusune, Takayoshi, Kwon, Jungmi, Lacaille, Kevin, Law, Chi-Yan, Lee, Chang Won, Lee, Hyeseung, Lee, Yong-Hee, Lee, Chin-Fei, Lee, Jeong-Eun, Lee, Sang-Sung, Li, Dalei, Li, Di, Li, Guangxing, Li, Hua-bai, Lin, Sheng-Jun, Liu, Hong-Li, Liu, Tie, Liu, Sheng-Yuan, Liu, Junhao, Longmore, Steven, Lu, Xing, Lyo, A-Ran, Mairs, Steve, Matsumura, Masafumi, Matthews, Brenda, Moriarty-Schieven, Gerald, Nagata, Tetsuya, Nakamura, Fumitaka, Nakanishi, Hiroyuki, Ngoc, Nguyen Bich, Ohashi, Nagayoshi, Onaka, Takashi, Park, Geumsook, Parsons, Harriet, Peretto, Nicolas, Priestley, Felix, Pyo, Tae-Soo, Qian, Lei, Rao, Ramprasad, Rawlings, Jonathan, Rawlings, Mark, Retter, Brendan, Richer, John, Rigby, Andrew, Sadavoy, Sarah, Saito, Hiro, Savini, Giorgio, Seta, Masumichi, Sharma, Ekta, Shimajiri, Yoshito, Shinnaga, Hiroko, Tahani, Mehrnoosh, Tamura, Motohide, Tang, Ya-Wen, Tang, Xindi, Tomisaka, Kohji, Tram, Le Ngoc, Tsukamoto, Yusuke, Viti, Serena, Wang, Hongchi, Wu, Jintai, Xie, Jinjin, Yang, Meng-Zhe, Yen, Hsi-Wei, Yoo, Hyunju, Yuan, Jinghua, Yun, Hyeong-Sik, Zenko, Tetsuya, Zhang, Guoyin, Zhang, Yapeng, Zhang, Chuan-Peng, Zhou, Jianjun, Zhu, Lei, de Looze, Ilse, Andre, Philippe, Dowell, C. Darren, Eden, David, Eyres, Stewart, Falle, Sam, Gouellec, Valentin J. M. Le, Poidevin, Frederick, Robitaille, Jean-Francois, and van Loo, Sven
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We present observations of polarized dust emission at 850 $\mu$m from the L43 molecular cloud which sits in the Ophiuchus cloud complex. The data were taken using SCUBA-2/POL-2 on the James Clerk Maxwell Telescope as a part of the BISTRO large program. L43 is a dense ($N_{\rm H_2}\sim 10^{22}$-10$^{23}$ cm$^{-2}$) complex molecular cloud with a submillimetre-bright starless core and two protostellar sources. There appears to be an evolutionary gradient along the isolated filament that L43 is embedded within, with the most evolved source closest to the Sco OB2 association. One of the protostars drives a CO outflow that has created a cavity to the southeast. We see a magnetic field that appears to be aligned with the cavity walls of the outflow, suggesting interaction with the outflow. We also find a magnetic field strength of up to $\sim$160$\pm$30 $\mu$G in the main starless core and up to $\sim$90$\pm$40 $\mu$G in the more diffuse, extended region. These field strengths give magnetically super- and sub-critical values respectively and both are found to be roughly trans-Alfv\'enic. We also present a new method of data reduction for these denser but fainter objects like starless cores., Comment: Accepted for publication in ApJ. 23 pages, 9 figures (7 main text, 2 appendix)
Published: 2023
Full Text: View/download PDF

36. Unlocking the Power of Open Set : A New Perspective for Open-Set Noisy Label Learning

Author: Wan, Wenhai, Wang, Xinrui, Xie, Ming-Kun, Li, Shao-Yuan, Huang, Sheng-Jun, and Chen, Songcan
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Learning from noisy data has attracted much attention, where most methods focus on closed-set label noise. However, a more common scenario in the real world is the presence of both open-set and closed-set noise. Existing methods typically identify and handle these two types of label noise separately by designing a specific strategy for each type. However, in many real-world scenarios, it would be challenging to identify open-set examples, especially when the dataset has been severely corrupted. Unlike the previous works, we explore how models behave when faced with open-set examples, and find that \emph{a part of open-set examples gradually get integrated into certain known classes}, which is beneficial for the separation among known classes. Motivated by the phenomenon, we propose a novel two-step contrastive learning method CECL (Class Expansion Contrastive Learning) which aims to deal with both types of label noise by exploiting the useful information of open-set examples. Specifically, we incorporate some open-set examples into closed-set classes to enhance performance while treating others as delimiters to improve representative ability. Extensive experiments on synthetic and real-world datasets with diverse label noise demonstrate the effectiveness of CECL.
Published: 2023

37. Class-Distribution-Aware Pseudo Labeling for Semi-Supervised Multi-Label Learning

Author: Xie, Ming-Kun, Xiao, Jia-Hao, Liu, Hao-Zhe, Niu, Gang, Sugiyama, Masashi, and Huang, Sheng-Jun
Subjects: Computer Science - Machine Learning
Abstract: Pseudo-labeling has emerged as a popular and effective approach for utilizing unlabeled data. However, in the context of semi-supervised multi-label learning (SSMLL), conventional pseudo-labeling methods encounter difficulties when dealing with instances associated with multiple labels and an unknown label count. These limitations often result in the introduction of false positive labels or the neglect of true positive ones. To overcome these challenges, this paper proposes a novel solution called Class-Aware Pseudo-Labeling (CAP) that performs pseudo-labeling in a class-aware manner. The proposed approach introduces a regularized learning framework incorporating class-aware thresholds, which effectively control the assignment of positive and negative pseudo-labels for each class. Notably, even with a small proportion of labeled examples, our observations demonstrate that the estimated class distribution serves as a reliable approximation. Motivated by this finding, we develop a class-distribution-aware thresholding strategy to ensure the alignment of pseudo-label distribution with the true distribution. The correctness of the estimated class distribution is theoretically verified, and a generalization error bound is provided for our proposed method. Extensive experiments on multiple benchmark datasets confirm the efficacy of CAP in addressing the challenges of SSMLL problems.
Published: 2023

38. ALL-E: Aesthetics-guided Low-light Image Enhancement

Author: Li, Ling, Liang, Dong, Gao, Yuanhang, Huang, Sheng-Jun, and Chen, Songcan
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Evaluating the performance of low-light image enhancement (LLE) is highly subjective, thus making integrating human preferences into image enhancement a necessity. Existing methods fail to consider this and present a series of potentially valid heuristic criteria for training enhancement models. In this paper, we propose a new paradigm, i.e., aesthetics-guided low-light image enhancement (ALL-E), which introduces aesthetic preferences to LLE and motivates training in a reinforcement learning framework with an aesthetic reward. Each pixel, functioning as an agent, refines itself by recursive actions, i.e., its corresponding adjustment curve is estimated sequentially. Extensive experiments show that integrating aesthetic assessment improves both subjective experience and objective evaluation. Our results on various benchmarks demonstrate the superiority of ALL-E over state-of-the-art methods.
Published: 2023

39. Implicit Stochastic Gradient Descent for Training Physics-informed Neural Networks

Author: Li, Ye, Chen, Song-Can, and Huang, Sheng-Jun
Subjects: Computer Science - Machine Learning
Abstract: Physics-informed neural networks (PINNs) have effectively been demonstrated in solving forward and inverse differential equation problems, but they are still trapped in training failures when the target functions to be approximated exhibit high-frequency or multi-scale features. In this paper, we propose to employ implicit stochastic gradient descent (ISGD) method to train PINNs for improving the stability of training process. We heuristically analyze how ISGD overcome stiffness in the gradient flow dynamics of PINNs, especially for problems with multi-scale solutions. We theoretically prove that for two-layer fully connected neural networks with large hidden nodes, randomly initialized ISGD converges to a globally optimal solution for the quadratic loss function. Empirical results demonstrate that ISGD works well in practice and compares favorably to other gradient-based optimization methods such as SGD and Adam, while can also effectively address the numerical stiffness in training dynamics via gradient descent., Comment: 17 pages, published as a conference paper at AAAI23
Published: 2023

40. AgriGAN: unpaired image dehazing via a cycle-consistent generative adversarial network for the agricultural plant phenotype

Author: Jin-Ting Ding, Yong-Yu Peng, Min Huang, and Sheng-Jun Zhou
Subjects: Agricultural images, Generative adversarial networks, Image dehazing, Information extraction, Medicine, Science
Abstract: Abstract Artificially extracted agricultural phenotype information exhibits high subjectivity and low accuracy, while the utilization of image extraction information is susceptible to interference from haze. Furthermore, the effectiveness of the agricultural image dehazing method used for extracting such information is limited due to unclear texture details and color representation in the images. To address these limitations, we propose AgriGAN (unpaired image dehazing via a cycle-consistent generative adversarial network) for enhancing the dehazing performance in agricultural plant phenotyping. The algorithm incorporates an atmospheric scattering model to improve the discriminator model and employs a whole-detail consistent discrimination approach to enhance discriminator efficiency, thereby accelerating convergence towards Nash equilibrium state within the adversarial network. Finally, by training with network adversarial loss + cycle consistent loss, clear images are obtained after dehazing process. Experimental evaluations and comparative analysis were conducted to assess this algorithm's performance, demonstrating improved accuracy in dehazing agricultural images while preserving detailed texture information and mitigating color deviation issues.
Published: 2024
Full Text: View/download PDF

41. Enhancing the mechanical properties of TATB-based PBXs through strong hydrogen bonding interactions

Author: Xian-zhi Zhou, Cheng-cheng Zeng, Zi-jian Li, Gang Li, Sheng-jun Zheng, and Fu-De Nie
Subjects: UPy, Grafting, PDA, TATB-based PBX, Mechanical properties, Chemical technology, TP1-1185
Abstract: Interfacial strength is a key factor affecting the mechanical properties of materials. This study aims to enhance the mechanical properties of energetic polymer bonded explosives (PBXs) by modifying 1,3,5-triamino-2,4,6-trintrobenzene (TATB) crystals—a typical energetic material—using 2-ureido-41H-6-methyl-pyrimidinone (UPy) derivatives with strong hydrogen-bonding interactions. Specifically, strongly adhesive polydopamine (PDA) was employed to graft UPy-functionalized molecules with isocyanate groups (–NCO) and hydroxyl groups (–OH). Scanning electron microscopy (SEM) images indicate that TATB crystals became rougher after being coated with PDA, while the introduction of UPy did not affect the surface morphology. The presence of urethane bond peaks in the samples indicates that UPy-NCO was successfully grafted onto the PDA. UPy is essentially nonpolar and is prone to bind with binders, having the potential to improve the creep resistance of PBXs. Due to the strong interfacial enhancement by UPy and PDA, the tensile strength and compressive strength of the sample grafted with 1 wt% UPy significantly increased by 35.6 % and 26.5 %, respectively. Theoretical calculations indicate interfacial enhancement by UPy introduction, where the strong hydrogen bonding may produce a positive impact. The successful introduction of UPy modified the nature of TATB and improved its interfacial strength, finally enhancing the mechanical properties of the PBXs. The conditions for the grafting reaction in this study are mild and universal and thus can be applied to other compositions.
Published: 2024
Full Text: View/download PDF

42. First BISTRO observations of the dark cloud Taurus L1495A-B10: the role of the magnetic field in the earliest stages of low-mass star formation

Author: Ward-Thompson, Derek, Karoly, Janik, Pattle, Kate, Whitworth, Anthony, Kirk, Jason, Berry, David, Bastien, Pierre, Ching, Tao-Chung, Coude, Simon, Hwang, Jihye, Kwon, Woojin, Soam, Archana, Wang, Jia-Wei, Hasegawa, Tetsuo, Lai, Shih-Ping, Qiu, Keping, Arzoumanian, Doris, Bourke, Tyler L., Byun, Do-Young, Chen, Huei-Ru Vivien, Chen, Wen Ping, Chen, Mike, Chen, Zhiwei, Cho, Jungyeon, Choi, Minho, Choi, Youngwoo, Choi, Yunhee, Chrysostomou, Antonio, Chung, Eun Jung, Dai, Sophia, Debattista, Victor, Di Francesco, James, Diep, Pham Ngoc, Doi, Yasuo, Duan, Hao-Yuan, Duan, Yan, Eswaraiah, Chakali, Fanciullo, Lapo, Fiege, Jason, Fissel, Laura M., Franzmann, Erica, Friberg, Per, Friesen, Rachel, Fuller, Gary, Furuya, Ray, Gledhill, Tim, Graves, Sarah, Greaves, Jane, Griffin, Matt, Gu, Qilao, Han, Ilseung, Hayashi, Saeko, Hoang, Thiem, Houde, Martin, Hull, Charles L. H., Inoue, Tsuyoshi, Inutsuka, Shu-ichiro, Iwasaki, Kazunari, Jeong, Il-Gyo, Johnstone, Doug, Konyves, Vera, Kang, Ji-hyun, Kang, Miju, Kataoka, Akimasa, Kawabata, Koji, Kemper, Francisca, Kim, Jongsoo, Kim, Shinyoung, Kim, Gwanjeong, Kim, Kyoung Hee, Kim, Mi-Ryang, Kim, Kee-Tae, Kim, Hyosung, Kirchschlager, Florian, Kobayashi, Masato I. N., Koch, Patrick M., Kusune, Takayoshi, Kwon, Jungmi, Lacaille, Kevin, Law, Chi-Yan, Lee, Chang Won, Lee, Hyeseung, Lee, Yong-Hee, Lee, Chin-Fei, Lee, Jeong-Eun, Lee, Sang-Sung, Li, Dalei, Li, Di, Li, Guangxing, Li, Hua-bai, Lin, Sheng-Jun, Liu, Hong-Li, Liu, Tie, Liu, Sheng-Yuan, Liu, Junhao, Longmore, Steven, Lu, Xing, Lyo, A-Ran, Mairs, Steve, Matsumura, Masafumi, Matthews, Brenda, Moriarty-Schieven, Gerald, Nagata, Tetsuya, Nakamura, Fumitaka, Nakanishi, Hiroyuki, Ngoc, Nguyen Bich, Ohashi, Nagayoshi, Onaka, Takashi, Park, Geumsook, Parsons, Harriet, Peretto, Nicolas, Priestley, Felix, Pyo, Tae-Soo, Qian, Lei, Rao, Ramprasad, Rawlings, Jonathan, Rawlings, Mark, Retter, Brendan, Richer, John, Rigby, Andrew, Sadavoy, Sarah, Saito, Hiro, Savini, Giorgio, Seta, Masumichi, Shimajiri, Yoshito, Shinnaga, Hiroko, Tahani, Mehrnoosh, Tamura, Motohide, Tang, Ya-Wen, Tang, Xindi, Tomisaka, Kohji, Tram, Le Ngoc, Tsukamoto, Yusuke, Viti, Serena, Wang, Hongchi, Wu, Jintai, Xie, Jinjin, Yang, Meng-Zhe, Yen, Hsi-Wei, Yoo, Hyunju, Yuan, Jinghua, Yun, Hyeong-Sik, Zenko, Tetsuya, Zhang, Guoyin, Zhang, Yapeng, Zhang, Chuan-Peng, Zhou, Jianjun, Zhu, Lei, de Looze, Ilse, Andre, Philippe, Dowell, C. Darren, Eden, David, Eyres, Stewart, Falle, Sam, Gouellec, Valentin J. M. Le, Poidevin, Frederick, Robitaille, Jean-Francois, and van Loo, Sven
Subjects: Astrophysics - Astrophysics of Galaxies, Astrophysics - Solar and Stellar Astrophysics
Abstract: We present BISTRO Survey 850 {\mu}m dust emission polarisation observations of the L1495A-B10 region of the Taurus molecular cloud, taken at the JCMT. We observe a roughly triangular network of dense filaments. We detect 9 of the dense starless cores embedded within these filaments in polarisation, finding that the plane-of-sky orientation of the core-scale magnetic field lies roughly perpendicular to the filaments in almost all cases. We also find that the large-scale magnetic field orientation measured by Planck is not correlated with any of the core or filament structures, except in the case of the lowest-density core. We propose a scenario for early prestellar evolution that is both an extension to, and consistent with, previous models, introducing an additional evolutionary transitional stage between field-dominated and matter-dominated evolution, observed here for the first time. In this scenario, the cloud collapses first to a sheet-like structure. Uniquely, we appear to be seeing this sheet almost face-on. The sheet fragments into filaments, which in turn form cores. However, the material must reach a certain critical density before the evolution changes from being field-dominated to being matter-dominated. We measure the sheet surface density and the magnetic field strength at that transition for the first time and show consistency with an analytical prediction that had previously gone untested for over 50 years (Mestel 1965)., Comment: 14 pages, 5 figures. ApJ accepted
Published: 2023
Full Text: View/download PDF

43. MUS-CDB: Mixed Uncertainty Sampling with Class Distribution Balancing for Active Annotation in Aerial Object Detection

Author: Liang, Dong, Zhang, Jing-Wei, Tang, Ying-Peng, and Huang, Sheng-Jun
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent aerial object detection models rely on a large amount of labeled training data, which requires unaffordable manual labeling costs in large aerial scenes with dense objects. Active learning effectively reduces the data labeling cost by selectively querying the informative and representative unlabelled samples. However, existing active learning methods are mainly with class-balanced settings and image-based querying for generic object detection tasks, which are less applicable to aerial object detection scenarios due to the long-tailed class distribution and dense small objects in aerial scenes. In this paper, we propose a novel active learning method for cost-effective aerial object detection. Specifically, both object-level and image-level informativeness are considered in the object selection to refrain from redundant and myopic querying. Besides, an easy-to-use class-balancing criterion is incorporated to favor the minority objects to alleviate the long-tailed class distribution problem in model training. We further devise a training loss to mine the latent knowledge in the unlabeled image regions. Extensive experiments are conducted on the DOTA-v1.0 and DOTA-v2.0 benchmarks to validate the effectiveness of the proposed method. For the ReDet, KLD, and SASM detectors on the DOTA-v2.0 dataset, the results show that our proposed MUS-CDB method can save nearly 75\% of the labeling cost while achieving comparable performance to other active learning methods in terms of mAP.Code is publicly online (https://github.com/ZJW700/MUS-CDB)., Comment: 13 pages, 7 figures
Published: 2022
Full Text: View/download PDF

44. The JCMT BISTRO-2 Survey: Magnetic Fields of the Massive DR21 Filament

Author: Ching, Tao-Chung, Qiu, Keping, Li, Di, Ren, Zhiyuan, Lai, Shih-Ping, Berry, David, Pattle, Kate, Furuya, Ray, Ward-Thompson, Derek, Johnstone, Doug, Koch, Patrick M., Lee, Chang Won, Hoang, Thiem, Hasegawa, Tetsuo, Kwon, Woojin, Bastien, Pierre, Eswaraiah, Chakali, Wang, Jia-Wei, Kim, Kyoung Hee, Hwang, Jihye, Soam, Archana, Lyo, A-Ran, Liu, Junhao, Gouellec, Valentin J. M. Le, Arzoumanian, Doris, Whitworth, Anthony, Di Francesco, James, Poidevin, Frederick, Liu, Tie, Coude, Simon, Tahani, Mehrnoosh, Liu, Hong-Li, Onaka, Takashi, Li, Dalei, Tamura, Motohide, Chen, Zhiwei, Tang, Xindi, Kirchschlager, Florian, Bourke, Tyler L., Byun, Do-Young, Chen, Mike, Chen, Huei-Ru Vivien, Chen, Wen Ping, Cho, Jungyeon, Choi, Yunhee, Choi, Youngwoo, Choi, Minho, Chrysostomou, Antonio, Chung, Eun Jung, Dai, Y. Sophia, Diep, Pham Ngoc, Doi, Yasuo, Duan, Yan, Duan, Hao-Yuan, Eden, David, Fanciullo, Lapo, Fiege, Jason, Fissel, Laura M., Franzmann, Erica, Friberg, Per, Friesen, Rachel, Fuller, Gary, Gledhill, Tim, Graves, Sarah, Greaves, Jane, Griffin, Matt, Gu, Qilao, Han, Ilseung, Hayashi, Saeko, Houde, Martin, Hull, Charles L. H., Inoue, Tsuyoshi, Inutsuka, Shu-ichiro, Iwasaki, Kazunari, Jeong, Il-Gyo, Konyves, Vera, Kang, Ji-hyun, Kang, Miju, Karoly, Janik, Kataoka, Akimasa, Kawabata, Koji, Kemper, Francisca, Kim, Jongsoo, Kim, Mi-Ryang, Kim, Shinyoung, Kim, Hyosung, Kim, Kee-Tae, Kim, Gwanjeong, Kirk, Jason, Kobayashi, Masato I. N., Kusune, Takayoshi, Kwon, Jungmi, Lacaille, Kevin, Law, Chi-Yan, Lee, Sang-Sung, Lee, Hyeseung, Lee, Jeong-Eun, Lee, Chin-Fei, Lee, Yong-Hee, Li, Guangxing, Li, Hua-bai, Lin, Sheng-Jun, Liu, Sheng-Yuan, Lu, Xing, Mairs, Steve, Matsumura, Masafumi, Matthews, Brenda, Moriarty-Schieven, Gerald, Nagata, Tetsuya, Nakamura, Fumitaka, Nakanishi, Hiroyuki, Ngoc, Nguyen Bich, Ohashi, Nagayoshi, Park, Geumsook, Parsons, Harriet, Peretto, Nicolas, Priestley, Felix, Pyo, Tae-Soo, Qian, Lei, Rao, Ramprasad, Rawlings, Mark, Rawlings, Jonathan, Retter, Brendan, Richer, John, Rigby, Andrew, Sadavoy, Sarah, Saito, Hiro, Savini, Giorgio, Seta, Masumichi, Shimajiri, Yoshito, Shinnaga, Hiroko, Tang, Ya-Wen, Tomisaka, Kohji, Tram, Le Ngoc, Tsukamoto, Yusuke, Viti, Serena, Wang, Hongchi, Wu, Jintai, Xie, Jinjin, Yang, Meng-Zhe, Yen, Hsi-Wei, Yoo, Hyunju, Yuan, Jinghua, Yun, Hyeong-Sik, Zenko, Tetsuya, Zhang, Chuan-Peng, Zhang, Yapeng, Zhang, Guoyin, Zhou, Jianjun, Zhu, Lei, de Looze, Ilse, Andre, Philippe, Dowell, C. Darren, Eyres, Stewart, Falle, Sam, Robitaille, Jean-Francois, and van Loo, Sven
Subjects: Astrophysics - Astrophysics of Galaxies, Astrophysics - Solar and Stellar Astrophysics
Abstract: We present 850 $\mu$m dust polarization observations of the massive DR21 filament from the B-fields In STar-forming Region Observations (BISTRO) survey, using the POL-2 polarimeter and the SCUBA-2 camera on the James Clerk Maxwell Telescope. We detect ordered magnetic fields perpendicular to the parsec-scale ridge of the DR21 main filament. In the sub-filaments, the magnetic fields are mainly parallel to the filamentary structures and smoothly connect to the magnetic fields of the main filament. We compare the POL-2 and Planck dust polarization observations to study the magnetic field structures of the DR21 filament on 0.1--10 pc scales. The magnetic fields revealed in the Planck data are well aligned with those of the POL-2 data, indicating a smooth variation of magnetic fields from large to small scales. The plane-of-sky magnetic field strengths derived from angular dispersion functions of dust polarization are 0.6--1.0 mG in the DR21 filament and $\sim$ 0.1 mG in the surrounding ambient gas. The mass-to-flux ratios are found to be magnetically supercritical in the filament and slightly subcritical to nearly critical in the ambient gas. The alignment between column density structures and magnetic fields changes from random alignment in the low-density ambient gas probed by Planck to mostly perpendicular in the high-density main filament probed by JCMT. The magnetic field structures of the DR21 filament are in agreement with MHD simulations of a strongly magnetized medium, suggesting that magnetic fields play an important role in shaping the DR21 main filament and sub-filaments., Comment: 26 pages, 13 figures, ApJ accepted
Published: 2022
Full Text: View/download PDF