Author: "Duan, Haoran" / Publication Year Range: This year - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Duan, Haoran"' showing total 36 results

Start Over Author "Duan, Haoran" Publication Year Range This year

36 results on '"Duan, Haoran"'

1. Dynamic Label Adversarial Training for Deep Learning Robustness Against Adversarial Attacks

Author: Liu, Zhenyu, Duan, Haoran, Liang, Huizhi, Long, Yang, Snasel, Vaclav, Nicosia, Guiseppe, Ranjan, Rajiv, and Ojha, Varun
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Adversarial training is one of the most effective methods for enhancing model robustness. Recent approaches incorporate adversarial distillation in adversarial training architectures. However, we notice two scenarios of defense methods that limit their performance: (1) Previous methods primarily use static ground truth for adversarial training, but this often causes robust overfitting; (2) The loss functions are either Mean Squared Error or KL-divergence leading to a sub-optimal performance on clean accuracy. To solve those problems, we propose a dynamic label adversarial training (DYNAT) algorithm that enables the target model to gradually and dynamically gain robustness from the guide model's decisions. Additionally, we found that a budgeted dimension of inner optimization for the target model may contribute to the trade-off between clean accuracy and robust accuracy. Therefore, we propose a novel inner optimization method to be incorporated into the adversarial training. This will enable the target model to adaptively search for adversarial examples based on dynamic labels from the guiding model, contributing to the robustness of the target model. Extensive experiments validate the superior performance of our approach.
Published: 2024

2. Prototype Correlation Matching and Class-Relation Reasoning for Few-Shot Medical Image Segmentation

Author: Zhang, Yumin, Li, Hongliu, Gao, Yajun, Duan, Haoran, Huang, Yawen, and Zheng, Yefeng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Few-shot medical image segmentation has achieved great progress in improving accuracy and efficiency of medical analysis in the biomedical imaging field. However, most existing methods cannot explore inter-class relations among base and novel medical classes to reason unseen novel classes. Moreover, the same kind of medical class has large intra-class variations brought by diverse appearances, shapes and scales, thus causing ambiguous visual characterization to degrade generalization performance of these existing methods on unseen novel classes. To address the above challenges, in this paper, we propose a \underline{\textbf{P}}rototype correlation \underline{\textbf{M}}atching and \underline{\textbf{C}}lass-relation \underline{\textbf{R}}easoning (i.e., \textbf{PMCR}) model. The proposed model can effectively mitigate false pixel correlation matches caused by large intra-class variations while reasoning inter-class relations among different medical classes. Specifically, in order to address false pixel correlation match brought by large intra-class variations, we propose a prototype correlation matching module to mine representative prototypes that can characterize diverse visual information of different appearances well. We aim to explore prototype-level rather than pixel-level correlation matching between support and query features via optimal transport algorithm to tackle false matches caused by intra-class variations. Meanwhile, in order to explore inter-class relations, we design a class-relation reasoning module to segment unseen novel medical objects via reasoning inter-class relations between base and novel classes. Such inter-class relations can be well propagated to semantic encoding of local query features to improve few-shot segmentation performance. Quantitative comparisons illustrates the large performance improvement of our model over other baseline methods.
Published: 2024

3. Wearable-based behaviour interpolation for semi-supervised human activity recognition

Author: Duan, Haoran, Wang, Shidong, Ojha, Varun, Wang, Shizheng, Huang, Yawen, Long, Yang, Ranjan, Rajiv, and Zheng, Yefeng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: While traditional feature engineering for Human Activity Recognition (HAR) involves a trial-anderror process, deep learning has emerged as a preferred method for high-level representations of sensor-based human activities. However, most deep learning-based HAR requires a large amount of labelled data and extracting HAR features from unlabelled data for effective deep learning training remains challenging. We, therefore, introduce a deep semi-supervised HAR approach, MixHAR, which concurrently uses labelled and unlabelled activities. Our MixHAR employs a linear interpolation mechanism to blend labelled and unlabelled activities while addressing both inter- and intra-activity variability. A unique challenge identified is the activityintrusion problem during mixing, for which we propose a mixing calibration mechanism to mitigate it in the feature embedding space. Additionally, we rigorously explored and evaluated the five conventional/popular deep semi-supervised technologies on HAR, acting as the benchmark of deep semi-supervised HAR. Our results demonstrate that MixHAR significantly improves performance, underscoring the potential of deep semi-supervised techniques in HAR.
Published: 2024
Full Text: View/download PDF

4. ExactDreamer: High-Fidelity Text-to-3D Content Creation via Exact Score Matching

Author: Zhang, Yumin, Miao, Xingyu, Duan, Haoran, Wei, Bo, Shah, Tejal, Long, Yang, and Ranjan, Rajiv
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Text-to-3D content creation is a rapidly evolving research area. Given the scarcity of 3D data, current approaches often adapt pre-trained 2D diffusion models for 3D synthesis. Among these approaches, Score Distillation Sampling (SDS) has been widely adopted. However, the issue of over-smoothing poses a significant limitation on the high-fidelity generation of 3D models. To address this challenge, LucidDreamer replaces the Denoising Diffusion Probabilistic Model (DDPM) in SDS with the Denoising Diffusion Implicit Model (DDIM) to construct Interval Score Matching (ISM). However, ISM inevitably inherits inconsistencies from DDIM, causing reconstruction errors during the DDIM inversion process. This results in poor performance in the detailed generation of 3D objects and loss of content. To alleviate these problems, we propose a novel method named Exact Score Matching (ESM). Specifically, ESM leverages auxiliary variables to mathematically guarantee exact recovery in the DDIM reverse process. Furthermore, to effectively capture the dynamic changes of the original and auxiliary variables, the LoRA of a pre-trained diffusion model implements these exact paths. Extensive experiments demonstrate the effectiveness of ESM in text-to-3D generation, particularly highlighting its superiority in detailed generation.
Published: 2024

5. Rehearsal-free Federated Domain-incremental Learning

Author: Sun, Rui, Duan, Haoran, Dong, Jiahua, Ojha, Varun, Shah, Tejal, and Ranjan, Rajiv
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: We introduce a rehearsal-free federated domain incremental learning framework, RefFiL, based on a global prompt-sharing paradigm to alleviate catastrophic forgetting challenges in federated domain-incremental learning, where unseen domains are continually learned. Typical methods for mitigating forgetting, such as the use of additional datasets and the retention of private data from earlier tasks, are not viable in federated learning (FL) due to devices' limited resources. Our method, RefFiL, addresses this by learning domain-invariant knowledge and incorporating various domain-specific prompts from the domains represented by different FL participants. A key feature of RefFiL is the generation of local fine-grained prompts by our domain adaptive prompt generator, which effectively learns from local domain knowledge while maintaining distinctive boundaries on a global scale. We also introduce a domain-specific prompt contrastive learning loss that differentiates between locally generated prompts and those from other domains, enhancing RefFiL's precision and effectiveness. Compared to existing methods, RefFiL significantly alleviates catastrophic forgetting without requiring extra memory space, making it ideal for privacy-sensitive and resource-constrained devices.
Published: 2024

6. Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching

Author: Miao, Xingyu, Duan, Haoran, Ojha, Varun, Song, Jun, Shah, Tejal, Long, Yang, and Ranjan, Rajiv
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this work, we propose a novel Trajectory Score Matching (TSM) method that aims to solve the pseudo ground truth inconsistency problem caused by the accumulated error in Interval Score Matching (ISM) when using the Denoising Diffusion Implicit Models (DDIM) inversion process. Unlike ISM which adopts the inversion process of DDIM to calculate on a single path, our TSM method leverages the inversion process of DDIM to generate two paths from the same starting point for calculation. Since both paths start from the same starting point, TSM can reduce the accumulated error compared to ISM, thus alleviating the problem of pseudo ground truth inconsistency. TSM enhances the stability and consistency of the model's generated paths during the distillation process. We demonstrate this experimentally and further show that ISM is a special case of TSM. Furthermore, to optimize the current multi-stage optimization process from high-resolution text to 3D generation, we adopt Stable Diffusion XL for guidance. In response to the issues of abnormal replication and splitting caused by unstable gradients during the 3D Gaussian splatting process when using Stable Diffusion XL, we propose a pixel-by-pixel gradient clipping method. Extensive experiments show that our model significantly surpasses the state-of-the-art models in terms of visual quality and performance. Code: \url{https://github.com/xingy038/Dreamer-XL}.
Published: 2024

7. From Sora What We Can See: A Survey of Text-to-Video Generation

Author: Sun, Rui, Zhang, Yumin, Shah, Tejal, Sun, Jiahao, Zhang, Shuoying, Li, Wenqi, Duan, Haoran, Wei, Bo, and Ranjan, Rajiv
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: With impressive achievements made, artificial intelligence is on the path forward to artificial general intelligence. Sora, developed by OpenAI, which is capable of minute-level world-simulative abilities can be considered as a milestone on this developmental path. However, despite its notable successes, Sora still encounters various obstacles that need to be resolved. In this survey, we embark from the perspective of disassembling Sora in text-to-video generation, and conducting a comprehensive review of literature, trying to answer the question, \textit{From Sora What We Can See}. Specifically, after basic preliminaries regarding the general algorithms are introduced, the literature is categorized from three mutually perpendicular dimensions: evolutionary generators, excellent pursuit, and realistic panorama. Subsequently, the widely used datasets and metrics are organized in detail. Last but more importantly, we identify several challenges and open problems in this domain and propose potential future directions for research and development., Comment: A comprehensive list of text-to-video generation studies in this survey is available at https://github.com/soraw-ai/Awesome-Text-to-Video-Generation
Published: 2024

8. Sentinel-Guided Zero-Shot Learning: A Collaborative Paradigm without Real Data Exposure

Author: Wan, Fan, Miao, Xingyu, Duan, Haoran, Deng, Jingjing, Gao, Rui, and Long, Yang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: With increasing concerns over data privacy and model copyrights, especially in the context of collaborations between AI service providers and data owners, an innovative SG-ZSL paradigm is proposed in this work. SG-ZSL is designed to foster efficient collaboration without the need to exchange models or sensitive data. It consists of a teacher model, a student model and a generator that links both model entities. The teacher model serves as a sentinel on behalf of the data owner, replacing real data, to guide the student model at the AI service provider's end during training. Considering the disparity of knowledge space between the teacher and student, we introduce two variants of the teacher model: the omniscient and the quasi-omniscient teachers. Under these teachers' guidance, the student model seeks to match the teacher model's performance and explores domains that the teacher has not covered. To trade off between privacy and performance, we further introduce two distinct security-level training protocols: white-box and black-box, enhancing the paradigm's adaptability. Despite the inherent challenges of real data absence in the SG-ZSL paradigm, it consistently outperforms in ZSL and GZSL tasks, notably in the white-box protocol. Our comprehensive evaluation further attests to its robustness and efficiency across various setups, including stringent black-box training protocol.
Published: 2024

9. Sustainable wastewater management through nitrogen-cycling microorganisms

Author: Liu, Tao, Duan, Haoran, Lücker, Sebastian, Zheng, Min, Daims, Holger, Yuan, Zhiguo, and Guo, Jianhua
Published: 2024
Full Text: View/download PDF

10. Pixel Sentence Representation Learning

Author: Xiao, Chenghao, Huang, Zhuoxu, Chen, Danlu, Hudson, G Thomas, Li, Yizhi, Duan, Haoran, Lin, Chenghua, Fu, Jie, Han, Jungong, and Moubayed, Noura Al
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition
Abstract: Pretrained language models are long known to be subpar in capturing sentence and document-level semantics. Though heavily investigated, transferring perturbation-based methods from unsupervised visual representation learning to NLP remains an unsolved problem. This is largely due to the discreteness of subword units brought by tokenization of language models, limiting small perturbations of inputs to form semantics-preserved positive pairs. In this work, we conceptualize the learning of sentence-level textual semantics as a visual representation learning process. Drawing from cognitive and linguistic sciences, we introduce an unsupervised visual sentence representation learning framework, employing visually-grounded text perturbation methods like typos and word order shuffling, resonating with human cognitive patterns, and enabling perturbation to texts to be perceived as continuous. Our approach is further bolstered by large-scale unsupervised topical alignment training and natural language inference supervision, achieving comparable performance in semantic textual similarity (STS) to existing state-of-the-art NLP methods. Additionally, we unveil our method's inherent zero-shot cross-lingual transferability and a unique leapfrogging pattern across languages during iterative training. To our knowledge, this is the first representation learning method devoid of traditional language models for understanding sentence and document semantics, marking a stride closer to human-like textual comprehension. Our code is available at https://github.com/gowitheflow-1998/Pixel-Linguist
Published: 2024

11. ConRF: Zero-shot Stylization of 3D Scenes with Conditioned Radiation Fields

Author: Miao, Xingyu, Bai, Yang, Duan, Haoran, Wan, Fan, Huang, Yawen, Long, Yang, and Zheng, Yefeng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Most of the existing works on arbitrary 3D NeRF style transfer required retraining on each single style condition. This work aims to achieve zero-shot controlled stylization in 3D scenes utilizing text or visual input as conditioning factors. We introduce ConRF, a novel method of zero-shot stylization. Specifically, due to the ambiguity of CLIP features, we employ a conversion process that maps the CLIP feature space to the style space of a pre-trained VGG network and then refine the CLIP multi-modal knowledge into a style transfer neural radiation field. Additionally, we use a 3D volumetric representation to perform local style transfer. By combining these operations, ConRF offers the capability to utilize either text or images as references, resulting in the generation of sequences with novel views enhanced by global or local stylization. Our experiment demonstrates that ConRF outperforms other existing methods for 3D scene and single-text stylization in terms of visual quality.
Published: 2024

12. CTNeRF: Cross-Time Transformer for Dynamic Neural Radiance Field from Monocular Video

Author: Miao, Xingyu, Bai, Yang, Duan, Haoran, Huang, Yawen, Wan, Fan, Long, Yang, and Zheng, Yefeng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The goal of our work is to generate high-quality novel views from monocular videos of complex and dynamic scenes. Prior methods, such as DynamicNeRF, have shown impressive performance by leveraging time-varying dynamic radiation fields. However, these methods have limitations when it comes to accurately modeling the motion of complex objects, which can lead to inaccurate and blurry renderings of details. To address this limitation, we propose a novel approach that builds upon a recent generalization NeRF, which aggregates nearby views onto new viewpoints. However, such methods are typically only effective for static scenes. To overcome this challenge, we introduce a module that operates in both the time and frequency domains to aggregate the features of object motion. This allows us to learn the relationship between frames and generate higher-quality images. Our experiments demonstrate significant improvements over state-of-the-art methods on dynamic scene datasets. Specifically, our approach outperforms existing methods in terms of both the accuracy and visual quality of the synthesized views. Our code is available on https://github.com/xingy038/CTNeRF., Comment: Accepted by Pattern Recognition
Published: 2024
Full Text: View/download PDF

13. Promoting patient-centered care in CAR-T therapy for hematologic malignancy: a qualitative meta-synthesis

Author: Xie, Caiqin, Duan, Haoran, Liu, Hui, Wang, Yunhua, Sun, Zhuanyi, and Lan, Meijuan
Published: 2024
Full Text: View/download PDF

14. Mainstream short-cut N removal modelling: current status and perspectives

Author: Kirim, Gamze, primary, McCullough, Kester, additional, Bressani-Ribeiro, Thiago, additional, Domingo-Félez, Carlos, additional, Duan, Haoran, additional, Al-Omari, Ahmed, additional, De Clippeleir, Haydee, additional, Jimenez, Jose, additional, Klaus, Stephanie, additional, Ladipo-Obasa, Mojolaoluwa, additional, Mehrani, Mohamad-Javad, additional, Regmi, Pusker, additional, Torfs, Elena, additional, Volcke, Eveline I. P., additional, and Vanrolleghem, Peter A., additional
Published: 2024
Full Text: View/download PDF

15. Node and edge dual-masked self-supervised graph representation

Author: Tang, Peng, Xie, Cheng, and Duan, Haoran
Published: 2024
Full Text: View/download PDF

16. Dynamic visual-guided selection for zero-shot learning

Author: Zhou, Yuan, Xiang, Lei, Liu, Fan, Duan, Haoran, and Long, Yang
Published: 2024
Full Text: View/download PDF

17. Removal of emerging contaminants from wastewater by various treatment technologies in wastewater treatment plants

Author: Rousis, Nikolaos I., primary, Bade, Richard, additional, Dewapriya, Pradeep, additional, Li, Jiaying, additional, Duan, Haoran, additional, and Thomaidis, Nikolaos S., additional
Published: 2024
Full Text: View/download PDF

18. Impact of nutrient deficiency on biological sewage treatment – Perspectives towards urine source segregation

Author: Chen, Chee Xiang, Koskue, Veera, Duan, Haoran, Gao, Li, Shon, Ho Kyong, Martin, Gregory J.O., Chen, George Q., and Freguia, Stefano
Published: 2024
Full Text: View/download PDF

19. Lactobacillus inoculation mediated carboxylates and alcohols production from waste activated sludge fermentation system: Insight into process outcomes and metabolic network

Author: Wu, Lan, Ngo, Huu Hao, Wang, Chen, Hou, Yanan, Chen, Xueming, Guo, Wenshan, Duan, Haoran, Ni, Bing-Jie, and Wei, Wei
Published: 2024
Full Text: View/download PDF

20. Making waves: Knowledge and data fusion in urban water modelling

Author: Duan, Haoran, Li, Jiuling, and Yuan, Zhiguo
Published: 2024
Full Text: View/download PDF

21. Meta-path and hypergraph fused distillation framework for heterogeneous information networks embedding

Author: Yu, Beibei, Xie, Cheng, Cai, Hongming, Duan, Haoran, and Tang, Peng
Published: 2024
Full Text: View/download PDF

22. Occurrence, spatiotemporal trends, fate, and treatment technologies for microplastics and organic contaminants in biosolids: A review

Author: Vo, Phong H.N., Ky Le, Gia, Huy, Lai Nguyen, Zheng, Lei, Chaiwong, Chawalit, Nguyen, Nam Nhat, Nguyen, Hong T.M., Ralph, Peter J., Kuzhiumparambil, Unnikrishnan, Danaee, Soroosh, Toft, Sonja, Madsen, Craig, Kim, Mikael, Fenstermacher, Jim, Hai, Ho Truong Nam, Duan, Haoran, and Tscharke, Ben
Published: 2024
Full Text: View/download PDF

23. Contextual features online prediction for self-supervised graph representation

Author: Duan, Haoran, Xie, Cheng, Tang, Peng, and Yu, Beibei
Published: 2024
Full Text: View/download PDF

24. Wearable-based behaviour interpolation for semi-supervised human activity recognition

Author: Duan, Haoran, primary, Wang, Shidong, additional, Ojha, Varun, additional, Wang, Shizheng, additional, Huang, Yawen, additional, Long, Yang, additional, Ranjan, Rajiv, additional, and Zheng, Yefeng, additional
Published: 2024
Full Text: View/download PDF

25. Developing a conceptual model for understanding caregiving experience and their impacts on quality of life for Chinese breast cancer family caregivers: A qualitative study

Author: Gao, Chaoyue, primary, Li, Min, additional, Guo, Linfang, additional, Duan, Haoran, additional, Zhang, Peili, additional, and Ding, Yongxia, additional
Published: 2024
Full Text: View/download PDF

26. A portable frequency‐domain electromagnetic detection system.

Author: Peng, Fengjiang, Zhao, XueZhao, Zhang, Song, Duan, Haoran, Du, Shibao, Zhao, Qing, and Guo, Cheng
Subjects: TRANSMITTERS (Communication), PULSE width modulation, DIGITAL modulation, GEOPHYSICAL prospecting
Abstract: Summary: Frequency‐domain detection stands as a critical method in remote geophysical exploration; however, its practical application is constrained by the considerable size of the requisite instrumentation. In this paper, we present a compact frequency domain electromagnetic detection system. The integration of intelligent power devices and a novel hardware architecture substantially diminishes the system's dimensions, thereby enhancing its portability. Furthermore, the system design incorporates sinusoidal pulse width modulation and digital phase‐locked amplification to ensure the efficacy of the proposed system. Both the transmitter and receiver system are constrained to dimensions smaller than 1.5 m. Subsequent experimental validation attests to the exemplary hardware performance of the proposed system, with the transmitting voltage of the transmitter circuit attaining 500 V, and the receiver circuit exhibiting a sensitivity as low as 10−8 V. Significantly, this design paradigm not only facilitates the integration of transmitter and receiver systems for frequency‐domain electromagnetic detection but also introduces novel prospects for the application of frequency‐domain electromagnetic detection methods across diverse fields. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

27. Prototype Correlation Matching and Class-Relation Reasoning for Few-Shot Medical Image Segmentation

Author: Zhang, Yumin, primary, Li, Hongliu, additional, Gao, Yajun, additional, Duan, Haoran, additional, Huang, Yawen, additional, and Zheng, Yefeng, additional
Published: 2024
Full Text: View/download PDF

28. Sentinel-Guided Zero-Shot Learning: A Collaborative Paradigm without Real Data Exposure

Author: Wan, Fan, primary, Miao, Xingyu, additional, Duan, Haoran, additional, Deng, Jingjing, additional, Gao, Rui, additional, and Long, Yang, additional
Published: 2024
Full Text: View/download PDF

29. Unraveling Complexity: An Exploration into the Large-Scale Multi-Modal Signal Processing

Author: Wen, Zhenyu, primary, Ye, Yuheng, additional, Su, Jie, additional, Li, Taotao, additional, Wan, Jinhao, additional, Zheng, Shilian, additional, Hong, Zhen, additional, He, Shibo, additional, Duan, Haoran, additional, Li, Yuexiang, additional, Huang, Yawen, additional, and Zheng, Yefeng, additional
Published: 2024
Full Text: View/download PDF

30. Rules for Expectation: Learning to Generate Rules via Social Environment Modeling

Author: Pu, Jiyao, Duan, Haoran, Zhao, Junzhe, and Long, Yang
Abstract: The evolution of natural life is guided by a perpetually adaptive set of rules, encompassing natural laws, human policies, and game mechanics. Automated game design, through the creation of simulated environments populated by AI agents, embodies these rules, aligning with the objectives of artificial life research that seeks to replicate the dynamics of biological life through computational models. This paper presents a comprehensive framework, the Rule Generation Networks (RGN), devised for automated rule design, evaluation, and evolution in line with controllable expectations. We refine and formalize three cardinal elements - rules, strategies, and evaluation - to elucidate the intricate relationships inherent in rule generation tasks. The RGN integrates generative neural networks for rule design and a suite of reinforcement learning models for rule evaluation. To exemplify rule evolution and adaptation across varying environments, we introduce a controllability metric to gauge game dynamics and evolve the rule designer accordingly. Furthermore, we develop two game environments, Maze Run and Trust Evolution, modelling human exploration and societal trade dynamics, to gamify and evaluate the generated rules.
Published: 2024
Full Text: View/download PDF

31. Chapter 19 - Removal of emerging contaminants from wastewater by various treatment technologies in wastewater treatment plants

Author: Rousis, Nikolaos I., Bade, Richard, Dewapriya, Pradeep, Li, Jiaying, Duan, Haoran, and Thomaidis, Nikolaos S.
Published: 2024
Full Text: View/download PDF

32. MRL-Seg: Overcoming Imbalance in Medical Image Segmentation With Multi-Step Reinforcement Learning

Author: Yang, Feiyang, Li, Xiongfei, Duan, Haoran, Xu, Feilong, Huang, Yawen, Zhang, Xiaoli, Long, Yang, and Zheng, Yefeng
Abstract: Medical image segmentation is a critical task for clinical diagnosis and research. However, dealing with highly imbalanced data remains a significant challenge in this domain, where the region of interest (ROI) may exhibit substantial variations across different slices. This presents a significant hurdle to medical image segmentation, as conventional segmentation methods may either overlook the minority class or overly emphasize the majority class, ultimately leading to a decrease in the overall generalization ability of the segmentation results. To overcome this, we propose a novel approach based on multi-step reinforcement learning, which integrates prior knowledge of medical images and pixel-wise segmentation difficulty into the reward function. Our method treats each pixel as an individual agent, utilizing diverse actions to evaluate its relevance for segmentation. To validate the effectiveness of our approach, we conduct experiments on four imbalanced medical datasets, and the results show that our approach surpasses other state-of-the-art methods in highly imbalanced scenarios. These findings hold substantial implications for clinical diagnosis and research.
Published: 2024
Full Text: View/download PDF

33. List of contributors

Author: Abdi, Jafar, Abedini, Hassan, Afsharpour, Maryam, Ahamadi, Shabnam, Allahkarami, Esmaeil, Allahkarami, Ebrahim, Aravamudan, Kannan, Asfaram, Arash, Ayeleru, O.O., Bade, Richard, Baharinikoo, Leila, Benettayeb, Asmaa, Bhoje, Rutuja, Bhushan, Brij, Bilińska, Lucyna, Chaudhary, Monika, Chaudhary, Shubham, Dehghani, Mohammad Hadi, Delfani, Fatemeh, Devendra, Rai, Dewapriya, Pradeep, Duan, Haoran, Fazeli, Hossein, Fuentes-Ramirez, Rosalba, Garcia-Huante, Yolanda G., Gaur, Rama, Ghosh, Asim K., Ghosh, Soumya, Gmurek, Marta, Haddou, Boumediene, Hashmi, Zubair, Hosseini, Mohammadmehdi, Jatoi, Abdul Sattar, Javadian, Hamedreza, Karchiyappan, Thirugnanasambandham, Karri, Rama Rao, Kotnala, Shreya, Kum, Soyoon, Kumar, Ajay, Kumar, Ankur, Li, Jiaying, Malbenia John, Masamvu, Mariam, Ezrah, Mazari, Shaukat Ali, Mazloom, Golshan, Mehdi, Saif Ul, Mendez-Alvarado, Lorena N., Mitchel, Chitepo Ruvimbo, Modekwe, H.U., Monfared, Abolfazl Dehghan, Mubarak, Nabisab Mujawar, Nasr, Mahmoud, Nayak, Arunima, Olubambi, P.A., Ordaz, Alberto, Parkash, Anand, Patel, Jinal, Peralta-Hernandez, Juan M., Ponce, Lorena Cornejo, Rajendran, Saravanan, Ramirez, Irwing, Rezai, Bahram, Rodriguez-Narvaez, Oscar M., Rousis, Nikolaos I., Rowles, Lewis S., Samy, Mahmoud, Sayyaadi, Hoseyn, Shahabuddin, Syed, Shahverdian, Mohammad Hassan, Shishir, Sinha, Shojaei, Saeed, Shojaei, Siroos, Sohani, Ali, Srinivasrao Naik, B., Suhas, Suresh, R., Tevatia, Prashant, Thomaidis, Nikolaos S., Tyagi, Inderjeet, Villaseñor-Basulto, Déborah L., Yanardağ, Duygu, and Zamani Pedram, Mona
Published: 2024
Full Text: View/download PDF

34. Life cycle assessment of hydrogenotrophic denitrification in membrane aerated biofilm reactors for sustainable wastewater treatment.

Author: Li R, Lu MY, Guo RB, Duan H, Ni BJ, and Fu SF
Abstract: The conventional anaerobic-anoxic-oxic (AAO) process for wastewater treatment is associated with high energy consumption and pollutant emissions due to its reliance on heterotrophic denitrification. In contrast, membrane aerated biofilm reactors (MABR) coupled with hydrogenotrophic denitrification (H 2 -MABR) offers a more promising alternative. This study conducts a life cycle assessment (LCA) to evaluate the environmental and economic benefits of H 2 -MABR compared to traditional AAO processes. Results indicate that even with a limited reactor life, the application of MABR in actual wastewater treatment plants can yield over 30 % reduction in environmental and economic impacts. Using CO 2 from biogas as a carbon source significantly reduces carbon emissions during the anaerobic stage, while the efficient nitrogen removal minimizes the need for wastewater recirculation and electricity consumption. The H 2 -driven denitrification process also avoids emissions and secondary pollution risks associated with organic electron donors. Furthermore, coupling H 2 -MABR with renewable energy source and Power-to-Gas technology further enhances sustainability by ensuring a stable hydrogen supply. Given the significant potential of H 2 -MABR for improving wastewater treatment, further research and large-scale implementation are highly encouraged., Competing Interests: Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2024. Published by Elsevier Ltd.)
Published: 2024
Full Text: View/download PDF

35. A Semantic-Consistent Few-Shot Modulation Recognition Framework for IoT Applications.

Author: Su J, Sun P, Jiang Y, Wen Z, Guo F, Wu Y, Hong Z, Duan H, Huang Y, Ranjan R, and Zheng Y
Abstract: The rapid growth of the Internet of Things (IoT) has led to the widespread adoption of the IoT networks in numerous digital applications. To counter physical threats in these systems, automatic modulation classification (AMC) has emerged as an effective approach for identifying the modulation format of signals in noisy environments. However, identifying those threats can be particularly challenging due to the scarcity of labeled data, which is a common issue in various IoT applications, such as anomaly detection for unmanned aerial vehicles (UAVs) and intrusion detection in the IoT networks. Few-shot learning (FSL) offers a promising solution by enabling models to grasp the concepts of new classes using only a limited number of labeled samples. However, prevalent FSL techniques are primarily tailored for tasks in the computer vision domain and are not suitable for the wireless signal domain. Instead of designing a new FSL model, this work suggests a novel approach that enhances wireless signals to be more efficiently processed by the existing state-of-the-art (SOTA) FSL models. We present the semantic-consistent signal pretransformation (ScSP), a parameterized transformation architecture that ensures signals with identical semantics exhibit similar representations. ScSP is designed to integrate seamlessly with various SOTA FSL models for signal modulation recognition and supports commonly used deep learning backbones. Our evaluation indicates that ScSP boosts the performance of numerous SOTA FSL models, while preserving flexibility.
Published: 2024
Full Text: View/download PDF

36. UniHead: Unifying Multi-Perception for Detection Heads.

Author: Zhou H, Yang R, Zhang Y, Duan H, Huang Y, Hu R, Li X, and Zheng Y
Abstract: The detection head constitutes a pivotal component within object detectors, tasked with executing both classification and localization functions. Regrettably, the commonly used parallel head often lacks omni perceptual capabilities, such as deformation perception (DP), global perception (GP), and cross-task perception (CTP). Despite numerous methods attempting to enhance these abilities from a single aspect, achieving a comprehensive and unified solution remains a significant challenge. In response to this challenge, we develop an innovative detection head, termed UniHead, to unify three perceptual abilities simultaneously. More precisely, our approach: 1) introduces DP, enabling the model to adaptively sample object features; 2) proposes a dual-axial aggregation transformer (DAT) to adeptly model long-range dependencies, thereby achieving GP; and 3) devises a cross-task interaction transformer (CIT) that facilitates interaction between the classification and localization branches, thus aligning the two tasks. As a plug-and-play method, the proposed UniHead can be conveniently integrated with existing detectors. Extensive experiments on the COCO dataset demonstrate that our UniHead can bring significant improvements to many detectors. For instance, the UniHead can obtain + 2.7 AP gains in RetinaNet, + 2.9 AP gains in FreeAnchor, and + 2.1 AP gains in GFL. The code is available at https://github.com/zht8506/UniHead.
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

36 results on '"Duan, Haoran"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources