Author: "Liu, Haozhe" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Liu, Haozhe"' showing total 518 results

Start Over Author "Liu, Haozhe"

518 results on '"Liu, Haozhe"'

1. Adaptive Caching for Faster Video Generation with Diffusion Transformers

Author: Kahatapitiya, Kumara, Liu, Haozhe, He, Sen, Liu, Ding, Jia, Menglin, Zhang, Chenyang, Ryoo, Michael S., and Xie, Tian
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Generating temporally-consistent high-fidelity videos can be computationally expensive, especially over longer temporal spans. More-recent Diffusion Transformers (DiTs) -- despite making significant headway in this context -- have only heightened such challenges as they rely on larger models and heavier attention mechanisms, resulting in slower inference speeds. In this paper, we introduce a training-free method to accelerate video DiTs, termed Adaptive Caching (AdaCache), which is motivated by the fact that "not all videos are created equal": meaning, some videos require fewer denoising steps to attain a reasonable quality than others. Building on this, we not only cache computations through the diffusion process, but also devise a caching schedule tailored to each video generation, maximizing the quality-latency trade-off. We further introduce a Motion Regularization (MoReg) scheme to utilize video information within AdaCache, essentially controlling the compute allocation based on motion content. Altogether, our plug-and-play contributions grant significant inference speedups (e.g. up to 4.7x on Open-Sora 720p - 2s video generation) without sacrificing the generation quality, across multiple video DiT baselines., Comment: Project-page is available at https://adacache-dit.github.io
Published: 2024

2. MarDini: Masked Autoregressive Diffusion for Video Generation at Scale

Author: Liu, Haozhe, Liu, Shikun, Zhou, Zijian, Xu, Mengmeng, Xie, Yanping, Han, Xiao, Pérez, Juan C., Liu, Ding, Kahatapitiya, Kumara, Jia, Menglin, Wu, Jui-Chieh, He, Sen, Xiang, Tao, Schmidhuber, Jürgen, and Pérez-Rúa, Juan-Manuel
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: We introduce MarDini, a new family of video diffusion models that integrate the advantages of masked auto-regression (MAR) into a unified diffusion model (DM) framework. Here, MAR handles temporal planning, while DM focuses on spatial generation in an asymmetric network design: i) a MAR-based planning model containing most of the parameters generates planning signals for each masked frame using low-resolution input; ii) a lightweight generation model uses these signals to produce high-resolution frames via diffusion de-noising. MarDini's MAR enables video generation conditioned on any number of masked frames at any frame positions: a single model can handle video interpolation (e.g., masking middle frames), image-to-video generation (e.g., masking from the second frame onward), and video expansion (e.g., masking half the frames). The efficient design allocates most of the computational resources to the low-resolution planning model, making computationally expensive but important spatio-temporal attention feasible at scale. MarDini sets a new state-of-the-art for video interpolation; meanwhile, within few inference steps, it efficiently generates videos on par with those of much more expensive advanced image-to-video models., Comment: Project Page: https://mardini-vidgen.github.io
Published: 2024

3. Highway Reinforcement Learning

Author: Wang, Yuhui, Strupl, Miroslav, Faccio, Francesco, Wu, Qingyuan, Liu, Haozhe, Grudzień, Michał, Tan, Xiaoyang, and Schmidhuber, Jürgen
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Learning from multi-step off-policy data collected by a set of policies is a core problem of reinforcement learning (RL). Approaches based on importance sampling (IS) often suffer from large variances due to products of IS ratios. Typical IS-free methods, such as $n$-step Q-learning, look ahead for $n$ time steps along the trajectory of actions (where $n$ is called the lookahead depth) and utilize off-policy data directly without any additional adjustment. They work well for proper choices of $n$. We show, however, that such IS-free methods underestimate the optimal value function (VF), especially for large $n$, restricting their capacity to efficiently utilize information from distant future time steps. To overcome this problem, we introduce a novel, IS-free, multi-step off-policy method that avoids the underestimation issue and converges to the optimal VF. At its core lies a simple but non-trivial \emph{highway gate}, which controls the information flow from the distant future by comparing it to a threshold. The highway gate guarantees convergence to the optimal VF for arbitrary $n$ and arbitrary behavioral policies. It gives rise to a novel family of off-policy RL algorithms that safely learn even when $n$ is very large, facilitating rapid credit assignment from the far future to the past. On tasks with greatly delayed rewards, including video games where the reward is given only at the end of the game, our new methods outperform many existing multi-step off-policy algorithms.
Published: 2024

4. Lazy Layers to Make Fine-Tuned Diffusion Models More Traceable

Author: Liu, Haozhe, Zhang, Wentian, Li, Bing, Ghanem, Bernard, and Schmidhuber, Jürgen
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Cryptography and Security
Abstract: Foundational generative models should be traceable to protect their owners and facilitate safety regulation. To achieve this, traditional approaches embed identifiers based on supervisory trigger-response signals, which are commonly known as backdoor watermarks. They are prone to failure when the model is fine-tuned with nontrigger data. Our experiments show that this vulnerability is due to energetic changes in only a few 'busy' layers during fine-tuning. This yields a novel arbitrary-in-arbitrary-out (AIAO) strategy that makes watermarks resilient to fine-tuning-based removal. The trigger-response pairs of AIAO samples across various neural network depths can be used to construct watermarked subpaths, employing Monte Carlo sampling to achieve stable verification results. In addition, unlike the existing methods of designing a backdoor for the input/output space of diffusion models, in our method, we propose to embed the backdoor into the feature space of sampled subpaths, where a mask-controlled trigger function is proposed to preserve the generation performance and ensure the invisibility of the embedded backdoor. Our empirical studies on the MS-COCO, AFHQ, LSUN, CUB-200, and DreamBooth datasets confirm the robustness of AIAO; while the verification rates of other trigger-based methods fall from ~90% to ~70% after fine-tuning, those of our method remain consistently above 90%.
Published: 2024

5. Faster Diffusion via Temporal Attention Decomposition

Author: Liu, Haozhe, Zhang, Wentian, Xie, Jinheng, Faccio, Francesco, Xu, Mengmeng, Xiang, Tao, Shou, Mike Zheng, Perez-Rua, Juan-Manuel, and Schmidhuber, Jürgen
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We explore the role of attention mechanism during inference in text-conditional diffusion models. Empirical observations suggest that cross-attention outputs converge to a fixed point after several inference steps. The convergence time naturally divides the entire inference process into two phases: an initial phase for planning text-oriented visual semantics, which are then translated into images in a subsequent fidelity-improving phase. Cross-attention is essential in the initial phase but almost irrelevant thereafter. However, self-attention initially plays a minor role but becomes crucial in the second phase. These findings yield a simple and training-free method known as temporally gating the attention (TGATE), which efficiently generates images by caching and reusing attention outputs at scheduled time steps. Experimental results show when widely applied to various existing text-conditional diffusion models, TGATE accelerates these models by 10%-50%. The code of TGATE is available at https://github.com/HaozheLiu-ST/T-GATE.
Published: 2024

6. Fingerprint Presentation Attack Detector Using Global-Local Model

Author: Liu, Haozhe, Zhang, Wentian, Liu, Feng, Wu, Haoqian, and Shen, Linlin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The vulnerability of automated fingerprint recognition systems (AFRSs) to presentation attacks (PAs) promotes the vigorous development of PA detection (PAD) technology. However, PAD methods have been limited by information loss and poor generalization ability, resulting in new PA materials and fingerprint sensors. This paper thus proposes a global-local model-based PAD (RTK-PAD) method to overcome those limitations to some extent. The proposed method consists of three modules, called: 1) the global module; 2) the local module; and 3) the rethinking module. By adopting the cut-out-based global module, a global spoofness score predicted from nonlocal features of the entire fingerprint images can be achieved. While by using the texture in-painting-based local module, a local spoofness score predicted from fingerprint patches is obtained. The two modules are not independent but connected through our proposed rethinking module by localizing two discriminative patches for the local module based on the global spoofness score. Finally, the fusion spoofness score by averaging the global and local spoofness scores is used for PAD. Our experimental results evaluated on LivDet 2017 show that the proposed RTK-PAD can achieve an average classification error (ACE) of 2.28% and a true detection rate (TDR) of 91.19% when the false detection rate (FDR) equals 1.0%, which significantly outperformed the state-of-the-art methods by $\sim$10% in terms of TDR (91.19% versus 80.74%)., Comment: This paper was accepted by IEEE Transactions on Cybernetics. Current version is updated with minor revisions on introduction and related works
Published: 2024
Full Text: View/download PDF

7. Investigation of the entrapped micro bubble defect as a high-energy laser damage precursor in fused silica optics

Author: Hu, Shuo, Zhang, Shuai, Lu, Lihua, Liu, Haozhe, Miao, Xinxiang, and Chen, Jiaxuan
Published: 2024
Full Text: View/download PDF

8. Structure responsible for the superconducting state in La3Ni2O7 at high pressure and low temperature conditions

Author: Wang, Luhong, Li, Yan, Xie, Shengyi, Liu, Fuyang, Sun, Hualei, Huang, Caoxin, Gao, Yang, Nakagawa, Takeshi, Fu, Boyang, Dong, Bo, Cao, Zhenhui, Yu, Runze, Kawaguchi, Saori I., Kadobayashi, Hirokazu, Wang, Meng, Jin, Changqing, Mao, Ho-kwang, and Liu, Haozhe
Subjects: Condensed Matter - Superconductivity, Condensed Matter - Materials Science
Abstract: Very recently, a new superconductor with Tc = 80 K was reported in nickelate (La3Ni2O7) at around 15 - 40 GPa conditions (Nature, 621, 493, 2023) [1], which is the second type of unconventional superconductor, beside the cuprates, with Tc above liquid nitrogen temperature. However, the phase diagram plotted in this report was mostly based on the transport measurement at low temperature and high pressure conditions, and the assumed corresponding X-ray diffraction (XRD) results was carried out at room temperature. This encouraged us to carry out in situ high pressure and low temperature synchrotron XRD experiments to determine which phase is responsible for the high Tc state. In addition to the phase transition from orthorhombic Amam structure to orthorhombic Fmmm structure, a tetragonal phase with space group of I4/mmm was discovered when the sample was compressed to 19 GPa at 40 K where the superconductivity takes palce in La3Ni2O7. The calculations based on this tetragonal structure reveal that the electronic states approached to the Fermi energy were mainly dominated by the eg orbitals (3dz2 and 3dx2-y2) of Ni atoms, which are located in the oxygen octahedral crystal field. The correlation between Tc and this structural evolution, especially Ni-O octahedra regularity and the in-plane Ni-O-Ni bonding angles, are analyzed. This work sheds new lights to identify what is the most likely phase responsible for superconductivity in the double layered nickelate.
Published: 2023

9. Learning to Identify Critical States for Reinforcement Learning from Videos

Author: Liu, Haozhe, Zhuge, Mingchen, Li, Bing, Wang, Yuhui, Faccio, Francesco, Ghanem, Bernard, and Schmidhuber, Jürgen
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Recent work on deep reinforcement learning (DRL) has pointed out that algorithmic information about good policies can be extracted from offline data which lack explicit information about executed actions. For example, videos of humans or robots may convey a lot of implicit information about rewarding action sequences, but a DRL machine that wants to profit from watching such videos must first learn by itself to identify and recognize relevant states/actions/rewards. Without relying on ground-truth annotations, our new method called Deep State Identifier learns to predict returns from episodes encoded as videos. Then it uses a kind of mask-based sensitivity analysis to extract/identify important critical states. Extensive experiments showcase our method's potential for understanding and improving agent behavior. The source code and the generated datasets are available at https://github.com/AI-Initiative-KAUST/VideoRLCS., Comment: This paper was accepted to ICCV23
Published: 2023

10. Push the Boundary of SAM: A Pseudo-label Correction Framework for Medical Segmentation

Author: Huang, Ziyi, Liu, Hongshan, Zhang, Haofeng, Li, Xueshen, Liu, Haozhe, Xing, Fuyong, Laine, Andrew, Angelini, Elsa, Hendon, Christine, and Gan, Yu
Subjects: Electrical Engineering and Systems Science - Image and Video Processing
Abstract: Segment anything model (SAM) has emerged as the leading approach for zero-shot learning in segmentation tasks, offering the advantage of avoiding pixel-wise annotations. It is particularly appealing in medical image segmentation, where the annotation process is laborious and expertise-demanding. However, the direct application of SAM often yields inferior results compared to conventional fully supervised segmentation networks. An alternative approach is to use SAM as the initial stage to generate pseudo labels for further network training. However, the performance is limited by the quality of pseudo labels. In this paper, we propose a novel label correction framework to push the boundary of SAM-based segmentation. Our model utilizes a label quality evaluation module to distinguish between noisy labels and clean labels. This enables the correction of the noisy labels using an uncertainty-based self-correction module, thereby enriching the clean training set. Finally, we retrain the segmentation network with updated labels to optimize its weights for future predictions. One key advantage of our model is its ability to train deep networks using SAM-generated pseudo labels without relying on a set of expert-level annotations while attaining good segmentation performance. We demonstrate the effectiveness of our proposed model on three public datasets, indicating its ability to improve segmentation accuracy and outperform baseline methods in label correction.
Published: 2023

11. BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

Author: Xie, Jinheng, Li, Yuexiang, Huang, Yawen, Liu, Haozhe, Zhang, Wentian, Zheng, Yefeng, and Shou, Mike Zheng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent text-to-image diffusion models have demonstrated an astonishing capacity to generate high-quality images. However, researchers mainly studied the way of synthesizing images with only text prompts. While some works have explored using other modalities as conditions, considerable paired data, e.g., box/mask-image pairs, and fine-tuning time are required for nurturing models. As such paired data is time-consuming and labor-intensive to acquire and restricted to a closed set, this potentially becomes the bottleneck for applications in an open world. This paper focuses on the simplest form of user-provided conditions, e.g., box or scribble. To mitigate the aforementioned problem, we propose a training-free method to control objects and contexts in the synthesized images adhering to the given spatial conditions. Specifically, three spatial constraints, i.e., Inner-Box, Outer-Box, and Corner Constraints, are designed and seamlessly integrated into the denoising step of diffusion models, requiring no additional training and massive annotated layout data. Extensive experimental results demonstrate that the proposed constraints can control what and where to present in the images while retaining the ability of Diffusion models to synthesize with high fidelity and diverse concept coverage. The code is publicly available at https://github.com/showlab/BoxDiff., Comment: Accepted by ICCV 2023. Code is available at: https://github.com/showlab/BoxDiff
Published: 2023

12. Dynamically Masked Discriminator for Generative Adversarial Networks

Author: Zhang, Wentian, Liu, Haozhe, Li, Bing, Xie, Jinheng, Huang, Yawen, Li, Yuexiang, Zheng, Yefeng, and Ghanem, Bernard
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Training Generative Adversarial Networks (GANs) remains a challenging problem. The discriminator trains the generator by learning the distribution of real/generated data. However, the distribution of generated data changes throughout the training process, which is difficult for the discriminator to learn. In this paper, we propose a novel method for GANs from the viewpoint of online continual learning. We observe that the discriminator model, trained on historically generated data, often slows down its adaptation to the changes in the new arrival generated data, which accordingly decreases the quality of generated results. By treating the generated data in training as a stream, we propose to detect whether the discriminator slows down the learning of new knowledge in generated data. Therefore, we can explicitly enforce the discriminator to learn new knowledge fast. Particularly, we propose a new discriminator, which automatically detects its retardation and then dynamically masks its features, such that the discriminator can adaptively learn the temporally-vary distribution of generated data. Experimental results show our method outperforms the state-of-the-art approaches., Comment: Updated v2 -- NeurIPS 2023 camera ready version
Published: 2023

13. Pressure-induced shape and color changes and mechanical-stimulation-driven reverse transition in a one-dimensional hybrid halide

Author: Zhang, Die, Fu, Boyang, He, Weilong, Li, Hengtao, Liu, Fuyang, Wang, Luhong, Liu, Haozhe, Zhou, Liujiang, and Cai, Weizhao
Published: 2024
Full Text: View/download PDF

14. Mindstorms in Natural Language-Based Societies of Mind

Author: Zhuge, Mingchen, Liu, Haozhe, Faccio, Francesco, Ashley, Dylan R., Csordás, Róbert, Gopalakrishnan, Anand, Hamdi, Abdullah, Hammoud, Hasan Abed Al Kader, Herrmann, Vincent, Irie, Kazuki, Kirsch, Louis, Li, Bing, Li, Guohao, Liu, Shuming, Mai, Jinjie, Piękos, Piotr, Ramesh, Aditya, Schlag, Imanol, Shi, Weimin, Stanić, Aleksandar, Wang, Wenyi, Wang, Yuhui, Xu, Mengmeng, Fan, Deng-Ping, Ghanem, Bernard, and Schmidhuber, Jürgen
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Computer Science - Multiagent Systems, 68T07, I.2.6, I.2.11
Abstract: Both Minsky's "society of mind" and Schmidhuber's "learning to think" inspire diverse societies of large multimodal neural networks (NNs) that solve problems by interviewing each other in a "mindstorm." Recent implementations of NN-based societies of minds consist of large language models (LLMs) and other NN-based experts communicating through a natural language interface. In doing so, they overcome the limitations of single LLMs, improving multimodal zero-shot reasoning. In these natural language-based societies of mind (NLSOMs), new agents -- all communicating through the same universal symbolic language -- are easily added in a modular fashion. To demonstrate the power of NLSOMs, we assemble and experiment with several of them (having up to 129 members), leveraging mindstorms in them to solve some practical AI tasks: visual question answering, image captioning, text-to-image synthesis, 3D generation, egocentric retrieval, embodied AI, and general language-based task solving. We view this as a starting point towards much larger NLSOMs with billions of agents-some of which may be humans. And with this emergence of great societies of heterogeneous minds, many new research questions have suddenly become paramount to the future of artificial intelligence. What should be the social structure of an NLSOM? What would be the (dis)advantages of having a monarchical rather than a democratic structure? How can principles of NN economies be used to maximize the total reward of a reinforcement learning NLSOM? In this work, we identify, discuss, and try to answer some of these questions., Comment: 9 pages in main text + 7 pages of references + 38 pages of appendices, 14 figures in main text + 13 in appendices, 7 tables in appendices
Published: 2023

15. Superconducting-insulating phase transition in pressurized Ba$_{1-x}$K$_x$BiO$_3$

Author: Han, Jinyu, Zhu, Xiangde, Zhang, Jianfeng, Cai, Shu, Wang, Luhong, Gao, Yang, Liu, Fuyang, Liu, Haozhe, Kawaguchi, Saori I., Guo, Jing, Zhou, Yazhou, Zhao, Jinyu, Wang, Pengyu, Cao, Lixin, Tian, Mingliang, Wu, Qi, Xiang, Tao, and Sun, Liling
Subjects: Condensed Matter - Superconductivity
Abstract: We report the first observation of a pressure-induced transition from a superconducting (SC) to an insulating (I) phase in single-crystal Ba$_{1-x}$K$_x$BiO$_3$ ($x$ = 0.4, 0.43, 0.52, and 0.58) superconductors. X-ray diffraction measurements conducted at 20 K reveal a direct relationship between this SC-I transition and a pressure-induced distortion of crystal structure. With increasing pressure, the lattice parameters a and c of the ambient-pressure superconducting tetragonal (T) phase are compressed continuously below a critical pressure (Pc1), wherein the pressure (P) dependence of superconducting transition temperature (Tc) displays a small variation. However, upon further compression, the lattice of the compressed T phase displays an anisotropic change, and Tc shows a monotonous decrease. When the pressure reaches Pc2 (Pc2 > Pc1), the compressed T phase collapses along the c axis, followed by the disappearance of superconductivity and the appearance of the insulating phase. This SC-I transition is fully reversible, with the critical pressure increasing alongside K doping concentration. These findings are strikingly similar to the SC-I transition observed in hole-doped high-Tc cuprate superconductors under pressure. Identifying their commonalities could deepen our understanding of the mechanisms that underlie high- Tc superconductivity in these two oxide superconductors with a perovskite structure., Comment: 16 pages and 5 figures
Published: 2023

16. Open-World Weakly-Supervised Object Localization

Author: Xie, Jinheng, Luo, Zhaochuan, Li, Yuexiang, Liu, Haozhe, Shen, Linlin, and Shou, Mike Zheng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: While remarkable success has been achieved in weakly-supervised object localization (WSOL), current frameworks are not capable of locating objects of novel categories in open-world settings. To address this issue, we are the first to introduce a new weakly-supervised object localization task called OWSOL (Open-World Weakly-Supervised Object Localization). During training, all labeled data comes from known categories and, both known and novel categories exist in the unlabeled data. To handle such data, we propose a novel paradigm of contrastive representation co-learning using both labeled and unlabeled data to generate a complete G-CAM (Generalized Class Activation Map) for object localization, without the requirement of bounding box annotation. As no class label is available for the unlabelled data, we conduct clustering over the full training set and design a novel multiple semantic centroids-driven contrastive loss for representation learning. We re-organize two widely used datasets, i.e., ImageNet-1K and iNatLoc500, and propose OpenImages150 to serve as evaluation benchmarks for OWSOL. Extensive experiments demonstrate that the proposed method can surpass all baselines by a large margin. We believe that this work can shift the close-set localization towards the open-world setting and serve as a foundation for subsequent works. Code will be released at https://github.com/ryylcc/OWSOL.
Published: 2023

17. Superconductivity above 70 K observed in lutetium polyhydrides

Author: Li, Zhiwen, He, Xin, Zhang, Changling, Lu, Ke, Min, Baosen, Zhang, Jun, Zhang, Sijia, Zhao, Jianfa, Shi, Luchuan, Feng, Shaomin, Wang, Xiancheng, Peng, Yi, Yu, Richeng, Wang, Luhong, Li, Yingzhe, Bass, Jay D, Prakapenka, Vitali, Chariton, Stella, Liu, Haozhe, and Jin, Changqing
Subjects: Condensed Matter - Superconductivity, Condensed Matter - Materials Science, Condensed Matter - Strongly Correlated Electrons
Abstract: The binary polyhydrides of heavy rare earth lutetium that shares a similar valence electron configuration to lanthanum have been experimentally discovered to be superconductive. The lutetium polyhydrides were successfully synthesized at high pressure and high temperature conditions using a diamond anvil cell in combinations with the in-situ high pressure laser heating technique. The resistance measurements as a function of temperature were performed at the same pressure of synthesis in order to study the transitions of superconductivity (SC). The superconducting transition with a maximum onset temperature (Tc) 71 K was observed at pressure of 218 GPa in the experiments. The Tc decreased to 65 K when pressure was at 181 GPa. From the evolution of SC at applied magnetic fields, the upper critical field at zero temperature {\mu}0Hc2(0) was obtained to be ~36 Tesla. The in-situ high pressure X-ray diffraction experiments imply that the high Tc SC should arise from the Lu4H23 phase with Pm-3n symmetry that forms a new type of hydrogen cage framework different from those reported for previous light rare earth polyhydride superconductors.
Published: 2023
Full Text: View/download PDF

18. Superconductivity above 30 K achieved in dense scandium

Author: He, Xin, Zhang, Changling, Li, Zhiwen, Zhang, Sijia, Feng, Shaomin, Zhao, Jianfa, Lu, Ke, Min, Baosen, Peng, Yi, Wang, Xiancheng, Song, Jin, Wang, Luhong, Kawaguchi, Saori I., Ji, Cheng, Li, Bing, Liu, Haozhe, Tse, J. S., and Jin, Changqing
Subjects: Condensed Matter - Superconductivity, Condensed Matter - Materials Science, Condensed Matter - Strongly Correlated Electrons
Abstract: Superconductivity is one of most intriguing quantum phenomena, and the quest for elemental superconductors with high critical temperature (Tc) is of great scientific significance due to their relatively simple material composition and the underlying mechanism. Here we report the experimental discovery of densely compressed scandium (Sc) becoming the first elemental superconductor with Tc breaking into 30 K range, which is comparable to the Tc values of the classic La-Ba-Cu-O or LaFeAsO superconductors. Our results show that Tconset of Sc increases from ~3 K at around 43 GPa to ~32 K at about 283 GPa (Tczero ~ 31 K), which is well above liquid neon temperature. Interestingly measured Tc shows no sign of saturation up to the maximum pressure achieved in our experiments, indicating that Tc might be even higher upon further compression., Comment: 17 pages, 4 figures, plus supplementary materials with 5 pages and 3 figures
Published: 2023
Full Text: View/download PDF

19. Improving GAN Training via Feature Space Shrinkage

Author: Liu, Haozhe, Zhang, Wentian, Li, Bing, Wu, Haoqian, He, Nanjun, Huang, Yawen, Li, Yuexiang, Ghanem, Bernard, and Zheng, Yefeng
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Due to the outstanding capability for data generation, Generative Adversarial Networks (GANs) have attracted considerable attention in unsupervised learning. However, training GANs is difficult, since the training distribution is dynamic for the discriminator, leading to unstable image representation. In this paper, we address the problem of training GANs from a novel perspective, \emph{i.e.,} robust image classification. Motivated by studies on robust image representation, we propose a simple yet effective module, namely AdaptiveMix, for GANs, which shrinks the regions of training data in the image representation space of the discriminator. Considering it is intractable to directly bound feature space, we propose to construct hard samples and narrow down the feature distance between hard and easy samples. The hard samples are constructed by mixing a pair of training images. We evaluate the effectiveness of our AdaptiveMix with widely-used and state-of-the-art GAN architectures. The evaluation results demonstrate that our AdaptiveMix can facilitate the training of GANs and effectively improve the image quality of generated samples. We also show that our AdaptiveMix can be further applied to image classification and Out-Of-Distribution (OOD) detection tasks, by equipping it with state-of-the-art methods. Extensive experiments on seven publicly available datasets show that our method effectively boosts the performance of baselines. The code is publicly available at https://github.com/WentianZhang-ML/AdaptiveMix., Comment: Accepted by CVPR'2023. Code and Demo are available at https://github.com/WentianZhang-ML/AdaptiveMix
Published: 2023

20. Decoupled Mixup for Generalized Visual Recognition

Author: Liu, Haozhe, Zhang, Wentian, Xie, Jinheng, Wu, Haoqian, Li, Bing, Zhang, Ziqi, Li, Yuexiang, Huang, Yawen, Ghanem, Bernard, and Zheng, Yefeng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Convolutional neural networks (CNN) have demonstrated remarkable performance when the training and testing data are from the same distribution. However, such trained CNN models often largely degrade on testing data which is unseen and Out-Of-the-Distribution (OOD). To address this issue, we propose a novel "Decoupled-Mixup" method to train CNN models for OOD visual recognition. Different from previous work combining pairs of images homogeneously, our method decouples each image into discriminative and noise-prone regions, and then heterogeneously combines these regions of image pairs to train CNN models. Since the observation is that noise-prone regions such as textural and clutter backgrounds are adverse to the generalization ability of CNN models during training, we enhance features from discriminative regions and suppress noise-prone ones when combining an image pair. To further improve the generalization ability of trained models, we propose to disentangle discriminative and noise-prone regions in frequency-based and context-based fashions. Experiment results show the high generalization performance of our method on testing data that are composed of unseen contexts, where our method achieves 85.76\% top-1 accuracy in Track-1 and 79.92\% in Track-2 in the NICO Challenge. The source code is available at https://github.com/HaozheLiu-ST/NICOChallenge-OOD-Classification., Comment: Accepted by ECCV'2022 Workshop: Causality in Vision
Published: 2022

21. A Uniform Representation Learning Method for OCT-based Fingerprint Presentation Attack Detection and Reconstruction

Author: Zhang, Wentian, Liu, Haozhe, Liu, Feng, and Ramachandra, Raghavendra
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The technology of optical coherence tomography (OCT) to fingerprint imaging opens up a new research potential for fingerprint recognition owing to its ability to capture depth information of the skin layers. Developing robust and high security Automated Fingerprint Recognition Systems (AFRSs) are possible if the depth information can be fully utilized. However, in existing studies, Presentation Attack Detection (PAD) and subsurface fingerprint reconstruction based on depth information are treated as two independent branches, resulting in high computation and complexity of AFRS building.Thus, this paper proposes a uniform representation model for OCT-based fingerprint PAD and subsurface fingerprint reconstruction. Firstly, we design a novel semantic segmentation network which only trained by real finger slices of OCT-based fingerprints to extract multiple subsurface structures from those slices (also known as B-scans). The latent codes derived from the network are directly used to effectively detect the PA since they contain abundant subsurface biological information, which is independent with PA materials and has strong robustness for unknown PAs. Meanwhile, the segmented subsurface structures are adopted to reconstruct multiple subsurface 2D fingerprints. Recognition can be easily achieved by using existing mature technologies based on traditional 2D fingerprints. Extensive experiments are carried on our own established database, which is the largest public OCT-based fingerprint database with 2449 volumes. In PAD task, our method can improve 0.33% Acc from the state-of-the-art method. For reconstruction performance, our method achieves the best performance with 0.834 mIOU and 0.937 PA. By comparing with the recognition performance on surface 2D fingerprints, the effectiveness of our proposed method on high quality subsurface fingerprint reconstruction is further proved., Comment: 13 pages, 8 figures
Published: 2022

22. A Benchmark for Weakly Semi-Supervised Abnormality Localization in Chest X-Rays

Author: Ji, Haoqin, Liu, Haozhe, Li, Yuexiang, Xie, Jinheng, He, Nanjun, Huang, Yawen, Wei, Dong, Chen, Xinrong, Shen, Linlin, and Zheng, Yefeng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Accurate abnormality localization in chest X-rays (CXR) can benefit the clinical diagnosis of various thoracic diseases. However, the lesion-level annotation can only be performed by experienced radiologists, and it is tedious and time-consuming, thus difficult to acquire. Such a situation results in a difficulty to develop a fully-supervised abnormality localization system for CXR. In this regard, we propose to train the CXR abnormality localization framework via a weakly semi-supervised strategy, termed Point Beyond Class (PBC), which utilizes a small number of fully annotated CXRs with lesion-level bounding boxes and extensive weakly annotated samples by points. Such a point annotation setting can provide weakly instance-level information for abnormality localization with a marginal annotation cost. Particularly, the core idea behind our PBC is to learn a robust and accurate mapping from the point annotations to the bounding boxes against the variance of annotated points. To achieve that, a regularization term, namely multi-point consistency, is proposed, which drives the model to generate the consistent bounding box from different point annotations inside the same abnormality. Furthermore, a self-supervision, termed symmetric consistency, is also proposed to deeply exploit the useful information from the weakly annotated data for abnormality localization. Experimental results on RSNA and VinDr-CXR datasets justify the effectiveness of the proposed method. When less than 20% box-level labels are used for training, an improvement of ~5 in mAP can be achieved by our PBC, compared to the current state-of-the-art method (i.e., Point DETR). Code is available at https://github.com/HaozheLiu-ST/Point-Beyond-Class., Comment: Accepted by MICCAI-2022
Published: 2022

23. Combating Mode Collapse in GANs via Manifold Entropy Estimation

Author: Liu, Haozhe, Li, Bing, Wu, Haoqian, Liang, Hanbang, Huang, Yawen, Li, Yuexiang, Ghanem, Bernard, and Zheng, Yefeng
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Generative Adversarial Networks (GANs) have shown compelling results in various tasks and applications in recent years. However, mode collapse remains a critical problem in GANs. In this paper, we propose a novel training pipeline to address the mode collapse issue of GANs. Different from existing methods, we propose to generalize the discriminator as feature embedding and maximize the entropy of distributions in the embedding space learned by the discriminator. Specifically, two regularization terms, i.e., Deep Local Linear Embedding (DLLE) and Deep Isometric feature Mapping (DIsoMap), are designed to encourage the discriminator to learn the structural information embedded in the data, such that the embedding space learned by the discriminator can be well-formed. Based on the well-learned embedding space supported by the discriminator, a non-parametric entropy estimator is designed to efficiently maximize the entropy of embedding vectors, playing as an approximation of maximizing the entropy of the generated distribution. By improving the discriminator and maximizing the distance of the most similar samples in the embedding space, our pipeline effectively reduces the mode collapse without sacrificing the quality of generated samples. Extensive experimental results show the effectiveness of our method, which outperforms the GAN baseline, MaF-GAN on CelebA (9.13 vs. 12.43 in FID) and surpasses the recent state-of-the-art energy-based model on the ANIME-FACE dataset (2.80 vs. 2.26 in Inception score). The code is available at https://github.com/HaozheLiu-ST/MEE, Comment: Accepted by AAAI'2023 (Oral); Code is released at https://github.com/HaozheLiu-ST/MEE
Published: 2022

24. Activation Template Matching Loss for Explainable Face Recognition

Author: Lin, Huawei, Liu, Haozhe, Li, Qiufu, and Shen, Linlin
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Can we construct an explainable face recognition network able to learn a facial part-based feature like eyes, nose, mouth and so forth, without any manual annotation or additionalsion datasets? In this paper, we propose a generic Explainable Channel Loss (ECLoss) to construct an explainable face recognition network. The explainable network trained with ECLoss can easily learn the facial part-based representation on the target convolutional layer, where an individual channel can detect a certain face part. Our experiments on dozens of datasets show that ECLoss achieves superior explainability metrics, and at the same time improves the performance of face verification without face alignment. In addition, our visualization results also illustrate the effectiveness of the proposed ECLoss., Comment: 13 pages, 7 figures, 5 tables
Published: 2022

25. Robust Representation via Dynamic Feature Aggregation

Author: Liu, Haozhe, Ji, Haoqin, Li, Yuexiang, He, Nanjun, Wu, Haoqian, Liu, Feng, Shen, Linlin, and Zheng, Yefeng
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Deep convolutional neural network (CNN) based models are vulnerable to the adversarial attacks. One of the possible reasons is that the embedding space of CNN based model is sparse, resulting in a large space for the generation of adversarial samples. In this study, we propose a method, denoted as Dynamic Feature Aggregation, to compress the embedding space with a novel regularization. Particularly, the convex combination between two samples are regarded as the pivot for aggregation. In the embedding space, the selected samples are guided to be similar to the representation of the pivot. On the other side, to mitigate the trivial solution of such regularization, the last fully-connected layer of the model is replaced by an orthogonal classifier, in which the embedding codes for different classes are processed orthogonally and separately. With the regularization and orthogonal classifier, a more compact embedding space can be obtained, which accordingly improves the model robustness against adversarial attacks. An averaging accuracy of 56.91% is achieved by our method on CIFAR-10 against various attack methods, which significantly surpasses a solid baseline (Mixup) by a margin of 37.31%. More surprisingly, empirical results show that, the proposed method can also achieve the state-of-the-art performance for out-of-distribution (OOD) detection, due to the learned compact feature space. An F1 score of 0.937 is achieved by the proposed method, when adopting CIFAR-10 as in-distribution (ID) dataset and LSUN as OOD dataset. Code is available at https://github.com/HaozheLiu-ST/DynamicFeatureAggregation.
Published: 2022

26. Scene Consistency Representation Learning for Video Scene Segmentation

Author: Wu, Haoqian, Chen, Keyu, Luo, Yanan, Qiao, Ruizhi, Ren, Bo, Liu, Haozhe, Xie, Weicheng, and Shen, Linlin
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: A long-term video, such as a movie or TV show, is composed of various scenes, each of which represents a series of shots sharing the same semantic story. Spotting the correct scene boundary from the long-term video is a challenging task, since a model must understand the storyline of the video to figure out where a scene starts and ends. To this end, we propose an effective Self-Supervised Learning (SSL) framework to learn better shot representations from unlabeled long-term videos. More specifically, we present an SSL scheme to achieve scene consistency, while exploring considerable data augmentation and shuffling methods to boost the model generalizability. Instead of explicitly learning the scene boundary features as in the previous methods, we introduce a vanilla temporal model with less inductive bias to verify the quality of the shot features. Our method achieves the state-of-the-art performance on the task of Video Scene Segmentation. Additionally, we suggest a more fair and reasonable benchmark to evaluate the performance of Video Scene Segmentation methods. The code is made available., Comment: Accepted to CVPR 2022
Published: 2022

27. Effect of Bacillus subtilis Supplemented Diet on Broiler’s Intestinal Microbiota and TLRs Gene Expression

Author: Khan, Salman, Khalid, Anam, Yang, Ru, Khalid, Fatima, Zahid, Muhammad Hamza, Liu, Haozhe, Zhang, Yunhua, and Wang, Zaigui
Published: 2023
Full Text: View/download PDF

28. Why KDAC? A general activation function for knowledge discovery

Author: Wang, Zhenhua, Gao, Dong, Liu, Haozhe, and Liu, Fanglin
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language, Computer Science - Neural and Evolutionary Computing
Abstract: Deep learning oriented named entity recognition (DNER) has gradually become the paradigm of knowledge discovery, which greatly promotes domain intelligence. However, the current activation function of DNER fails to treat gradient vanishing, no negative output or non-differentiable existence, which may impede knowledge exploration caused by the omission and incomplete representation of latent semantics. To break through the dilemma, we present a novel activation function termed KDAC. Detailly, KDAC is an aggregation function with multiple conversion modes. The backbone of the activation region is the interaction between exponent and linearity, and the both ends extend through adaptive linear divergence, which surmounts the obstacle of gradient vanishing and no negative output. Crucially, the non-differentiable points are alerted and eliminated by an approximate smoothing algorithm. KDAC has a series of brilliant properties, including nonlinear, stable near-linear transformation and derivative, as well as dynamic style, etc. We perform experiments based on BERT-BiLSTM-CNN-CRF model on six benchmark datasets containing different domain knowledge, such as Weibo, Clinical, E-commerce, Resume, HAZOP and People's daily. The evaluation results show that KDAC is advanced and effective, and can provide more generalized activation to stimulate the performance of DNER. We hope that KDAC can be exploited as a promising activation function to devote itself to the construction of knowledge., Comment: Accepted by Neurocomputing
Published: 2021

29. FRT-PAD: Effective Presentation Attack Detection Driven by Face Related Task

Author: Zhang, Wentian, Liu, Haozhe, Liu, Feng, Ramachandra, Raghavendra, and Busch, Christoph
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The robustness and generalization ability of Presentation Attack Detection (PAD) methods is critical to ensure the security of Face Recognition Systems (FRSs). However, in a real scenario, Presentation Attacks (PAs) are various and it is hard to predict the Presentation Attack Instrument (PAI) species that will be used by the attacker. Existing PAD methods are highly dependent on the limited training set and cannot generalize well to unknown PAI species. Unlike this specific PAD task, other face related tasks trained by huge amount of real faces (e.g. face recognition and attribute editing) can be effectively adopted into different application scenarios. Inspired by this, we propose to trade position of PAD and face related work in a face system and apply the free acquired prior knowledge from face related tasks to solve face PAD, so as to improve the generalization ability in detecting PAs. The proposed method, first introduces task specific features from other face related task, then, we design a Cross-Modal Adapter using a Graph Attention Network (GAT) to re-map such features to adapt to PAD task. Finally, face PAD is achieved by using the hierarchical features from a CNN-based PA detector and the re-mapped features. The experimental results show that the proposed method can achieve significant improvements in the complicated and hybrid datasets, when compared with the state-of-the-art methods. In particular, when training on the datasets OULU-NPU, CASIA-FASD, and Idiap Replay-Attack, we obtain HTER (Half Total Error Rate) of 5.48% for the testing dataset MSU-MFSD, outperforming the baseline by 7.39%., Comment: Accepted by ECCV 2022
Published: 2021

30. Fingerprint Presentation Attack Detection by Channel-wise Feature Denoising

Author: Liu, Feng, Kong, Zhe, Liu, Haozhe, Zhang, Wentian, and Shen, Linlin
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Due to the diversity of attack materials, fingerprint recognition systems (AFRSs) are vulnerable to malicious attacks. It is thus important to propose effective fingerprint presentation attack detection (PAD) methods for the safety and reliability of AFRSs. However, current PAD methods often exhibit poor robustness under new attack types settings. This paper thus proposes a novel channel-wise feature denoising fingerprint PAD (CFD-PAD) method by handling the redundant noise information ignored in previous studies. The proposed method learns important features of fingerprint images by weighing the importance of each channel and identifying discriminative channels and "noise" channels. Then, the propagation of "noise" channels is suppressed in the feature map to reduce interference. Specifically, a PA-Adaptation loss is designed to constrain the feature distribution to make the feature distribution of live fingerprints more aggregate and that of spoof fingerprints more disperse. Experimental results evaluated on the LivDet 2017 dataset showed that the proposed CFD-PAD can achieve a 2.53% average classification error (ACE) and a 93.83% true detection rate when the false detection rate equals 1.0% (TDR@FDR=1%). Also, the proposed method markedly outperforms the best single-model-based methods in terms of ACE (2.53% vs. 4.56%) and TDR@FDR=1%(93.83% vs. 73.32%), which demonstrates its effectiveness. Although we have achieved a comparable result with the state-of-the-art multiple-model-based methods, there still is an increase in TDR@FDR=1% from 91.19% to 93.83%. In addition, the proposed model is simpler, lighter and more efficient and has achieved a 74.76% reduction in computation time compared with the state-of-the-art multiple-model-based method. The source code is available at https://github.com/kongzhecn/cfd-pad., Comment: 15 pages, 8 figures, Accepted by TIFS
Published: 2021
Full Text: View/download PDF

31. Manifold-preserved GANs

Author: Liu, Haozhe, Liang, Hanbang, Hou, Xianxu, Wu, Haoqian, Liu, Feng, and Shen, Linlin
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Generative Adversarial Networks (GANs) have been widely adopted in various fields. However, existing GANs generally are not able to preserve the manifold of data space, mainly due to the simple representation of discriminator for the real/generated data. To address such open challenges, this paper proposes Manifold-preserved GANs (MaF-GANs), which generalize Wasserstein GANs into high-dimensional form. Specifically, to improve the representation of data, the discriminator in MaF-GANs is designed to map data into a high-dimensional manifold. Furthermore, to stabilize the training of MaF-GANs, an operation with precise and universal solution for any K-Lipschitz continuity, called Topological Consistency is proposed. The effectiveness of the proposed method is justified by both theoretical analysis and empirical results. When adopting DCGAN as the backbone on CelebA (256*256), the proposed method achieved 12.43 FID, which outperforms the state-of-the-art model like Realness GAN (23.51 FID) by a large margin. Code will be made publicly available.
Published: 2021

32. Taming Self-Supervised Learning for Presentation Attack Detection: De-Folding and De-Mixing

Author: Kong, Zhe, Zhang, Wentian, Liu, Feng, Luo, Wenhan, Liu, Haozhe, Shen, Linlin, and Ramachandra, Raghavendra
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Biometric systems are vulnerable to Presentation Attacks (PA) performed using various Presentation Attack Instruments (PAIs). Even though there are numerous Presentation Attack Detection (PAD) techniques based on both deep learning and hand-crafted features, the generalization of PAD for unknown PAI is still a challenging problem. In this work, we empirically prove that the initialization of the PAD model is a crucial factor for the generalization, which is rarely discussed in the community. Based on such observation, we proposed a self-supervised learning-based method, denoted as DF-DM. Specifically, DF-DM is based on a global-local view coupled with De-Folding and De-Mixing to derive the task-specific representation for PAD. During De-Folding, the proposed technique will learn region-specific features to represent samples in a local pattern by explicitly minimizing generative loss. While De-Mixing drives detectors to obtain the instance-specific features with global information for more comprehensive representation by minimizing interpolation-based consistency. Extensive experimental results show that the proposed method can achieve significant improvements in terms of both face and fingerprint PAD in more complicated and hybrid datasets when compared with state-of-the-art methods. When training in CASIA-FASD and Idiap Replay-Attack, the proposed method can achieve an 18.60% Equal Error Rate (EER) in OULU-NPU and MSU-MFSD, exceeding baseline performance by 9.54%. The source code of the proposed technique is available at https://github.com/kongzhecn/dfdm., Comment: Accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
Published: 2021
Full Text: View/download PDF

33. Study on the mechanism and law of temperature, humidity and moisture content on the mechanical properties of molded fiber products

Author: Fu, Zhiqiang, Zhao, Tong, Wang, Hu, Wei, Jingyi, Liu, Haozhe, Duan, Liying, Wang, Yan, and Yan, Ruixiang
Published: 2024
Full Text: View/download PDF

34. Target-site and non-target-site resistance mechanisms confer mesosulfuron-methyl resistance in Alopecurus aequalis

Author: Zhan, You, Liu, Haozhe, Cao, Ziheng, Qi, Jiale, Bai, Lianyang, and Pan, Lang
Published: 2024
Full Text: View/download PDF

35. Anomaly detection via gating highway connection for retinal fundus images

Author: Zhang, Wentian, Liu, Haozhe, Xie, Jinheng, Huang, Yawen, Zhang, Yu, Li, Yuexiang, Ramachandra, Raghavendra, and Zheng, Yefeng
Published: 2024
Full Text: View/download PDF

36. Group-wise Inhibition based Feature Regularization for Robust Classification

Author: Liu, Haozhe, Wu, Haoqian, Xie, Weicheng, Liu, Feng, and Shen, Linlin
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: The convolutional neural network (CNN) is vulnerable to degraded images with even very small variations (e.g. corrupted and adversarial samples). One of the possible reasons is that CNN pays more attention to the most discriminative regions, but ignores the auxiliary features when learning, leading to the lack of feature diversity for final judgment. In our method, we propose to dynamically suppress significant activation values of CNN by group-wise inhibition, but not fixedly or randomly handle them when training. The feature maps with different activation distribution are then processed separately to take the feature independence into account. CNN is finally guided to learn richer discriminative features hierarchically for robust classification according to the proposed regularization. Our method is comprehensively evaluated under multiple settings, including classification against corruptions, adversarial attacks and low data regime. Extensive experimental results show that the proposed method can achieve significant improvements in terms of both robustness and generalization performances, when compared with the state-of-the-art methods. Code is available at https://github.com/LinusWu/TENET_Training., Comment: Accepted to ICCV 2021
Published: 2021

37. Decoupled Mixup for Out-of-Distribution Visual Recognition

Author: Liu, Haozhe, Zhang, Wentian, Xie, Jinheng, Wu, Haoqian, Li, Bing, Zhang, Ziqi, Li, Yuexiang, Huang, Yawen, Ghanem, Bernard, Zheng, Yefeng, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Karlinsky, Leonid, editor, Michaeli, Tomer, editor, and Nishino, Ko, editor
Published: 2023
Full Text: View/download PDF

38. Synergistic hydroxyl mechanism on halloysite-confined PtFe alloy boosting low-temperature CO-PROX performance

Author: Wang, Qi, Li, Liping, Huang, Taotao, Ding, Junfang, Li, Xinbo, Geng, Zhibin, Liu, Haozhe, and Li, Guangshe
Published: 2024
Full Text: View/download PDF

39. High-temperature dynamic luminescence of MgGa2O4: Tb3+, Er3+ phosphors for advanced anti-counterfeiting and information encryption

Author: Wang, Guohao, Wang, Ting, Bai, Yan, Hou, Lihui, Huang, Wenlong, Zhu, Xuanyu, Liu, Haozhe, Guo, Longchao, and Yu, Xue
Published: 2024
Full Text: View/download PDF

40. A uniform representation model for OCT-based fingerprint presentation attack detection and reconstruction

Author: Zhang, Wentian, Liu, Haozhe, Liu, Feng, and Ramachandra, Raghavendra
Published: 2024
Full Text: View/download PDF

41. A Zero-Shot based Fingerprint Presentation Attack Detection System

Author: Liu, Haozhe, Zhang, Wentian, Liu, Guojie, and Liu, Feng
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: With the development of presentation attacks, Automated Fingerprint Recognition Systems(AFRSs) are vulnerable to presentation attack. Thus, numerous methods of presentation attack detection(PAD) have been proposed to ensure the normal utilization of AFRS. However, the demand of large-scale presentation attack images and the low-level generalization ability always astrict existing PAD methods' actual performances. Therefore, we propose a novel Zero-Shot Presentation Attack Detection Model to guarantee the generalization of the PAD model. The proposed ZSPAD-Model based on generative model does not utilize any negative samples in the process of establishment, which ensures the robustness for various types or materials based presentation attack. Different from other auto-encoder based model, the Fine-grained Map architecture is proposed to refine the reconstruction error of the auto-encoder networks and a task-specific gaussian model is utilized to improve the quality of clustering. Meanwhile, in order to improve the performance of the proposed model, 9 confidence scores are discussed in this article. Experimental results showed that the ZSPAD-Model is the state of the art for ZSPAD, and the MS-Score is the best confidence score. Compared with existing methods, the proposed ZSPAD-Model performs better than the feature-based method and under the multi-shot setting, the proposed method overperforms the learning based method with little training data. When large training data is available, their results are similar.
Published: 2020

42. A deep weakly semi-supervised framework for endoscopic lesion segmentation

Author: Shi, Yuxuan, Wang, Hong, Ji, Haoqin, Liu, Haozhe, Li, Yuexiang, He, Nanjun, Wei, Dong, Huang, Yawen, Dai, Qi, Wu, Jianrong, Chen, Xinrong, Zheng, Yefeng, and Yu, Hongmeng
Published: 2023
Full Text: View/download PDF

43. Deep Learning Reconstruction Improves the Image Quality of CT Angiography Derived From 80-kVp Cerebral CT Perfusion Data

Author: Chen, Yu, Wang, Yanling, Su, Tong, Xu, Min, Yan, Jing, Wang, Jian, Liu, Haozhe, Lu, Xiaoping, Wang, Yun, and Jin, Zhengyu
Published: 2023
Full Text: View/download PDF

44. Limit cycles and bifurcations in a class of planar piecewise linear systems with a nonregular separation line

Author: Liu, Haozhe, Wei, Zhouchao, and Moroz, Irene
Published: 2023
Full Text: View/download PDF

45. Phosphoramidic acid functionalized silica microspheres for simultaneous removal of Cr(VI), As(V) and Se(VI) from aqueous solutions based on molecular geometry match

Author: Liang, Shiqi, Jiao, Wenmei, Zhang, Dingyi, Zhang, Hu, Qiao, Rongrong, Liu, Haozhe, Wang, Meng, Chen, Yu, Zou, Meng, Huang, Yan, Guo, Wenhui, Li, Lei, and Huang, Guang
Published: 2023
Full Text: View/download PDF

46. Detoxification mechanism of herbicide in Polypogon fugax and its influence on rhizosphere enzyme activities

Author: Chen, Wen, Li, Sifu, Bai, Dingyi, Li, Zongfang, Liu, Haozhe, Bai, Lianyang, and Pan, Lang
Published: 2023
Full Text: View/download PDF

47. Advances of mass spectrometry in characterization of disinfection byproducts in drinking water

Author: Chen, Yu, Zou, Meng, Huang, Yan, Xie, Ziyan, Liu, Haozhe, Wu, Qian, Jiao, Wenmei, Qiu, Junlang, Huang, Guang, and Yang, Xin
Published: 2023
Full Text: View/download PDF

48. Depth and contaminant-shaped bacterial community structure and assembly at an aged chlorinated aliphatic hydrocarbon-contaminated site

Author: Zhao, Ke, Yang, Yuying, Hou, Jinyu, Liu, Haozhe, Zhang, Yun, Wang, Qingling, Christie, Peter, Qi, Peishi, and Liu, Wuxing
Published: 2023
Full Text: View/download PDF

49. Temperature dependent luminescence properties of Mn2+ ions for site preference of NaCa2GeO4F: Mn2+, Fe3+ phosphor

Author: Liu, Haozhe, Wang, Ting, Ge, Yicen, Zhu, Xuanyu, Nie, Lin, Zhao, Feng, Qiu, Jianbei, Xu, Xuhui, and Yu, Xue
Published: 2023
Full Text: View/download PDF

50. Micron-size bubble defects in fused silica and its laser induced damage near 355 nm

Author: Hu, Shuo, Li, Hongyu, Dong, Bo, Ma, Chuan, Zhang, Shuai, Liu, Haozhe, Lu, Lihua, Chen, Jiaxuan, and Miao, Xinxiang
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

518 results on '"Liu, Haozhe"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources