797 results on '"Similarity learning"'
Search Results
2. Similarity learning networks uniquely identify individuals of four marine and terrestrial species.
- Author
-
Kabuga, Emmanuel, Langley, Izzy, Arso Civil, Monica, Measey, John, Bah, Bubacarr, and Durbach, Ian
- Subjects
ANIMAL population estimates ,CONVOLUTIONAL neural networks ,COMPUTER vision ,DATA augmentation ,DEEP learning - Abstract
Estimating the size of animal populations plays an important role in evidence‐based conservation and management. Some methods for estimating population size rely on animals being individually identifiable. Traditionally, this has been done by marking physically captured animals, but increasingly, animals with distinctive natural markings are surveyed noninvasively using cameras. Animal reidentification from photographs is usually done manually, which is expensive, laborious, and requires considerable skill. An alternative is to develop computer vision methods that can support or replace the manual identification task. We developed an automated approach using deep learning to identify whether a pair of photographs is of the same individual or not. The core of the approach is a similarity learning network that uses paired convolutional neural networks with a triplet loss function to summarize image pairs and decide whether they are from the same individual. Prior to the main matching step, two additional convolutional neural networks perform image segmentation, cropping the animal object within the image, and orientation prediction, deciding which side of the animal was photographed. We applied the approach to four species, with images of the same individual often spanning several years: systematic surveys of bottlenose dolphins (Tursiops truncatus, 2008–2019) and harbor seals (Phoca vitulina, 2015–2019), a citizen science dataset of western leopard toads (Sclerophrys pantherina, unknown dates), and a publicly available repository of humpback whale images (Megaptera novaeangliae, unknown dates). For these species, our best‐performing models were able to identify whether a pair of images were from the same individual or different individuals in 95.8%, 94.6%, 88.2%, and 83.8% of the cases, respectively. We found that triplet loss functions outperformed binary cross‐entropy loss functions and that data augmentation and additional manual curation of training data provided small but consistent improvements in performance. These results demonstrate the potential of deep learning to replace or, more likely, support and facilitate manual individual identification efforts. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. A novel bearing fault diagnosis method for compound defects via zero-shot learning.
- Author
-
Thuan, Nguyen Duc
- Subjects
- *
DIAGNOSIS methods , *DATA mapping , *DIAGNOSIS , *ACHIEVEMENT , *DEEP learning - Abstract
In recent years, deep learning-based bearing fault diagnosis methods have made significant achievements. However, these methods only work with single faults and cannot diagnose compound faults because compound fault data is often unavailable in practice. To address this problem, this paper proposes a zero-shot learning-based bearing fault diagnosis method for compound defects. The proposed method utilizes an autoencoder network to observe the attributes of single faults and then estimates the attributes of compound faults. Afterward, a mapping from the data space to the attribute space is established to predict the attribute output of the data. The attribute output is then compared with prior attributes to determine the type of bearing fault. Verification experiments were conducted on HUST bearing dataset. The experimental results showed that the proposed method achieved a high accuracy of 75.64 % in diagnosing compound bearing faults. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. Boosting Semi-Supervised Learning with Dual-Threshold Screening and Similarity Learning.
- Author
-
Liang, Zechen, Wang, Yuan-Gen, Lu, Wei, and Cao, Xiaochun
- Subjects
SUPERVISED learning ,CLASSIFICATION ,FORECASTING - Abstract
How to effectively utilize unlabeled data for training is a key problem in Semi-Supervised Learning (SSL). Existing SSL methods often consider the unlabeled data whose predictions are beyond a fixed threshold (e.g., 0.95) and discard those less than 0.95. We argue that these discarded data have a large proportion, are of hard sample, and will benefit the model training if used properly. In this article, we propose a novel method to take full advantage of the unlabeled data, termed DTS-SimL, which includes two core designs: Dual-Threshold Screening and Similarity Learning. Except for the fixed threshold, DTS-SimL extracts another class-adaptive threshold from the labeled data. Such a class-adaptive threshold can screen many unlabeled data whose predictions are lower than 0.95 but over the extracted one for model training. On the other hand, we design a new similar loss to perform similarity learning for all the highly similar unlabeled data, which can further mine the valuable information from the unlabeled data. Finally, for more effective training of DTS-SimL, we construct an overall loss function by assigning four different losses to four different types of data. Extensive experiments are conducted on five benchmark datasets, including CIFAR-10, CIFAR-100, SVHN, Mini-ImageNet, and DomainNet-Real. Experimental results show that the proposed DTS-SimL achieves state-of-the-art classification accuracy. The code is publicly available at https://github.com/GZHU-DVL/DTS-SimL. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Pairwise-Constraint-Guided Multi-View Feature Selection by Joint Sparse Regularization and Similarity Learning.
- Author
-
Li, Jinxi and Tao, Hong
- Subjects
- *
FEATURE selection , *IMAGE recognition (Computer vision) , *ALGORITHMS - Abstract
Feature selection is a basic and important step in real applications, such as face recognition and image segmentation. In this paper, we propose a new weakly supervised multi-view feature selection method by utilizing pairwise constraints, i.e., the pairwise constraint-guided multi-view feature selection (PCFS for short) method. In this method, linear projections of all views and a consistent similarity graph with pairwise constraints are jointly optimized to learning discriminative projections. Meanwhile, the l 2 , 0 -norm-based row sparsity constraint is imposed on the concatenation of projections for discriminative feature selection. Then, an iterative algorithm with theoretically guaranteed convergence is developed for the optimization of PCFS. The performance of the proposed PCFS method was evaluated by comprehensive experiments on six benchmark datasets and applications on cancer clustering. The experimental results demonstrate that PCFS exhibited competitive performance in feature selection in comparison with related models. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. Learning Attribute-guided Fashion Similarity with Spatial and Channel Attention.
- Author
-
Wan, Yongquan, Yan, Kang, Yan, Cairong, and Zhang, Bofeng
- Subjects
- *
IMAGE retrieval , *ARTIFICIAL intelligence - Abstract
Fashion image retrieval is one of the important services of e-commerce platforms, and it is also the basis of various fashion-related AI applications. Studies have shown that in a multi-modal environment (images + attribute labels), embedding items into specific attribute spaces can support more fine-grained similarity measures, which is especially suitable for fashion retrieval tasks. In this paper, we propose an attention-based attribute-guided similarity learning network (AttnFashion) for fashion image retrieval. The core of this network is an attribute-guided spatial attention module and an attribute-guided channel attention module, which correspond to the mapping between attributes and image regions, and the mapping between attributes and high-level image semantics, respectively. To make these two modules interact deeply, we design a parallel structure that allows them to share attribute embeddings and guide each other to extract specific features, which also helps to reduce the network parameters of the attention modules. An adaptive feature fusion strategy is proposed to synthesise the features extracted by the two modules. Extensive experiments show that the proposed AttnFashion performs better than current competitive networks in the field of fine-grained attribute-based fashion retrieval. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Similarity learning networks uniquely identify individuals of four marine and terrestrial species
- Author
-
Emmanuel Kabuga, Izzy Langley, Monica Arso Civil, John Measey, Bubacarr Bah, and Ian Durbach
- Subjects
deep learning ,individual identification ,siamese neural network ,similarity learning ,triplet loss ,Ecology ,QH540-549.5 - Abstract
Abstract Estimating the size of animal populations plays an important role in evidence‐based conservation and management. Some methods for estimating population size rely on animals being individually identifiable. Traditionally, this has been done by marking physically captured animals, but increasingly, animals with distinctive natural markings are surveyed noninvasively using cameras. Animal reidentification from photographs is usually done manually, which is expensive, laborious, and requires considerable skill. An alternative is to develop computer vision methods that can support or replace the manual identification task. We developed an automated approach using deep learning to identify whether a pair of photographs is of the same individual or not. The core of the approach is a similarity learning network that uses paired convolutional neural networks with a triplet loss function to summarize image pairs and decide whether they are from the same individual. Prior to the main matching step, two additional convolutional neural networks perform image segmentation, cropping the animal object within the image, and orientation prediction, deciding which side of the animal was photographed. We applied the approach to four species, with images of the same individual often spanning several years: systematic surveys of bottlenose dolphins (Tursiops truncatus, 2008–2019) and harbor seals (Phoca vitulina, 2015–2019), a citizen science dataset of western leopard toads (Sclerophrys pantherina, unknown dates), and a publicly available repository of humpback whale images (Megaptera novaeangliae, unknown dates). For these species, our best‐performing models were able to identify whether a pair of images were from the same individual or different individuals in 95.8%, 94.6%, 88.2%, and 83.8% of the cases, respectively. We found that triplet loss functions outperformed binary cross‐entropy loss functions and that data augmentation and additional manual curation of training data provided small but consistent improvements in performance. These results demonstrate the potential of deep learning to replace or, more likely, support and facilitate manual individual identification efforts.
- Published
- 2024
- Full Text
- View/download PDF
8. astPSL: Similarity Learning System Based on Structured and Unstructured Records
- Author
-
Zhang, Jie, Zhang, Wenxin, Kang, Mengfei, Zhang, Xin, Zhu, Lei, Hei, Xinhong, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Onizuka, Makoto, editor, Lee, Jae-Gil, editor, Tong, Yongxin, editor, Xiao, Chuan, editor, Ishikawa, Yoshiharu, editor, Amer-Yahia, Sihem, editor, Jagadish, H. V., editor, and Lu, Kejing, editor
- Published
- 2024
- Full Text
- View/download PDF
9. Automatic Adjusting Global Similarity Measures in Learning CBR Systems
- Author
-
Ottersen, Stuart G., Bach, Kerstin, Hartmanis, Juris, Founding Editor, van Leeuwen, Jan, Series Editor, Hutchison, David, Editorial Board Member, Kanade, Takeo, Editorial Board Member, Kittler, Josef, Editorial Board Member, Kleinberg, Jon M., Editorial Board Member, Kobsa, Alfred, Series Editor, Mattern, Friedemann, Editorial Board Member, Mitchell, John C., Editorial Board Member, Naor, Moni, Editorial Board Member, Nierstrasz, Oscar, Series Editor, Pandu Rangan, C., Editorial Board Member, Sudan, Madhu, Series Editor, Terzopoulos, Demetri, Editorial Board Member, Tygar, Doug, Editorial Board Member, Weikum, Gerhard, Series Editor, Vardi, Moshe Y, Series Editor, Goos, Gerhard, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Recio-Garcia, Juan A., editor, Orozco-del-Castillo, Mauricio G., editor, and Bridge, Derek, editor
- Published
- 2024
- Full Text
- View/download PDF
10. HARR: Learning Discriminative and High-Quality Hash Codes for Image Retrieval.
- Author
-
Ma, Zeyu, Wang, Siwei, Luo, Xiao, Gu, Zhonghui, Chen, Chong, Li, Jinxing, Hua, Xian-Sheng, and Lu, Guangming
- Subjects
IMAGE retrieval ,COMPUTER programming education ,HUFFMAN codes ,BINARY codes - Abstract
This article studies deep unsupervised hashing, which has attracted increasing attention in large-scale image retrieval. The majority of recent approaches usually reconstruct semantic similarity information, which then guides the hash code learning. However, they still fail to achieve satisfactory performance in reality for two reasons. On the one hand, without accurate supervised information, these methods usually fail to produce independent and robust hash codes with semantics information well preserved, which may hinder effective image retrieval. On the other hand, due to discrete constraints, how to effectively optimize the hashing network in an end-to-end manner with small quantization errors remains a problem. To address these difficulties, we propose a novel unsupervised hashing method called HARR to learn discriminative and high-quality hash codes. To comprehensively explore semantic similarity structure, HARR adopts the Winner-Take-All hash to model the similarity structure. Then similarity-preserving hash codes are learned under the reliable guidance of the reconstructed similarity structure. Additionally, we improve the quality of hash codes by a bit correlation reduction module, which forces the cross-correlation matrix between a batch of hash codes under different augmentations to approach the identity matrix. In this way, the generated hash bits are expected to be invariant to disturbances with minimal redundancy, which can be further interpreted as an instantiation of the information bottleneck principle. Finally, for effective hashing network training, we minimize the cosine distances between real-value network outputs and their binary codes for small quantization errors. Extensive experiments demonstrate the effectiveness of our proposed HARR. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
11. Remaining Useful Life Prediction for Pressurized Fluid Pipelines Based on Acoustic Emission Monitoring and an Adaptive Fuzzy Similarity Measure
- Author
-
Duc-Thuan Nguyen, Tuan-Khai Nguyen, Zahoor Ahmad, and Jong-Myon Kim
- Subjects
Remaining useful life ,pipelines ,acoustic emission ,fuzzy logic ,similarity learning ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Pressurized fluid pipelines are among the most crucial components in industrial settings. Operating under high pressure leads to pipeline susceptibility to cracking, rupture, and significant damage. Monitoring the condition and predicting the remaining useful life before the failure of pressurized pipes are essential for informed and timely maintenance decisions. In this work, we propose a novel method for predicting the remaining useful life of pressurized pipelines based on acoustic emission monitoring and similarity-based learning. Specifically, acoustic emission sensors are deployed to record acoustic emission events caused by cracks in the pipeline. A pipeline health indicator is proposed based on accumulated events detected through a constant false alarm rate signal detector. Leveraging the historical run-to-fail trajectories of the health indicator, a similarity measure is introduced to predict the remaining pipeline life. This method computes the similarity between the current health indicator trajectory and past trajectories based on a Euclidean distance in the proposed derivative convolutional domain. Trajectory similarity determines the remaining lifetime similarity, which is weighted using data-adaptive fuzzy rules to estimate the current remaining useful life. Elaborate experimental validations are conducted on a custom pressurized pipeline system in a laboratory setting. Experimental results demonstrate the high efficacy of the proposed method in predicting the remaining useful life of the pipeline, surpassing other commonly used methods in both accuracy and certainty.
- Published
- 2024
- Full Text
- View/download PDF
12. Robust Representation Learning via Sparse Attention Mechanism for Similarity Models
- Author
-
Alina Ermilova, Nikita Baramiia, Valerii Kornilov, Sergey Petrakov, and Alexey Zaytsev
- Subjects
Deep learning ,efficient transformer ,robust transformer ,representation learning ,similarity learning ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
The attention-based models are widely used for time series data. However, due to the quadratic complexity of attention regarding input sequence length, the application of Transformers is limited by high resource demands. Moreover, their modifications for industrial time series need to be robust to missing or noisy values, which complicates the expansion of their application horizon. To cope with these issues, we introduce the class of efficient Transformers named Regularized Transformers (Reguformers). We implement the regularization technique inspired by the dropout ideas to improve robustness and reduce computational expenses without significantly modifying the pipeline. The focus in our experiments is on oil&gas data. For well-interval similarity task, our best Reguformer configuration reaches ROC AUC 0.97, which is comparable to Informer (0.978) and outperforms baselines: the previous LSTM model (0.934), the classical Transformer model (0.967), and three recent most promising modifications of the original Transformer, namely, Performer (0.949), LRformer (0.955), and DropDim (0.777). We also conduct the corresponding experiments on three additional datasets from different domains and obtain superior results. The increase in the quality of the best Reguformer relative to Transformer for different datasets varies from 3.7% to 9.6%, while the increase range relative to Informer is wider: from 1.7% to 18.4%.
- Published
- 2024
- Full Text
- View/download PDF
13. Similarity Learning for Land Use Scene-Level Change Detection
- Author
-
Jinglei Liu, Weixun Zhou, Haiyan Guan, and Wenzhi Zhao
- Subjects
Land use ,scene change detection ,scene similarity ,similarity learning ,Ocean engineering ,TC1501-1800 ,Geophysics. Cosmic physics ,QC801-809 - Abstract
Scene-level change detection (SLCD) can provide semantic change information at image level, thus it is of great significance for monitoring land use changes. Supervised SLCD approaches tend to outperform unsupervised ones. However, the existing supervised methods rely on postclassification, which often results in unsatisfactory performance due to classification error accumulation. We therefore formulate SLCD as a similarity learning task, and propose a scene similarity learning network (SSLN) for land use SLCD. To be specific, SSLN is a two-branch network with ResNet as the backbone for feature extraction, where the global feature difference and the multiscale local feature fusion modules are considered in order to better mine the temporal correlation between bitemporal scenes. Then, the trained SSLN is further exploited to obtain the similarity of scene pairs for determining the similarity threshold via threshold traversal algorithm. Finally, the land use scenes are categorized into changed or unchanged by comparing scene similarity with the threshold. Experimental results on the publicly available MtS-WH dataset and our newly released land use scene change detection dataset show that the proposed approach achieves better performance than comparison methods, indicating that our approach is a simple yet effective solution to land use SLCD.
- Published
- 2024
- Full Text
- View/download PDF
14. Context-flexible cartography with Siamese topological neural networks
- Author
-
Pitoyo Hartono
- Subjects
Cartography ,Similarity learning ,Self-organizing maps ,Visualization ,Topological representation ,Dimensionality reduction ,Computational linguistics. Natural language processing ,P98-98.5 ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
Abstract Cartography is a technique for creating maps, which are graphical representations of spatial information. Traditional cartography involves the creation of geographical data, such as locations of countries, geographical features of mountains, rivers, and oceans, and celestial objects. However, cartography has recently been utilized to display various data, such as antigenic signatures, graphically. Hence, it is natural to consider a new cartography that can flexibly deal with various data types. This study proposes a model of Siamese topological neural networks consisting of a pair of hierarchical neural networks, each with a low-dimensional internal layer for creating context-flexible maps. The proposed Siamese topological neural network transfers high-dimensional data with various contexts into their low-dimensional spatial representations on a map that humans can use to gain insights from the data. Here, it is enough to define a metric of difference between an arbitrary pair of data instances for training the proposed neural network. As the metric can be arbitrarily defined, the proposed neural network realizes context-flexible cartography useful for visual data analysis. This paper applies the proposed network for visualizing various demographic data.
- Published
- 2024
- Full Text
- View/download PDF
15. Random forest kernel for high-dimension low sample size classification.
- Author
-
Cavalheiro, Lucca Portes, Bernard, Simon, Barddal, Jean Paul, and Heutte, Laurent
- Abstract
High dimension, low sample size (HDLSS) problems are numerous among real-world applications of machine learning. From medical images to text processing, traditional machine learning algorithms are usually unsuccessful in learning the best possible concept from such data. In a previous work, we proposed a dissimilarity-based approach for multi-view classification, the random forest dissimilarity, that perfoms state-of-the-art results for such problems. In this work, we transpose the core principle of this approach to solving HDLSS classification problems, by using the RF similarity measure as a learned precomputed SVM kernel (RFSVM). We show that such a learned similarity measure is particularly suited and accurate for this classification context. Experiments conducted on 40 public HDLSS classification datasets, supported by rigorous statistical analyses, show that the RFSVM method outperforms existing methods for the majority of HDLSS problems and remains at the same time very competitive for low or non-HDLSS problems. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
16. Context-flexible cartography with Siamese topological neural networks.
- Author
-
Hartono, Pitoyo
- Subjects
CARTOGRAPHY ,SELF-organizing maps - Abstract
Cartography is a technique for creating maps, which are graphical representations of spatial information. Traditional cartography involves the creation of geographical data, such as locations of countries, geographical features of mountains, rivers, and oceans, and celestial objects. However, cartography has recently been utilized to display various data, such as antigenic signatures, graphically. Hence, it is natural to consider a new cartography that can flexibly deal with various data types. This study proposes a model of Siamese topological neural networks consisting of a pair of hierarchical neural networks, each with a low-dimensional internal layer for creating context-flexible maps. The proposed Siamese topological neural network transfers high-dimensional data with various contexts into their low-dimensional spatial representations on a map that humans can use to gain insights from the data. Here, it is enough to define a metric of difference between an arbitrary pair of data instances for training the proposed neural network. As the metric can be arbitrarily defined, the proposed neural network realizes context-flexible cartography useful for visual data analysis. This paper applies the proposed network for visualizing various demographic data. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
17. DENSE SEMANTIC REFINEMENT USING ACTIVE SIMILARITY LEARNING.
- Author
-
Clarkson, Connor, Edwards, Michael, and Xianghua Xie
- Subjects
INTERACTIVE learning ,COMPUTER vision ,DECISION making ,CLASSIFICATION - Abstract
Defect detection has achieved state-of-the-art results in both localisation and classification of various types of defects, manufacturing domains is no exception to this. Just like in many areas of computer vision there is an assume of very high-quality datasets that have been verified by domain experts, however labelling such data has become an increasing problem as we require greater quantities of it. Within defect detection the variability and composite nature of defect characteristics makes this a time-consuming and interaction-heavy task with great amount of expert effort. We propose a new acquisition function based on the similarity of defect properties for refining labels over time by showing the expert only the most required to be labelled. We also explore different ways in which the expert labels defects and how we should feed these new refinements back into the model for utilising new knowledge in an effortful way. We achieve this with a graphical interface that provides additional information as data gets refined into a dense segmentation, allowing for decision-making with uncertain areas of the image. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
18. Multi-view subspace similarity learning based on t-SVD.
- Author
-
Tang, Rong and Lu, Gui-Fu
- Abstract
Multi-view clustering is extensively applied to practical applications. The similarity matrix acquired by most of the existing approaches is obtained by using original multi-view data. However, when dealing with data in a nonlinear subspace, the results of existing methods are not satisfactory. In addition, the existing methods cannot solve the high order relevance of multi-view data. To solve these problems, we present a novel multi-view subspace similarity learning method, MSSLt-SVD, on the basis of tensor singular value decomposition (t-SVD). First, we map each view of the data to the Hilbert space through a Gaussian kernel and then minimize the reconstruction error of the obtained kernel matrix. Second, we use the t-SVD-based tensor nuclear norm (TNN) instead of the matrix-kernel norm as the regularization term to capture the high-order relevance of multi-view data. Then, we incorporate these two steps into a framework and design the corresponding goal function, which can be solved by using the augmented lagrange multiplier (ALM) method. Experiments on some datasets show that the performance of MSSLt-SVD algorithm is better than some representative ones. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
19. Pairwise-Constraint-Guided Multi-View Feature Selection by Joint Sparse Regularization and Similarity Learning
- Author
-
Jinxi Li and Hong Tao
- Subjects
multi-view feature selection ,pairwise constraints ,weakly supervised learning ,joint subspace ,similarity learning ,Mathematics ,QA1-939 - Abstract
Feature selection is a basic and important step in real applications, such as face recognition and image segmentation. In this paper, we propose a new weakly supervised multi-view feature selection method by utilizing pairwise constraints, i.e., the pairwise constraint-guided multi-view feature selection (PCFS for short) method. In this method, linear projections of all views and a consistent similarity graph with pairwise constraints are jointly optimized to learning discriminative projections. Meanwhile, the l2,0-norm-based row sparsity constraint is imposed on the concatenation of projections for discriminative feature selection. Then, an iterative algorithm with theoretically guaranteed convergence is developed for the optimization of PCFS. The performance of the proposed PCFS method was evaluated by comprehensive experiments on six benchmark datasets and applications on cancer clustering. The experimental results demonstrate that PCFS exhibited competitive performance in feature selection in comparison with related models.
- Published
- 2024
- Full Text
- View/download PDF
20. Using Similarity Learning with SBERT to Optimize Teacher Report Embeddings for Academic Performance Prediction
- Author
-
Fateen, Menna, Mine, Tsunenori, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Wang, Ning, editor, Rebolledo-Mendez, Genaro, editor, Dimitrova, Vania, editor, Matsuda, Noboru, editor, and Santos, Olga C., editor
- Published
- 2023
- Full Text
- View/download PDF
21. Handwritten Character Evaluation and Recommendation System
- Author
-
Mittal, Rishabh, Bhadauria, Madhulika, Garg, Anchal, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Hirche, Sandra, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Möller, Sebastian, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Zhang, Junjie James, Series Editor, Singh, Yashwant, editor, Verma, Chaman, editor, Zoltán, Illés, editor, Chhabra, Jitender Kumar, editor, and Singh, Pradeep Kumar, editor
- Published
- 2023
- Full Text
- View/download PDF
22. High-Order Correlation Embedding for Large-Scale Multi-modal Hashing
- Author
-
An, Junfeng, Li, Yingjian, Zhang, Zheng, Chen, Yongyong, Lu, Guangming, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Li, Bohan, editor, Yue, Lin, editor, Tao, Chuanqi, editor, Han, Xuming, editor, Calvanese, Diego, editor, and Amagasa, Toshiyuki, editor
- Published
- 2023
- Full Text
- View/download PDF
23. Channel-Based Similarity Learning Using 2D Channel-Based Convolutional Neural Network
- Author
-
Tiwari, Ravi Shekhar, Tavares, João Manuel R. S., Series Editor, Jorge, Renato Natal, Series Editor, Frangi, Alejandro, Editorial Board Member, BAJAJ, CHANDRAJIT, Editorial Board Member, Onate, Eugenio, Editorial Board Member, Perales, Francisco José, Editorial Board Member, Holzapfel, Gerhard A., Editorial Board Member, Vilas-Boas, João, Editorial Board Member, Weiss, Jeffrey, Editorial Board Member, Middleton, John, Editorial Board Member, Garcia Aznar, Jose Manuel, Editorial Board Member, Nithiarasu, Perumal, Editorial Board Member, Tamma, Kumar K., Editorial Board Member, Cohen, Laurent, Editorial Board Member, Doblare, Manuel, Editorial Board Member, Prendergast, Patrick J., Editorial Board Member, Löhner, Rainald, Editorial Board Member, Kamm, Roger, Editorial Board Member, Li, Shuo, Editorial Board Member, Hughes, Thomas J.R., Editorial Board Member, Zhang, Yongjie, Editorial Board Member, Gupta, Mousumi, editor, Ghatak, Sujata, editor, Gupta, Amlan, editor, and Mukherjee, Abir Lal, editor
- Published
- 2023
- Full Text
- View/download PDF
24. DFU-Helper: An Innovative Framework for Longitudinal Diabetic Foot Ulcer Diseases Evaluation Using Deep Learning.
- Author
-
Toofanee, Mohammud Shaad Ally, Dowlut, Sabeena, Hamroun, Mohamed, Tamine, Karim, Duong, Anh Kiet, Petit, Vincent, and Sauveron, Damien
- Subjects
DIABETIC foot ,FOOT ,DEEP learning ,FOOT diseases ,LEG amputation ,MEDICAL personnel ,DIABETES complications - Abstract
Diabetes affects roughly 537 million people, and is predicted to reach 783 million by 2045. Diabetes Foot Ulcer (DFU) is a major complication associated with diabetes and can lead to lower limb amputation. The rapid evolution of diabetic foot ulcers (DFUs) necessitates immediate intervention to prevent the severe consequences of amputation and related complications. Continuous and meticulous patient monitoring for individuals with diabetic foot ulcers (DFU) is crucial and is currently carried out by medical practitioners on a daily basis. This research article introduces DFU-Helper, a novel framework that employs a Siamese Neural Network (SNN) for accurate and objective assessment of the progression of diabetic foot ulcers (DFUs) over time. DFU-Helper provides healthcare professionals with a comprehensive visual and numerical representation in terms of the similarity distance of the disease, considering five distinct disease conditions: none, infection, ischemia, both (presence of ischemia and infection), and healthy. The SNN achieves the best Macro F1-score of 0.6455 on the test dataset when applying pseudo-labeling with a pseudo-threshold set to 0.9. The SNN is used in the process of creating anchors for each class using feature vectors. When a patient initially consults a healthcare professional, an image is transmitted to the model, which computes the distances from each class anchor point. It generates a comprehensive table with corresponding figures and a visually intuitive radar chart. In subsequent visits, another image is captured and fed into the model alongside the initial image. DFU-Helper then plots both images and presents the distances from the class anchor points. Our proposed system represents a significant advancement in the application of deep learning for the longitudinal assessment of DFU. To the best of our knowledge, no existing tool harnesses deep learning for DFU follow-up in a comparable manner. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
25. Multi-Task Learning and Gender-Aware Fashion Recommendation System Using Deep Learning.
- Author
-
Naham, Al-Zuhairi, Wang, Jiayang, and Raeed, Al-Sabri
- Subjects
RECOMMENDER systems ,MODELS (Persons) ,DEEP learning ,FASHION marketing ,BUSINESS revenue ,SOCIAL media - Abstract
Many people wonder, when they look at fashion models on social media or on television, whether they could look like them by wearing similar products. Furthermore, many people suffer when they sometimes find fashion models in e-commerce, and they want to obtain similar products, but after clicking on the fashion model, they receive unwanted products or products for the opposite gender. To address these issues, in our work, we built a multi-task learning and gender-aware fashion recommendation system (MLGFRS). The proposed MLGFRS can increase the revenue of the e-commerce fashion market. Moreover, we realized that people are accustomed to clicking on that part of the fashion model, which includes the product they want to obtain. Therefore, we classified the query image into many cropped products to detect the user's click. What makes this paper novel is that we contributed to improving the efficiency performance by detecting the gender from the query image to reduce the retrieving time. Second, we effectively improved the quality of results by retrieving similarities for each object in the query image to recommend the most relevant products. The MLGFRS consists of four components: gender detection, object detection, similarity generation, and recommendation results. The MLGFRS achieves better performance compared to the state-of-the-art baselines. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
26. Approximating dynamic time warping with a convolutional neural network on EEG data.
- Author
-
Lerogeron, Hugo, Picot-Clémente, Romain, Rakotomamonjy, Alain, and Heutte, Laurent
- Subjects
- *
CONVOLUTIONAL neural networks , *TIME series analysis , *ELECTROENCEPHALOGRAPHY , *EUCLIDEAN distance - Abstract
• We propose a fast and differentiable approximation of DTW by comparing two architectures. • Our method achieves at least the same level of accuracy as other DTW main approximations with higher computational efficiency. • It can be used to learn in an end-to-end setting on long time series by proposing generative models of EEGs. Dynamic Time Warping (DTW) is a widely used algorithm for measuring similarities between two time series. It is especially valuable in a wide variety of applications, such as clustering, anomaly detection, classification, or video segmentation, where the time series have different timescales, are irregularly sampled, or are shifted. However, it is not prone to be considered as a loss function in an end-to-end learning framework because of its non-differentiability and its quadratic temporal complexity. While differentiable variants of DTW have been introduced by the community, they still present some drawbacks: computing the distance is still expensive, and this similarity tends to blur some differences in the time series. In this paper, we propose a fast and differentiable approximation of DTW by comparing two architectures: the first one aims to learn an embedding in which the Euclidean distance mimics the DTW, and the second one to directly predict the DTW output value using regression. We build the former by training a siamese neural network to regress the DTW value between two time series. Depending on the nature of the activation function, this approximation naturally supports differentiation, and it is efficient to compute. We show, in a time series retrieval context on EEG datasets, that our methods achieve at least the same level of accuracy as other DTW main approximations with higher computational efficiency. We also show that it can be used to learn in an end-to-end setting on long time series by proposing generative models of EEGs. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
27. Vessel Trajectory Similarity Computation Based on Heterogeneous Graph Neural Network.
- Author
-
Luo, Sizhe and Zeng, Weiming
- Subjects
DATA analysis - Abstract
As AIS data play an increasingly important role in intelligent shipping and shipping regulation, research on AIS trajectories has attracted more attention. Effective measurement is a critical issue in AIS trajectory research. It directly impacts downstream research areas such as anomaly detection, trajectory clustering, and trajectory prediction. However, the extremely time-consuming and labor-intensive traditional pairwise methods for calculating different types of distances between trajectories hinders the large-scale application and further analysis of AIS data. To tackle these issues, we introduce AISim—a metric learning framework that utilizes heterogeneous graph neural networks. This framework includes a spatial pre-training graph and a hierarchical heterogeneous graph, which incorporate spatial and sequential dependency to extract latent features from vessel trajectories. This approach enhances the model's ability to capture a more accurate representation of the trajectories and approximate various similarity measurements. Extensive experiments on multiple real trajectory datasets have verified the effectiveness and generality of the proposed framework. AISim outperforms advanced learning-based models by 5% to 66% on the HR10 metric in top-k search tasks. The experimental results demonstrate that the proposed framework facilitates research on AIS trajectory similarity learning, thereby promoting the development of AIS trajectory analysis. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
28. Noncontrastive Representation Learning for Intervals From Well Logs.
- Author
-
Marusov, Alexander E. and Zaytsev, Alexey
- Abstract
The representation learning problem in the oil and gas industry aims to construct a model that provides a representation based on logging data for a well interval. Previous attempts are mainly supervised and focus on similarity task, which estimates closeness between intervals. We desire to build informative representations without using supervised (labeled) data. One of the possible approaches is self-supervised learning (SSL). In contrast to the supervised paradigm, this one requires little or no labels for the data. Nowadays, most SSL approaches are either contrastive or noncontrastive. Contrastive methods make representations of similar (positive) objects closer and distancing different (negative) ones. Due to possible wrong marking of positive and negative pairs, these methods can provide an inferior performance. Noncontrastive methods do not rely on such labeling and are widespread in computer vision. They learn using only pairs of similar objects that are easier to identify in logging data. We are the first to introduce noncontrastive SSL for well-logging data. In particular, we exploit Bootstrap Your Own Latent (BYOL) and Barlow Twins methods that avoid using negative pairs and focus only on matching positive pairs. The crucial part of these methods is an augmentation strategy. Our augmentation strategies and adaption of BYOL and Barlow Twins together allow us to achieve superior quality on clusterization and mostly the best performance on different classification tasks. Our results prove the usefulness of the proposed noncontrastive self-supervised approaches for representation learning and interval similarity in particular. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
29. A multi-criteria point of interest recommendation using the dominance concept.
- Author
-
Davtalab, Mehri and Alesheikh, Ali Asghar
- Abstract
The learning similarity between users and points of interests (POIs) is an important function in location-based social networks (LBSN), which could primarily benefit multiple location-based services, especially in terms of POI recommendation. As one of the well-known recommender technologies, Collaborative Filtering (CF) has been employed to a great extent in literature, due to its simplicity and interpretability. However, it is facing a great challenge in generating accurate similarities between users or items, because of data sparsity. Traditional similarity measures which rely on explicit user feedback (e.g., rating) are not applicable for implicit feedback (e.g., check-ins). In this study, we propose multi-criteria user–user and POI–POI similarity measures, based on the dominance concept. In this regard, we incorporate geographical, temporal, social, preferential and textual criteria into the similarity measures of CF. We are interested in measuring POI similarity from a location perspective, by taking into account the influence of the dominance concept on multiple dimensions of POIs. To evaluate the effectiveness of our method, a series of experiments are conducted with a large-scale real dataset, collected from the Foursquare of two cities in terms of POI recommendation. Experimental results revealed that the proposed method significantly outperforms the existing state-of-the-art alternatives. A further experiment demonstrates the superiority of the proposed method in alleviating sparsity and handling the cold-start problem affecting POI recommendation. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
30. ACTIVE ANCHORS: SIMILARITY BASED REFINEMENT LEARNING.
- Author
-
Clarkson, Connor, Edwards, Michael, and Xianghua Xie
- Subjects
STEEL manufacture ,MANUFACTURING defects ,GRAPHICS processing units ,ACTIVE learning ,DECISION making - Abstract
Defect detection in steel manufacturing has achieved state-of-the-art results in both localisation and classification on various types of defects, however, this assumes very high-quality datasets that have been verified by domain experts. Labelling such data has become a time-consuming and interaction-heavy task with a great amount of user effort, this is due to variability in the defect characteristics and composite nature. We propose a new acquisition function based on the similarity of defects for refining labels over time by showing the user only the most required to be labelled. We also explore different ways in which to feed these new refinements back into the model to utilize the new knowledge in an effortful way. We achieve this with a graphical interface that provides additional information to the domain expert as the data gets refined, allowing for decision-making with uncertain areas of the steel. [ABSTRACT FROM AUTHOR]
- Published
- 2023
31. COSINER: COntext SImilarity data augmentation for Named Entity Recognition
- Author
-
Bartolini, Ilaria, Moscato, Vincenzo, Postiglione, Marco, Sperlì, Giancarlo, Vignali, Andrea, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Skopal, Tomáš, editor, Falchi, Fabrizio, editor, Lokoč, Jakub, editor, Sapino, Maria Luisa, editor, Bartolini, Ilaria, editor, and Patella, Marco, editor
- Published
- 2022
- Full Text
- View/download PDF
32. Multiview Learning via Non-negative Matrix Factorization for Clustering Applications
- Author
-
Chen, Jiajia, Li, Ao, Li, Jie, Wang, Yangwei, Akan, Ozgur, Editorial Board Member, Bellavista, Paolo, Editorial Board Member, Cao, Jiannong, Editorial Board Member, Coulson, Geoffrey, Editorial Board Member, Dressler, Falko, Editorial Board Member, Ferrari, Domenico, Editorial Board Member, Gerla, Mario, Editorial Board Member, Kobayashi, Hisashi, Editorial Board Member, Palazzo, Sergio, Editorial Board Member, Sahni, Sartaj, Editorial Board Member, Shen, Xuemin, Editorial Board Member, Stan, Mircea, Editorial Board Member, Jia, Xiaohua, Editorial Board Member, Zomaya, Albert Y., Editorial Board Member, Shi, Shuo, editor, Ma, Ruofei, editor, and Lu, Weidang, editor
- Published
- 2022
- Full Text
- View/download PDF
33. Attribute-guided and attribute-manipulated similarity learning network for fashion image retrieval.
- Author
-
Wan, Yongquan, Yan, Cairong, Zou, Guobing, and Zhang, Bofeng
- Subjects
- *
IMAGE retrieval - Abstract
Learning the similarity between fashion items is essential for many fashion-related tasks. Most methods based on global or local image similarity cannot meet the fine-grained retrieval requirements related to attributes. We are the first to clearly distinguish the concepts of attribute name and their values and divide fashion retrieval tasks that combine images and text into: attribute-guided retrieval and attribute-manipulated retrieval. We propose a hierarchical attribute-aware embedding network (HAEN) that takes images and attributes as input, learns multiple attribute-specific embedding spaces, and measures fine-grained similarity in the corresponding spaces. It can accurately map different attributes to the corresponding areas of the image, thereby facilitating the feature fusion of two different modalities of text and image, including enhancement and replacement. Then on this basis, we propose three attribute-manipulated similarity learning methods, HAEN_Avg, HAEN_Rec, and HAEN_Cmb. With comprehensive validation on two real-world fashion datasets, we demonstrate that our methods can effectively leverage semantic knowledge to improve image retrieval performance, including attribute-guided and attribute-manipulated retrieval tasks. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
34. Dual attention composition network for fashion image retrieval with attribute manipulation.
- Author
-
Wan, Yongquan, Zou, Guobing, Yan, Cairong, and Zhang, Bofeng
- Subjects
- *
IMAGE retrieval , *AFFINE transformations - Abstract
Due to practical demands and substantial potential benefits, there is growing interest in fashion image retrieval with attribute manipulation. For example, if a user wants a product similar to a query image and has the attribute "3/4 sleeves" instead of "short sleeves" he can modify the query image by entering text. Unlike general items, fashion items are rich in categories and attributes, and some items with different attributes have only very subtle differences in vision. Moreover, the visual appearance of fashion items changes dramatically under different conditions, such as lighting, viewing angle, and occlusion. These pose challenges to the fashion retrieval task. Therefore, we consider learning an attribute-specific space for each attribute to obtain discriminative features. In this paper, we propose a dual attention composition network for image retrieval with manipulation, which addresses two important issues, where to focus and how to modify. The dual attention module aims to capture fine-grained image-text alignment through corresponding spatial and channel attention and then satisfy multi-modal composition through corresponding affine transformation. The TIRG-based semantic composition module combines the query image's attention features and the manipulation text's embedding features to obtain a synthetic representation close to the target image. Meanwhile, we investigate the semantic hierarchy of attributes and propose a hierarchical encoding method, which can preserve the associations between attributes for efficient feature learning. Extensive experiments conducted on three multi-modal fashion-related retrieval datasets demonstrate the superiority of our network. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
35. Query-adaptive training data recommendation for cross-building predictive modeling.
- Author
-
Labiadh, Mouna, Obrecht, Christian, Ferreira da Silva, Catarina, Ghodous, Parisa, and Benabdeslem, Khalid
- Subjects
DEEP learning ,MACHINE learning ,PREDICTION models - Abstract
Predictive modeling in buildings is a key task for the optimal management of building energy. Relevant building operational data are a prerequisite for such task, notably when deep learning is used. However, building operational data are not always available, such is the case in newly built, newly renovated, or even not yet built buildings. To address this problem, we propose a deep similarity learning approach to recommend relevant training data to a target building solely by using a minimal contextual description on it. Contextual descriptions are modeled as user queries. We further propose to ensemble most used machine learning algorithms in the context of predictive modeling. This contributes to the genericity of the proposed methodology. Experimental evaluations show that our methodology offers a generic methodology for cross-building predictive modeling and achieves good generalization performance. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
36. Multi-view spectral clustering with latent representation learning for applications on multi-omics cancer subtyping.
- Author
-
Ge, Shuguang, Liu, Jian, Cheng, Yuhu, Meng, Xiaojing, and Wang, Xuesong
- Subjects
- *
MULTIOMICS , *GLOBAL method of teaching ,CANCER case studies - Abstract
Driven by multi-omics data, some multi-view clustering algorithms have been successfully applied to cancer subtypes prediction, aiming to identify subtypes with biometric differences in the same cancer, thereby improving the clinical prognosis of patients and designing personalized treatment plan. Due to the fact that the number of patients in omics data is much smaller than the number of genes, multi-view spectral clustering based on similarity learning has been widely developed. However, these algorithms still suffer some problems, such as over-reliance on the quality of pre-defined similarity matrices for clustering results, inability to reasonably handle noise and redundant information in high-dimensional omics data, ignoring complementary information between omics data, etc. This paper proposes multi-view spectral clustering with latent representation learning (MSCLRL) method to alleviate the above problems. First, MSCLRL generates a corresponding low-dimensional latent representation for each omics data, which can effectively retain the unique information of each omics and improve the robustness and accuracy of the similarity matrix. Second, the obtained latent representations are assigned appropriate weights by MSCLRL, and global similarity learning is performed to generate an integrated similarity matrix. Third, the integrated similarity matrix is used to feed back and update the low-dimensional representation of each omics. Finally, the final integrated similarity matrix is used for clustering. In 10 benchmark multi-omics datasets and 2 separate cancer case studies, the experiments confirmed that the proposed method obtained statistically and biologically meaningful cancer subtypes. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
37. Consensus similarity learning based on tensor nuclear norm.
- Author
-
Tang, Rong and Lu, Gui-Fu
- Abstract
Clustering approaches based on similarity learning have achieved good results, but they still have the following problems: (1) these approaches generally learn similar expressions on the original data, thereby disregarding the nonlinear structure of the data; (2) these methods generally do not consider the consistency and high-order relevance among multi-view data; and (3) these approaches generally use the learned similarity matrix for clustering, usually not achieving the optimal effect. To resolve the above issues, we present a new approach referred to as consensus similarity learning based on tensor nuclear norm. First, to address the first problem, we map the data of each view to the Hilbert space to discover the nonlinear structure of the data. Second, to address the second problem, we introduce the tensor nuclear norm to constrain the regularization term, and then, the consistency and high-order relevance among multi-view data can be captured. Third, to address the third problem, i.e., to obtain a better clustering effect, we learn a clustering indicator matrix in the kernel space instead of a similarity matrix for clustering by using a consensus representation term. Last, we incorporate these three steps into a unified framework and design the corresponding goal function. In addition, experimental outcomes on some datasets show that our algorithm is superior to certain representative approaches. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
38. Similarity Heuristics for Clustering Wells Based on Logging-Data.
- Author
-
Khliustov, D. K., Kovalev, D. Y., and Safonov, S. S.
- Abstract
Accurate clustering of oil wells is important for lithological studies and hydrocarbon production processes. Traditional methods for solving this problem require large amount of expert work, including careful analysis of high-dimensional datasets. Modern approaches to automatizing clustering are mostly based on deep neural networks (DNN) with complex architecture, which require significant training time and lack interpretability. This paper analyses methods based on simple similarity heuristics, which stem from reasonable assumptions on data generating process and can be interpreted in terms of statistics and geometry. For dataset labeled by experts, clustering is performed by means of computed heuristics, and the quality of algorithm is measured by Adjusted Rand Index (ARI). Results thus obtained turn out to be comparable to those of modern DNN models (0.41 vs 0.37), but the computation time is reduced significantly. For some types of heuristics physical interpretation is suggested and approaches to obtaining geological insights are studied. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
39. Content-based image retrieval for industrial material images with deep learning and encoded physical properties.
- Author
-
Myung Seok Shim, Thiele, Christopher, Vila, Jeremy, Saxena, Nishank, and Hohl, Detlef
- Subjects
DEEP learning ,IMAGE retrieval ,COMPUTED tomography ,METADATA ,IMAGE compression - Abstract
Industrial materials images are an important application domain for content-based image retrieval. Users need to quickly search databases for images that exhibit similar appearance, properties, and/or features to reduce analysis turnaround time and cost. The images in this study are 2D images of millimeter-scale rock samples acquired at micrometer resolution with light microscopy or extracted from 3D micro-CT scans. Labeled rock images are expensive and time-consuming to acquire and thus are typically only available in the tens of thousands. Training a high-capacity deep learning (DL) model from scratch is therefore not practicable due to data paucity. To overcome this "few-shot learning" challenge, we propose leveraging pretrained common DL models in conjunction with transfer learning. The "similarity" of industrial materials images is subjective and assessed by human experts based on both visual appearance and physical qualities. We have emulated this human-driven assessment process via a physics-informed neural network including metadata and physical measurements in the loss function. We present a novel DL architecture that combines Siamese neural networks with a loss function that integrates classification and regression terms. The networks are trained with both image and metadata similarity (classification), and with metadata prediction (regression). For efficient inference, we use a highly compressed image feature representation, computed offline once, to search the database for images similar to a query image. Numerical experiments demonstrate superior retrieval performance of our new architecture compared with other DL and custom-featurebased approaches. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
40. Detecting Compressed Deepfake Images Using Two-Branch Convolutional Networks with Similarity and Classifier.
- Author
-
Chen, Ping, Xu, Ming, and Wang, Xiaodong
- Subjects
- *
IMAGE compression , *ARTIFICIAL neural networks , *FEATURE extraction - Abstract
As a popular technique for swapping faces with someone else's in images or videos through deep neural networks, deepfake causes a serious threat to the security of multimedia content today. However, because counterfeit images are usually compressed when propagating over the Internet, and because the compression factor used is unknown, most of the existing deepfake detection models have poor robustness for the detection of compressed images with unknown compression factors. To solve this problem, we notice that an image has a high similarity with its compressed image based on symmetry, and this similarity is not easily affected by the compression factor, so this similarity feature can be used as an important clue for compressed deepfake detection. A TCNSC (Two-branch Convolutional Networks with Similarity and Classifier) method that combines compression factor independence is proposed in this paper. The TCNSC method learns two feature representations from the deepfake image, i.e., similarity of the image and its compressed counterpart and authenticity of the deepfake image. A joint training strategy is then utilized for feature extraction, in which the similarity characteristics are obtained by similarity learning while obtaining authenticity characteristics, so the proposed TCNSC model is trained for robust feature learning. Experimental results on the FaceForensics++ (FF++) dataset show that the proposed method significantly outperforms all competing methods under three compression settings of high-quality (HQ), medium-quality (MQ), and low-quality (LQ). For the LQ, MQ, and HQ settings, TCNSC achieves 91.8%, 93.4%, and 95.3% in accuracy, and outperforms the state-of-art method (Xception-RAW) by 16.9%, 10.1%, and 4.1%, respectively. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
41. DFU-Helper: An Innovative Framework for Longitudinal Diabetic Foot Ulcer Diseases Evaluation Using Deep Learning
- Author
-
Mohammud Shaad Ally Toofanee, Sabeena Dowlut, Mohamed Hamroun, Karim Tamine, Anh Kiet Duong, Vincent Petit, and Damien Sauveron
- Subjects
DFU and deep learning ,CNN ,Vision Image Transformers ,Siamese Neural Network ,similarity learning ,longitudinal disease evaluation ,Technology ,Engineering (General). Civil engineering (General) ,TA1-2040 ,Biology (General) ,QH301-705.5 ,Physics ,QC1-999 ,Chemistry ,QD1-999 - Abstract
Diabetes affects roughly 537 million people, and is predicted to reach 783 million by 2045. Diabetes Foot Ulcer (DFU) is a major complication associated with diabetes and can lead to lower limb amputation. The rapid evolution of diabetic foot ulcers (DFUs) necessitates immediate intervention to prevent the severe consequences of amputation and related complications. Continuous and meticulous patient monitoring for individuals with diabetic foot ulcers (DFU) is crucial and is currently carried out by medical practitioners on a daily basis. This research article introduces DFU-Helper, a novel framework that employs a Siamese Neural Network (SNN) for accurate and objective assessment of the progression of diabetic foot ulcers (DFUs) over time. DFU-Helper provides healthcare professionals with a comprehensive visual and numerical representation in terms of the similarity distance of the disease, considering five distinct disease conditions: none, infection, ischemia, both (presence of ischemia and infection), and healthy. The SNN achieves the best Macro F1-score of 0.6455 on the test dataset when applying pseudo-labeling with a pseudo-threshold set to 0.9. The SNN is used in the process of creating anchors for each class using feature vectors. When a patient initially consults a healthcare professional, an image is transmitted to the model, which computes the distances from each class anchor point. It generates a comprehensive table with corresponding figures and a visually intuitive radar chart. In subsequent visits, another image is captured and fed into the model alongside the initial image. DFU-Helper then plots both images and presents the distances from the class anchor points. Our proposed system represents a significant advancement in the application of deep learning for the longitudinal assessment of DFU. To the best of our knowledge, no existing tool harnesses deep learning for DFU follow-up in a comparable manner.
- Published
- 2023
- Full Text
- View/download PDF
42. Learning a Similarity Metric Discriminatively with Application to Ancient Character Recognition
- Author
-
Liu, Xuxing, Tang, Xiaoqin, Chen, Shanxiong, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Qiu, Han, editor, Zhang, Cheng, editor, Fei, Zongming, editor, Qiu, Meikang, editor, and Kung, Sun-Yuan, editor
- Published
- 2021
- Full Text
- View/download PDF
43. VAN: Versatile Affinity Network for End-to-End Online Multi-object Tracking
- Author
-
Lee, Hyemin, Kim, Inhan, Kim, Daijin, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Ishikawa, Hiroshi, editor, Liu, Cheng-Lin, editor, Pajdla, Tomas, editor, and Shi, Jianbo, editor
- Published
- 2021
- Full Text
- View/download PDF
44. Improving Deep Metric Learning by Divide and Conquer.
- Author
-
Sanakoyeu, Artsiom, Ma, Pingchuan, Tschernezki, Vadim, and Ommer, Bjorn
- Subjects
- *
DEEP learning , *COMPUTER vision , *IMAGE retrieval , *APPLICATION software , *SUBSPACES (Mathematics) , *PHENYLKETONURIA , *VISUAL cryptography - Abstract
Deep metric learning (DML) is a cornerstone of many computer vision applications. It aims at learning a mapping from the input domain to an embedding space, where semantically similar objects are located nearby and dissimilar objects far from another. The target similarity on the training data is defined by user in form of ground-truth class labels. However, while the embedding space learns to mimic the user-provided similarity on the training data, it should also generalize to novel categories not seen during training. Besides user-provided groundtruth training labels, a lot of additional visual factors (such as viewpoint changes or shape peculiarities) exist and imply different notions of similarity between objects, affecting the generalization on the images unseen during training. However, existing approaches usually directly learn a single embedding space on all available training data, struggling to encode all different types of relationships, and do not generalize well. We propose to build a more expressive representation by jointly splitting the embedding space and the data hierarchically into smaller sub-parts. We successively focus on smaller subsets of the training data, reducing its variance and learning a different embedding subspace for each data subset. Moreover, the subspaces are learned jointly to cover not only the intricacies, but the breadth of the data as well. Only after that, we build the final embedding from the subspaces in the conquering stage. The proposed algorithm acts as a transparent wrapper that can be placed around arbitrary existing DML methods. Our approach significantly improves upon the state-of-the-art on image retrieval, clustering, and re-identification tasks evaluated using CUB200-2011, CARS196, Stanford Online Products, In-shop Clothes, and PKU VehicleID datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
45. Wool fabric image retrieval based on soft similarity and listwise learning.
- Author
-
Xiang, Jun, Zhang, Ning, Pan, Ruru, and Gao, Weidong
- Subjects
WOOL textiles ,IMAGE retrieval ,WOOL ,CONTENT-based image retrieval ,ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,IMAGE databases - Abstract
As a special case in content-based image retrieval, fabric retrieval has high potential application value in many fields. However, fabric retrieval has higher requirements for results, which makes it difficult for common retrieval methods to be directly applied to fabric retrieval. It is also a challenging issue with several obstacles: variety and complexity of fabric appearance, and high requirements for retrieval accuracy. To address this issue, this paper presents a novel method for fabric image retrieval based on soft similarity and pairwise learning. First, a soft similarity between two fabric images is defined to describe their relationship. Then, a convolutional neural network with compact structure and cross-domain connections is designed to learn the fabric image representation. Finally, listwise learning is introduced to train the convolutional neural network model and hash function. The generated hash codes are used to index the fabric image. The experiments are conducted on a wool fabric dataset. The experimental results show that the newly proposed method has a greater improvement than our previous work. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
46. Improving Image Similarity Learning by Adding External Memory.
- Author
-
Gao, Xinjian, Mu, Tingting, Goulermas, John Y., Song, Jingkuan, and Wang, Meng
- Subjects
- *
ARTIFICIAL intelligence , *ARTIFICIAL neural networks , *MEMORY , *BIOLOGICAL neural networks , *IMAGE retrieval - Abstract
The type of neural networks widely used in artificial intelligence applications mixes its computation and memory modules in neuron weights and activities. The previously learned information are stored in network weights. When dealing with complex data, e.g., those possessing diverse content or containing long-sequences, some information stored in the weights can be altered drastically or wiped as the training goes, but they are not necessarily unimportant. External memory is a recent technique proposed to prevent from forgetting significant previously learned information. In this work, we aim at taking advantage of this recent technique to advance the similarity learning task that is critical in many real-world artificial intelligence applications. We propose suitable external memory design supported by extended attention mechanism. Two different kinds of memory modules are proposed so that the similarity learning process can dynamically shift focus over a wide range of diverse content contained by the training data. Effectiveness of the proposed method is demonstrated through evaluations based on different image retrieval tasks and compared against various state-of-the-art algorithms in the field. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
47. Two-step learning for crowdsourcing data classification.
- Author
-
Yu, Hao, Li, Jiaye, Wu, Zhaojiang, Xu, Hang, and Zhu, Lei
- Subjects
CROWDSOURCING ,CLASSIFICATION algorithms ,MACHINE learning ,BIG data ,KNOWLEDGE workers - Abstract
Crowdsourcing learning (Bonald and Combes 2016; Dawid and Skene, J R Stat Soc: Series C (Appl Stat), 28(1):20–28 1979; Karger et al. 2011; Li et al, IEEE Trans Knowl Data Eng, 28(9):2296–2319 2016; Liu et al. 2012; Schlagwein and Bjorn-Andersen, J Assoc Inform Syst, 15(11):3 2014; Zhang et al. 2014) plays an increasingly important role in the era of big data (Liu et al., IEEE Trans Syst Man Cybern: Syst, 48(12): 451–2461, 2017; Zhang et al. 2014) due to its ability to easily solve large-scale data annotations (Musen et al., J Amer Med Informs Assoc, 22(6):1148–1152 2015). However, in the process of crowdsourcing learning, the uneven knowledge level of workers often leads to low accuracy of the label after marking, which brings difficulties to the subsequent processing (Edwards and Teddy 2013) and analysis of crowdsourcing data. In order to solve this problem, this paper proposes a two-step learning crowdsourced data classification algorithm, which optimizes the original label data by simultaneously considering the two issues of different worker abilities and the similarity between crowdsourced data (Kasikci et al. 2013) samples, so as to get more accurate label data. The two-step learning algorithm mainly includes two steps. Firstly, the worker's ability to label different samples is obtained by constructing and training the worker's ability model, and then the similarity between samples is calculated by the cosine measurement method (Muflikhah and Baharudin 2009), and finally the original label data is optimized by combining the above two results. The experimental results also show that the two-step learning classification algorithm proposed in this article has achieved better experimental results than the comparison algorithm. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
48. Generalized Locally-Linear Embedding: A Neural Network Implementation
- Author
-
Lu, Xiao, Kang, Zhao, Tang, Jiachun, Xie, Shuang, Su, Yuanzhang, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Kotenko, Igor, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Zhang, Haijun, editor, Zhang, Zhao, editor, Wu, Zhou, editor, and Hao, Tianyong, editor
- Published
- 2020
- Full Text
- View/download PDF
49. Mobility Irregularity Detection with Smart Transit Card Data
- Author
-
Wang, Xuesong, Yao, Lina, Liu, Wei, Li, Can, Bai, Lei, Waller, S. Travis, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Lauw, Hady W., editor, Wong, Raymond Chi-Wing, editor, Ntoulas, Alexandros, editor, Lim, Ee-Peng, editor, Ng, See-Kiong, editor, and Pan, Sinno Jialin, editor
- Published
- 2020
- Full Text
- View/download PDF
50. NSEEN: Neural Semantic Embedding for Entity Normalization
- Author
-
Fakhraei, Shobeir, Mathew, Joel, Ambite, José Luis, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Brefeld, Ulf, editor, Fromont, Elisa, editor, Hotho, Andreas, editor, Knobbe, Arno, editor, Maathuis, Marloes, editor, and Robardet, Céline, editor
- Published
- 2020
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.