Language: chinese / Search Limiters: Academic (Peer-Reviewed) Journals / Topic: 3 selected - Searchworks@Jio Institute Digital Library Search Results

1. 基于 Transformer 人像关键点检测网络的研究.

Author: 陈凯, 林珊玲, 林坚普, 林志贤, 缪志辉, and 郭太良
Subjects: *PARALLEL processing, *COMPUTER vision, *DEEP learning, *PROBLEM solving, *ALGORITHMS, *LOCATION problems (Programming), *PARALLEL algorithms
Abstract: In order to address the shortcomings of the facial landmarks detection models, which cannot model the relations between long-distance landmarks, this paper proposed a parallel multi-branch architecture combining with convolution and Transformer for facial landmarks tasks, called MCTN, it utilized the dynamic attention mechanism to model the long-distance relations between facial landmarks. The multi-branch parallel structure designing allowed MCTN to include shared weights, global information fusion and other merits. What’s more, this paper proposed the novel Transformer structure, Deformer, which could make the MCTN focused attention weights faster on sparse and meaningful locations and solved the problem of slow convergence of Transformer. MCTN reached 4.33%,3.12% and 3.15% normalized average error respectively on the WFLW,300W and COFW datasets, the results show that MCTN utilizes Transformer with CNN multi-branch parallel structure and Deformer structure to dramatically outperform other facial landmarks localization algorithms based on convolution network. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

2. 保留低阶和高阶关系的图表示深度学习集成算法.

Author: 欧阳勐涔, 张应龙, 夏学文, and 徐星
Subjects: *DEEP learning, *REPRESENTATIONS of graphs, *FEEDFORWARD neural networks, *PROBLEM solving, *ALGORITHMS, *SHALLOW-water equations
Abstract: High-quality learning low-dimensional representation of nodes in the graph is a current research hotspot. The existing shallow model methods cannot capture the nonlinear relationship of the graph structure, and the graph convolution model in the graph neural network technology will cause an over-smoothing problem. At the same time, how to determine the role of different hop number relationships in graph representation learning is also a problem that needs to be solved in the research. To solve the above problems, this paper proposes a deep learning model based on T（T＞1） feedforward neural networks. The framework uses deep learning models to extract the nonlinear relationship of the graph structure, and T sub-models effectively capture the local and global（higher-order） relationship information of the graph, and they give different roles in the final vector representation to take advantage of different hop relations. Experimental results on vertex classification and link prediction tasks show that the framework is competitive with existing methods, the benchmark algorithm can be improved by about 20%. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

3. 基于稀疏特征改进的单视图表面重建.

Author: 梁春阳, 唐红梅, 席建锐, and 刘鑫
Subjects: *SURFACE reconstruction, *FEATURE extraction, *SPECTRUM analysis, *DEEP learning, *PROBLEM solving, *ALGORITHMS
Abstract: Single-view 3D reconstruction based on deep learning is a research hot spot at present. In order to discover more high-frequency details, SDF-SRN algorithm introduces positional encoding, but neural network is easy to overfit without accurate supervision, and reconstructs uneven surface. To solve the problem, this paper proposed the network model based on sparse feature. The model enabled the network that preferred to overfitting to predict high-frequency residual by residual learning. The feature extraction network extracted sparse features and the global features. Then one hypernetwork took the sparse features as input and generated prediction shallow head. This shallow head predicted low-frequency part of signed distance function. Another hypernetwork took global features as input and generated another shallow head. This shallow head predicted high-frequency residual. It fused two predictions of shallow heads into final signed distance function. Spectrum analysis shows that the design purpose of network is achieved. Compared with other smooth surface reconstruction schemes, the network can achieve smoother surface reconstruction with enough details. It overcomes the overfitting of SDF-SRN. The qualitative and quantitative comparison with other advanced single-view reconstruction approaches show the superiority of the proposed approach. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

4. 基于图神经网络与深度学习的商品推荐算法.

Author: 冯兴杰 and 生晓宇
Subjects: *PROBLEM solving, *GRAPH algorithms, *DEEP learning, *RECOMMENDER systems, *ALGORITHMS
Abstract: The recommendation algorithm based on graph neural network can extract the association relationship between users and goods. Traditional methods can’t extract this relationship. At present, most of these algorithms ignore the general prefe-rences in the review data of users and products. In order to solve this problem, this paper proposed a new method. This me-thod used the graph neural network to extract association relations, and the advantage of deep learning to extract the general preferences, and carried out feature fusion to improve the recommendation effect. This paper conducted comparative experiments and ablation experiments on four sets of public data sets to verify the effectiveness of the proposed method. The evaluation indexes include the recall rate and normalized discounted cumulative gain. Experiments show that this method is more effective than the existing algorithms. The feature fusion of the two networks can improve the recommendation effect. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

5. 基于深度学习的遥感图像去雾算法.

Author: 李玉峰, 任静波, and 黄煜峰
Subjects: *DEEP learning, *CONVOLUTIONAL neural networks, *REMOTE sensing, *PROBLEM solving, *ALGORITHMS
Abstract: This paper presented an image dehazing method based on deep learning to solve the problem that the remote sensing images had reduced image sharpness due to haze. Firstly, this algorithm distorted the original atmospheric scattering model to get an end-to-end defrosting model. Then, it unified several unknown parameters to one parameter and estimated the unknown parameters by using multiscale convolution neural network. Finally, it brought the parameter estimates into the defrosting model to get a haze-free image. For no reference image dataset, this paper used the existing dataset to preliminary train the network, then added the self-built dataset to secondary train the network. The experimental results show that, compared with the related defrosting algorithms, the proposed algorithm improves the visual effect and objective index to some extent, and effectively improves the clarity of remote sensing images in haze weather conditions. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

6. 基于卷积神经网络和语义相关的协同显著性检测.

Author: 张华迪, 樊　玮, and 黄　睿
Subjects: *CONVOLUTIONAL neural networks, *ALGORITHMS, *DEFINITIONS, *PROBLEM solving, *DEEP learning
Abstract: To solve the problems that the objects with different semantic classes are identified as co-salient objects in current co-saliency detection methods, this paper proposed a CNN and semantic correction-based co-saliency detection method (CSCCD). The proposed method first adopted the guided super pixel filter to process the super pixels obtained by SLIC and the saliency results generated by DSS, which showed clear object boundaries. Then it utilized Mask R-CNN to extract semantic features. It proposed the definitions of image semantic feature and semantic consistency. It also defined the image group semantic correction to solve the problem that detected the objects with different pose belonging to a semantic class as different semantic classes. With the concept, this paper defined image group semantic correlation class, solving semantic correlation problem of multiple images. It generated the final co-saliency detection results by fusing the saliency detection regions with the image group semantic consistent regions. The experimental results on public benchmark datasets show that this algorithm can effectively highlight the whole and outline of the object, and its comprehensive performance in objective quantification is obviously improved. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

7. 用于图像超分辨的密集跳跃注意连接网络.

Author: 吴荣贵 and 蒋　平
Subjects: *GOAL (Psychology), *ALGORITHMS, *HIGH resolution imaging, *PROBLEM solving, *IMAGE reconstruction algorithms, *DEEP learning, *MACHINE learning, *CHANNEL estimation
Abstract: In order to solve the problem that the existing super-resolution algorithm based on deep learning didn't make full use of the feature information of each level, resulting in low reconstruction accuracy and large parameter quantity, this paper proposed a double dense connection structure named densely channel attention skip connection network. In the inner structure of the network, it improved the original dense cascade block to generate a channel separable dense cascade block. The outer structure adopted a densely residual connection and attention mechanism to fuse the features extracted by the dense block to achieve the goal that less convolution layer and higher precision effect. This paper tested the network models on several benchmark datasets . The results show the proposed model has higher accuracy and fewer parameters than the other models. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

8. 融合多层次结构信息的深度属性二分网络表示学习.

Author: 李婷婷, 吕少卿, 赵雪莉, and 任新成
Subjects: *VECTOR spaces, *VIRTUAL networks, *INFORMATION networks, *BIPARTITE graphs, *PROBLEM solving, *DEEP learning, *ALGORITHMS
Abstract: Network representation learning aims to transform the nodes in the network into a low-dimensional vector space while maintaining the inherent properties of the network. Most of the existing methods are aimed at normal networks, ignoring the particularity of attribute bipartite networks and the highly non-linear characteristics of the network. To solve the above problem, this paper proposed a deep attributed bipartite network embedding method incorporating with multilevel structure information. Specifically, the algorithm introduced an extended weight matrix to fuse the explicit and implicit structure of the bipartite network with attribute information. Then, it utilized a deep auto-encoder model to capture the highly non-linear characteristics of the network. To maintain the global network structure, a deep auto-encoder reconstructed the second-order proximity. Meanwhile, the deep auto-encoder used the first-order proximity of the nodes as supervisory information to maintain the local network structure. Finally, the algorithm performed joint optimization to get the final representation vectors of nodes. The model was executed on the four datasets of Yelp, Douban Book, Douban Movie and MovieLens. Compared with the latest benchmark method, the average values of F1@10, MAP@10,MRR@10 and NDGG@10 of this model have improved by 4.29%、5.63%、6.26%、4.21%. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

9. 基于轻量级卷积神经网络的人证比对.

Author: 高凌飞, 王海龙, 王海涛, 刘强, 张鲁洋, and 王怀斌
Subjects: *CONVOLUTIONAL neural networks, *PROBLEM solving, *HUMAN facial recognition software, *ALGORITHMS, *ADDITIVE functions, *DEEP learning
Abstract: In the scene of document verification，the standard deep learning face recognition method has low accuracy and poor real-time performance on embedded devices. To solve these problems，this paper proposes a modified efficient convolutional neural network（CNN）called Lightnet and adopts the transfer learning method. Lightnet is an efficient CNN module composed of depthwise separable convolution，linear bottleneck structure and attention module. After introducing the loss function AM-Softmax with additive angle margin， the network model can effectively solve the problems of redundancy parameter and vast calculation for standard CNN in the foundation of ensuring the high accuracy of face recognition. The transfer learning method can enhance the scene-identity face matching performance by freezing all the convolution layer weights of the pre-trained model and fine-tuning training in the self-made scene-identity face matching dataset. The experimental results show that the designed efficient scene-identity face matching algorithm has achieved good results in terms of verification accuracy，parameters and verification speed，and has good robustness in life scenarios. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

10. 基于多头注意力机制和位置信息的 xDeepFM 推荐模型.

Author: 牛路帅 and 彭颵
Subjects: *PROBLEM solving, *ALGORITHMS, *DEEP learning, *MACHINE learning
Abstract: In order to solve the problem that the recommendation model cannot mine the diversity of user interest and capture the order information between the sequence of user behavior, and the interaction occurs at the element level rather than between the feature vectors, etc., based on multiple attention mechanism and location information, this paper proposed the x DMAL FM. Firstly, it extracted the feature depth from different subspaces by using the multi-head attention mechanism, and then location information could capture the sequential relationship between user behavior sequences. Finally, it used three public datasets to conduct comparative experiments and the AUC index to evaluate the results. The experimental results show that the proposed algorithm has better recommendation performance than the xDeepFM model and show the effectiveness and feasibility of the proposed algorithm. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

11. 一种基于条件生成式对抗网络的道路提取方法.

Author: 陆川伟, 孙群, 赵云鹏, 孙士杰, 马京振, 程绵绵, and 李元復
Subjects: *PROBLEM solving, *DATA mining, *DEEP learning, *ACQUISITION of data, *ALGORITHMS, *TAXI service, *TAXICABS
Abstract: Objectives: Road information extraction based on vehicle trajectory data is one of the hotspots and difficulties in the field of geographic information data acquisition. Traditional methods are faced with the problems of high accuracy of trajectory data source, complex road extraction algorithm model, and poor adaptability of different road extraction model parameters. In order to solve the above problems, a trajectoryroad conversion model based on conditional generative adversarial nets is proposed, and we called it trajectory‐ to‐road translation with conditional generative adversarial nets( TR‐CGAN).Methods: Firstly, the trajectory data and the corresponding road data are raster processed in the sample area to construct the trajectory‐ road sample image pairs, then the parameters of TR‐CGAN are learned based on the sample data as the prior knowledge. Through the continuous iteration of the one player game, the optimal generation result is gradually approached. Before that, according to the characteristics of vehicle trajectory data, this paper uses the control variable method and enumeration method to analyze the parameters of U ‐Net generator depth, discriminator receptive field size and objective function in the conversion model, so as to obtain the optimal structure of TR‐CGAN.Results: Using the taxi track data in the third ring road of Zhengzhou city, the experiment results show that this proposed method can find new roads more effectively. At the same time, the trained TR‐CGAN is compared with the raster road extraction method, and it is found that our method has stronger adaptability of trajectory data in both the sparse and dense areas of the trajectory, and the accuracy of generated road is higher.Conclusions: Our proposed method can realize road extraction based on trajectory data, and has better data adaptability and accuracy. In the further research, we can increase the type of sample data, so that the road extraction model can learn to generate more road types, such as ring road types. The model can be further optimized to extract two lane roads or even lane roads. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

12. 基于强化底层特征的无人机航拍图像小目标检测算法.

Author: 吕晓君, 向伟, and 刘云鹏
Subjects: *PROBLEM solving, *DEEP learning, *FEATURE extraction, *ALGORITHMS, *RADARSAT satellites
Abstract: In order to solve the problem of low accuracy and residual error in small object detection on UAV aerial images, this paper proposed a new kind of multi-scale small target detection method based on enhanced lower feature. Basing on Faster RCNN ResNet-50-FPN model, the algorithm enhanced the lower feature by designing the structure of DetNet-59 feature extraction network and Flat-FPN feature fusion network, and applied soft-NMS to face the appearance of overlapping small objects. From simulation test on VOC2007 and VisDrone2019, the method is able to increase m AP by 11% compared to the base model when time consumption is no more than 2%, and it also performs better in terms of accuracy than current common algorithms. It was proved that the algorithm can effectively improve the detection accuracy of small targets while ensuring real-time performance. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

13. 基于改进的深度残差网络的表情识别研究.

Author: 何俊, 刘跃, 李倡洪, 沈津铭, 李帅, and 王京威
Subjects: *FACIAL expression, *MATHEMATICAL convolutions, *PROBLEM solving, *DATABASES, *ALGORITHMS, *KERNEL operating systems, *DEEP learning
Abstract: This paper proposed an improved residual network (ResNet) expression recognition algorithm. The algorithm used small convolution kernels and a deep network structure to solve the problem of accuracy reduction with the increase of depth by the residual module. The experiment overcame the shortcoming of insufficient data through transfer learning, which could effectively prevent overfitting. The network architecture used a linear SVM for classification. The experiment used the ImageNet database to pre-train network parameters to have an excellent ability to extract feature. According to transfer learning, the algorithm used the FER-2013 database and the expanded CK + database to fine-tune and train network parameters, and overcame the problem that shallow networks rely on manual features and deep networks were difficult to train. The results show the recognition rates is 91.333% and 95.775% on the CK + database and the GENKI-4K database,respectively. The classification accuracy of SVM in CK + database is about 1% higher than that of softmax. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

13 results

1. 基于 Transformer 人像关键点检测网络的研究.

2. 保留低阶和高阶关系的图表示深度学习集成算法.

3. 基于稀疏特征改进的单视图表面重建.

4. 基于图神经网络与深度学习的商品推荐算法.

5. 基于深度学习的遥感图像去雾算法.

6. 基于卷积神经网络和语义相关的协同显著性检测.

7. 用于图像超分辨的密集跳跃注意连接网络.

8. 融合多层次结构信息的深度属性二分网络表示学习.

9. 基于轻量级卷积神经网络的人证比对.

10. 基于多头注意力机制和位置信息的 xDeepFM 推荐模型.

11. 一种基于条件生成式对抗网络的道路提取方法.

12. 基于强化底层特征的无人机航拍图像小目标检测算法.

13. 基于改进的深度残差网络的表情识别研究.

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Publication Type

Journal

Database

Publisher

13 results

Search Results

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources