Author: "Cen, Yukuo" / Publication Year Range: Last 3 years - Searchworks@Jio Institute Digital Library Search Results

Your search for "Cen, Yukuo" returned 23 results

1. LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering

Author: Zhao, Qingfei, Wang, Ruobing, Cen, Yukuo, Zha, Daren, Tan, Shicheng, Dong, Yuxiao, and Tang, Jie
Subjects: Computer Science - Computation and Language
Abstract: Long-Context Question Answering (LCQA), a challenging task, aims to reason over long-context documents to yield accurate answers to questions. Existing long-context Large Language Models (LLMs) for LCQA often struggle with the "lost in the middle" issue. Retrieval-Augmented Generation (RAG) mitigates this issue by providing external factual evidence. However, its chunking strategy disrupts the global long-context information, and its low-quality retrieval in long contexts hinders LLMs from identifying effective factual details due to substantial noise. To this end, we propose LongRAG, a general, dual-perspective, and robust LLM-based RAG system paradigm for LCQA to enhance RAG's understanding of complex long-context knowledge (i.e., global information and factual details). We design LongRAG as a plug-and-play paradigm, facilitating adaptation to various domains and LLMs. Extensive experiments on three multi-hop datasets demonstrate that LongRAG significantly outperforms long-context LLMs (up by 6.94%), advanced RAG (up by 6.16%), and Vanilla RAG (up by 17.25%). Furthermore, we conduct quantitative ablation studies and multi-dimensional analyses, highlighting the effectiveness of the system's components and fine-tuning strategies. Data and code are available at https://github.com/QingFei1/LongRAG.
Comment: EMNLP 2024 Main, Final
Published: 2024

2. Pre-Training and Prompting for Few-Shot Node Classification on Text-Attributed Graphs

Author: Zhao, Huanjing, Yang, Beining, Cen, Yukuo, Ren, Junyu, Zhang, Chenhui, Dong, Yuxiao, Kharlamov, Evgeny, Zhao, Shu, and Tang, Jie
Subjects: Computer Science - Social and Information Networks, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: The text-attributed graph (TAG) is one kind of important real-world graph-structured data with each node associated with raw texts. For TAGs, traditional few-shot node classification methods directly conduct training on the pre-processed node features and do not consider the raw texts. The performance is highly dependent on the choice of the feature pre-processing method. In this paper, we propose P2TAG, a framework designed for few-shot node classification on TAGs with graph pre-training and prompting. P2TAG first pre-trains the language model (LM) and graph neural network (GNN) on TAGs with self-supervised loss. To fully utilize the ability of language models, we adapt the masked language modeling objective for our framework. The pre-trained model is then used for the few-shot node classification with a mixed prompt method, which simultaneously considers both text and graph information. We conduct experiments on six real-world TAGs, including paper citation networks and product co-purchasing networks. Experimental results demonstrate that our proposed framework outperforms existing graph few-shot learning methods on these datasets with +18.98% ~ +35.98% improvements.
Comment: Accepted to KDD'24
Published: 2024
Full Text: View/download PDF

3. Generalizing Graph Transformers Across Diverse Graphs and Tasks via Pre-Training on Industrial-Scale Data

Author: He, Yufei, Hou, Zhenyu, Cen, Yukuo, He, Feng, Cheng, Xu, and Hooi, Bryan
Subjects: Computer Science - Machine Learning, Computer Science - Social and Information Networks
Abstract: Graph pre-training has been concentrated on graph-level on small graphs (e.g., molecular graphs) or learning node representations on a fixed graph. Extending graph pre-trained models to web-scale graphs with billions of nodes in industrial scenarios, while avoiding negative transfer across graphs or tasks, remains a challenge. We aim to develop a general graph pre-trained model with inductive ability that can make predictions for unseen new nodes and even new graphs. In this work, we introduce a scalable transformer-based graph pre-training framework called PGT (Pre-trained Graph Transformer). Specifically, we design a flexible and scalable graph transformer as the backbone network. Meanwhile, based on the masked autoencoder architecture, we design two pre-training tasks: one for reconstructing node features and the other one for reconstructing local structures. Unlike the original autoencoder architecture where the pre-trained decoder is discarded, we propose a novel strategy that utilizes the decoder for feature augmentation. We have deployed our framework on Tencent's online game data. Extensive experiments have demonstrated that our framework can perform pre-training on real-world web-scale graphs with over 540 million nodes and 12 billion edges and generalizes effectively to unseen new graphs with different downstream tasks. We further conduct experiments on the publicly available ogbn-papers100M dataset, which consists of 111 million nodes and 1.6 billion edges. Our framework achieves state-of-the-art performance on both industrial datasets and public datasets, while also enjoying scalability and efficiency.
Comment: This work has been submitted to the IEEE for possible publication
Published: 2024

4. GraphAlign: Pretraining One Graph Neural Network on Multiple Graphs via Feature Alignment

Author: Hou, Zhenyu, Li, Haozhan, Cen, Yukuo, Tang, Jie, and Dong, Yuxiao
Subjects: Computer Science - Machine Learning
Abstract: Graph self-supervised learning (SSL) holds considerable promise for mining and learning with graph-structured data. Yet, a significant challenge in graph SSL lies in the feature discrepancy among graphs across different domains. In this work, we aim to pretrain one graph neural network (GNN) on a varied collection of graphs endowed with rich node features and subsequently apply the pretrained GNN to unseen graphs. We present a general GraphAlign method that can be seamlessly integrated into the existing graph SSL framework. To align feature distributions across disparate graphs, GraphAlign designs alignment strategies of feature encoding, normalization, alongside a mixture-of-feature-expert module. Extensive experiments show that GraphAlign empowers existing graph SSL frameworks to pretrain a unified and powerful GNN across multiple graphs, showcasing performance superiority on both in-domain and out-of-domain graphs.
Published: 2024

5. Does Negative Sampling Matter? A Review with Insights into its Theory and Applications

Author: Yang, Zhen, Ding, Ming, Huang, Tinglin, Cen, Yukuo, Song, Junshuai, Xu, Bin, Dong, Yuxiao, and Tang, Jie
Subjects: Computer Science - Machine Learning
Abstract: Negative sampling has swiftly risen to prominence as a focal point of research, with wide-ranging applications spanning machine learning, computer vision, natural language processing, data mining, and recommender systems. This growing interest raises several critical questions: Does negative sampling really matter? Is there a general framework that can incorporate all existing negative sampling methods? In what fields is it applied? Addressing these questions, we propose a general framework that leverages negative sampling. Delving into the history of negative sampling, we trace the development of negative sampling through five evolutionary paths. We dissect and categorize the strategies used to select negative sample candidates, detailing global, local, mini-batch, hop, and memory-based approaches. Our review categorizes current negative sampling methods into five types: static, hard, GAN-based, Auxiliary-based, and In-batch methods, providing a clear structure for understanding negative sampling. Beyond detailed categorization, we highlight the application of negative sampling in various areas, offering insights into its practical benefits. Finally, we briefly discuss open problems and future directions for negative sampling.
Comment: 20 pages, 11 figures
Published: 2024

6. PST-Bench: Tracing and Benchmarking the Source of Publications

Author: Zhang, Fanjin, Cao, Kun, Cen, Yukuo, Yu, Jifan, Yin, Da, and Tang, Jie
Subjects: Computer Science - Digital Libraries, Computer Science - Computation and Language
Abstract: Tracing the source of research papers is a fundamental yet challenging task for researchers. The billion-scale citation relations between papers hinder researchers from understanding the evolution of science efficiently. To date, there is still a lack of an accurate and scalable dataset constructed by professional researchers to identify the direct source of their studied papers, based on which automatic algorithms can be developed to expand the evolutionary knowledge of science. In this paper, we study the problem of paper source tracing (PST) and construct a high-quality and ever-increasing dataset PST-Bench in computer science. Based on PST-Bench, we reveal several intriguing discoveries, such as the differing evolution patterns across various topics. An exploration of various methods underscores the hardness of PST-Bench, pinpointing potential directions on this topic. The dataset and codes have been available at https://github.com/THUDM/paper-source-trace.
Comment: 8 pages, 3 appendix pages
Published: 2024

7. OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining

Author: Zhang, Fanjin, Shi, Shijie, Zhu, Yifan, Chen, Bo, Cen, Yukuo, Yu, Jifan, Chen, Yelin, Wang, Lulu, Zhao, Qingfei, Cheng, Yuqing, Han, Tianyi, An, Yuwei, Zhang, Dan, Tam, Weng Lam, Cao, Kun, Pang, Yunhe, Guan, Xinyu, Yuan, Huihui, Song, Jian, Li, Xiaoyan, Dong, Yuxiao, and Tang, Jie
Subjects: Computer Science - Digital Libraries, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: With the rapid proliferation of scientific literature, versatile academic knowledge services increasingly rely on comprehensive academic graph mining. Despite the availability of public academic graphs, benchmarks, and datasets, these resources often fall short in multi-aspect and fine-grained annotations, are constrained to specific task types and domains, or lack underlying real academic graphs. In this paper, we present OAG-Bench, a comprehensive, multi-aspect, and fine-grained human-curated benchmark based on the Open Academic Graph (OAG). OAG-Bench covers 10 tasks, 20 datasets, 70+ baselines, and 120+ experimental results to date. We propose new data annotation strategies for certain tasks and offer a suite of data pre-processing codes, algorithm implementations, and standardized evaluation protocols to facilitate academic graph mining. Extensive experiments reveal that even advanced algorithms like large language models (LLMs) encounter difficulties in addressing key challenges in certain tasks, such as paper source tracing and scholar profiling. We also introduce the Open Academic Graph Challenge (OAG-Challenge) to encourage community input and sharing. We envisage that OAG-Bench can serve as a common ground for the community to evaluate and compare algorithms in academic graph mining, thereby accelerating algorithm development and advancement in this field. OAG-Bench is accessible at https://www.aminer.cn/data/.
Comment: KDD'24, 9 pages, 5 appendix pages
Published: 2024
Full Text: View/download PDF

8. BatchSampler: Sampling Mini-Batches for Contrastive Learning in Vision, Language, and Graphs

Author: Yang, Zhen, Huang, Tinglin, Ding, Ming, Dong, Yuxiao, Ying, Rex, Cen, Yukuo, Geng, Yangliao, and Tang, Jie
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition
Abstract: In-Batch contrastive learning is a state-of-the-art self-supervised method that brings semantically-similar instances close while pushing dissimilar instances apart within a mini-batch. Its key to success is the negative sharing strategy, in which every instance serves as a negative for the others within the mini-batch. Recent studies aim to improve performance by sampling hard negatives within the current mini-batch, whose quality is bounded by the mini-batch itself. In this work, we propose to improve contrastive learning by sampling mini-batches from the input data. We present BatchSampler (the code is available at https://github.com/THUDM/BatchSampler) to sample mini-batches of hard-to-distinguish (i.e., hard and true negatives to each other) instances. To make each mini-batch have fewer false negatives, we design the proximity graph of randomly-selected instances. To form the mini-batch, we leverage random walk with restart on the proximity graph to help sample hard-to-distinguish instances. BatchSampler is a simple and general technique that can be directly plugged into existing contrastive learning models in vision, language, and graphs. Extensive experiments on datasets of three modalities show that BatchSampler can consistently improve the performance of powerful contrastive models, as shown by significant improvements of SimCLR on ImageNet-100, SimCSE on STS (language), and GraphCL and MVGRL on graph datasets.
Comment: 17 pages, 16 figures
Published: 2023

9. GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph Learner

Author: Hou, Zhenyu, He, Yufei, Cen, Yukuo, Liu, Xiao, Dong, Yuxiao, Kharlamov, Evgeny, and Tang, Jie
Subjects: Computer Science - Machine Learning
Abstract: Graph self-supervised learning (SSL), including contrastive and generative approaches, offers great potential to address the fundamental challenge of label scarcity in real-world graph data. Among both sets of graph SSL techniques, the masked graph autoencoders (e.g., GraphMAE), one type of generative method, have recently produced promising results. The idea behind this is to reconstruct the node features (or structures) that are randomly masked from the input with the autoencoder architecture. However, the performance of masked feature reconstruction naturally relies on the discriminability of the input features and is usually vulnerable to disturbance in the features. In this paper, we present a masked self-supervised learning framework GraphMAE2 with the goal of overcoming this issue. The idea is to impose regularization on feature reconstruction for graph SSL. Specifically, we design the strategies of multi-view random re-mask decoding and latent representation prediction to regularize the feature reconstruction. The multi-view random re-mask decoding is to introduce randomness into reconstruction in the feature space, while the latent representation prediction is to enforce the reconstruction in the embedding space. Extensive experiments show that GraphMAE2 can consistently generate top results on various public datasets, including at least 2.45% improvements over state-of-the-art baselines on ogbn-Papers100M with 111M nodes and 1.6B edges.
Comment: Accepted to WWW'23
Published: 2023

10. Mask and Reason: Pre-Training Knowledge Graph Transformers for Complex Logical Queries

Author: Liu, Xiao, Zhao, Shiyu, Su, Kai, Cen, Yukuo, Qiu, Jiezhong, Zhang, Mengdi, Wu, Wei, Dong, Yuxiao, and Tang, Jie
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Knowledge graph (KG) embeddings have been a mainstream approach for reasoning over incomplete KGs. However, limited by their inherently shallow and static architectures, they can hardly deal with the rising focus on complex logical queries, which comprise logical operators, imputed edges, multiple source entities, and unknown intermediate entities. In this work, we present the Knowledge Graph Transformer (kgTransformer) with masked pre-training and fine-tuning strategies. We design a KG triple transformation method to enable Transformer to handle KGs, which is further strengthened by the Mixture-of-Experts (MoE) sparse activation. We then formulate the complex logical queries as masked prediction and introduce a two-stage masked pre-training strategy to improve transferability and generalizability. Extensive experiments on two benchmarks demonstrate that kgTransformer can consistently outperform both KG embedding-based baselines and advanced encoders on nine in-domain and out-of-domain reasoning tasks. Additionally, kgTransformer can reason with explainability via providing the full reasoning paths to interpret given answers.
Comment: kgTransformer; Accepted to KDD 2022
Published: 2022
Full Text: View/download PDF

11. GACT: Activation Compressed Training for Generic Network Architectures

Author: Liu, Xiaoxuan, Zheng, Lianmin, Wang, Dequan, Cen, Yukuo, Chen, Weize, Han, Xu, Chen, Jianfei, Liu, Zhiyuan, Tang, Jie, Gonzalez, Joey, Mahoney, Michael, and Cheung, Alvin
Subjects: Computer Science - Machine Learning
Abstract: Training large neural network (NN) models requires extensive memory resources, and Activation Compressed Training (ACT) is a promising approach to reduce training memory footprint. This paper presents GACT, an ACT framework to support a broad range of machine learning tasks for generic NN architectures with limited domain knowledge. By analyzing a linearized version of ACT's approximate gradient, we prove the convergence of GACT without prior knowledge on operator type or model architecture. To make training stable, we propose an algorithm that decides the compression ratio for each tensor by estimating its impact on the gradient at run time. We implement GACT as a PyTorch library that readily applies to any NN architecture. GACT reduces the activation memory for convolutional NNs, transformers, and graph NNs by up to 8.1x, enabling training with a 4.2x to 24.7x larger batch size, with negligible accuracy loss. The library is available at https://github.com/LiuXiaoxuanPKU/GACT-ICML.
Published: 2022

12. Rethinking the Setting of Semi-supervised Learning on Graphs

Author: Li, Ziang, Ding, Ming, Li, Weikai, Wang, Zihan, Zeng, Ziyu, Cen, Yukuo, and Tang, Jie
Subjects: Computer Science - Machine Learning
Abstract: We argue that the present setting of semi-supervised learning on graphs may result in unfair comparisons, due to its potential risk of over-tuning hyper-parameters for models. In this paper, we highlight the significant influence of tuning hyper-parameters, which leverages the label information in the validation set to improve the performance. To explore the limit of over-tuning hyper-parameters, we propose ValidUtil, an approach to fully utilize the label information in the validation set through an extra group of hyper-parameters. With ValidUtil, even GCN can easily get high accuracy of 85.8% on Cora. To avoid over-tuning, we merge the training set and the validation set and construct an i.i.d. graph benchmark (IGB) consisting of 4 datasets. Each dataset contains 100 i.i.d. graphs sampled from a large graph to reduce the evaluation variance. Our experiments suggest that IGB is a more stable benchmark than previous datasets for semi-supervised learning on graphs.
Comment: To appear in IJCAI 2022
Published: 2022

13. GraphMAE: Self-Supervised Masked Graph Autoencoders

Author: Hou, Zhenyu, Liu, Xiao, Cen, Yukuo, Dong, Yuxiao, Yang, Hongxia, Wang, Chunjie, and Tang, Jie
Subjects: Computer Science - Machine Learning
Abstract: Self-supervised learning (SSL) has been extensively explored in recent years. Particularly, generative SSL has seen emerging success in natural language processing and other AI fields, such as the wide adoption of BERT and GPT. Despite this, contrastive learning, which heavily relies on structural data augmentation and complicated training strategies, has been the dominant approach in graph SSL, while the progress of generative SSL on graphs, especially graph autoencoders (GAEs), has thus far not reached the potential as promised in other fields. In this paper, we identify and examine the issues that negatively impact the development of GAEs, including their reconstruction objective, training robustness, and error metric. We present a masked graph autoencoder GraphMAE that mitigates these issues for generative self-supervised graph pretraining. Instead of reconstructing graph structures, we propose to focus on feature reconstruction with both a masking strategy and scaled cosine error that benefit the robust training of GraphMAE. We conduct extensive experiments on 21 public datasets for three different graph learning tasks. The results show that GraphMAE, a simple graph autoencoder with careful designs, can consistently outperform both contrastive and generative state-of-the-art baselines. This study provides an understanding of graph autoencoders and demonstrates the potential of generative self-supervised pre-training on graphs.
Comment: 11 pages; Accepted to KDD'22
Published: 2022

14. SCR: Training Graph Neural Networks with Consistency Regularization

Author: Zhang, Chenhui, He, Yufei, Cen, Yukuo, Hou, Zhenyu, Feng, Wenzheng, Dong, Yuxiao, Cheng, Xu, Cai, Hongyun, He, Feng, and Tang, Jie
Subjects: Computer Science - Social and Information Networks, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: We present the SCR framework for enhancing the training of graph neural networks (GNNs) with consistency regularization. Regularization is a set of strategies used in Machine Learning to reduce overfitting and improve the generalization ability. However, it is unclear how to best design the generalization strategies in GNNs, as it works in a semi-supervised setting for graph data. The major challenge lies in how to efficiently balance the trade-off between the error from the labeled data and that from the unlabeled data. SCR is a simple yet general framework in which we introduce two strategies of consistency regularization to address the challenge above. One is to minimize the disagreements among the perturbed predictions by different versions of a GNN model. The other is to leverage the Mean Teacher paradigm to estimate a consistency loss between teacher and student models instead of the disagreement of the predictions. We conducted experiments on three large-scale node classification datasets in the Open Graph Benchmark (OGB). Experimental results demonstrate that the proposed SCR framework is a general one that can enhance various GNNs to achieve better performance. Finally, SCR has been the top-1 entry on all three OGB leaderboards as of this submission.
Published: 2021

15. Graph Robustness Benchmark: Benchmarking the Adversarial Robustness of Graph Machine Learning

Author: Zheng, Qinkai, Zou, Xu, Dong, Yuxiao, Cen, Yukuo, Yin, Da, Xu, Jiarong, Yang, Yang, and Tang, Jie
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Cryptography and Security
Abstract: Adversarial attacks on graphs have posed a major threat to the robustness of graph machine learning (GML) models. Naturally, there is an ever-escalating arms race between attackers and defenders. However, the strategies behind both sides are often not fairly compared under the same and realistic conditions. To bridge this gap, we present the Graph Robustness Benchmark (GRB) with the goal of providing a scalable, unified, modular, and reproducible evaluation for the adversarial robustness of GML models. GRB standardizes the process of attacks and defenses by 1) developing scalable and diverse datasets, 2) modularizing the attack and defense implementations, and 3) unifying the evaluation protocol in refined scenarios. By leveraging the GRB pipeline, the end-users can focus on the development of robust GML models with automated data processing and experimental evaluations. To support open and reproducible research on graph adversarial learning, GRB also hosts public leaderboards across different scenarios. As a starting point, we conduct extensive experiments to benchmark baseline techniques. GRB is open-source and welcomes contributions from the community. Datasets, codes, leaderboards are available at https://cogdl.ai/grb/home.
Comment: 21 pages, 12 figures, NeurIPS 2021 Datasets and Benchmarks Track
Published: 2021

16. Does Negative Sampling Matter? A Review with Insights into its Theory and Applications

Author: Yang, Zhen, primary, Ding, Ming, additional, Huang, Tinglin, additional, Cen, Yukuo, additional, Song, Junshuai, additional, Xu, Bin, additional, Dong, Yuxiao, additional, and Tang, Jie, additional
Published: 2024
Full Text: View/download PDF

17. BatchSampler: Sampling Mini-Batches for Contrastive Learning in Vision, Language, and Graphs

Author: Yang, Zhen, primary, Huang, Tinglin, additional, Ding, Ming, additional, Dong, Yuxiao, additional, Ying, Rex, additional, Cen, Yukuo, additional, Geng, Yangliao, additional, and Tang, Jie, additional
Published: 2023
Full Text: View/download PDF

18. CogDL: A Comprehensive Library for Graph Deep Learning

Author: Cen, Yukuo, primary, Hou, Zhenyu, additional, Wang, Yan, additional, Chen, Qibin, additional, Luo, Yizhen, additional, Yu, Zhongming, additional, Zhang, Hengrui, additional, Yao, Xingcheng, additional, Zeng, Aohan, additional, Guo, Shiguang, additional, Dong, Yuxiao, additional, Yang, Yang, additional, Zhang, Peng, additional, Dai, Guohao, additional, Wang, Yu, additional, Zhou, Chang, additional, Yang, Hongxia, additional, and Tang, Jie, additional
Published: 2023
Full Text: View/download PDF

19. GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph Learner

Author: Hou, Zhenyu, primary, He, Yufei, additional, Cen, Yukuo, additional, Liu, Xiao, additional, Dong, Yuxiao, additional, Kharlamov, Evgeny, additional, and Tang, Jie, additional
Published: 2023
Full Text: View/download PDF

20. GraphMAE: Self-Supervised Masked Graph Autoencoders

Author: Hou, Zhenyu, primary, Liu, Xiao, additional, Cen, Yukuo, additional, Dong, Yuxiao, additional, Yang, Hongxia, additional, Wang, Chunjie, additional, and Tang, Jie, additional
Published: 2022
Full Text: View/download PDF

21. Mask and Reason

Author: Liu, Xiao, primary, Zhao, Shiyu, additional, Su, Kai, additional, Cen, Yukuo, additional, Qiu, Jiezhong, additional, Zhang, Mengdi, additional, Wu, Wei, additional, Dong, Yuxiao, additional, and Tang, Jie, additional
Published: 2022
Full Text: View/download PDF

22. Rethinking the Setting of Semi-supervised Learning on Graphs

Author: Li, Ziang, primary, Ding, Ming, additional, Li, Weikai, additional, Wang, Zihan, additional, Zeng, Ziyu, additional, Cen, Yukuo, additional, and Tang, Jie, additional
Published: 2022
Full Text: View/download PDF

23. Automated Unsupervised Graph Representation Learning

Author: Hou, Zhenyu, Cen, Yukuo, Dong, Yuxiao, Zhang, Jie, and Tang, Jie
Abstract: Graph data mining has largely benefited from the recent developments of graph representation learning. Most attempts to improve graph representations have thus far focused on designing new network embedding or graph neural network (GNN) architectures. Inspired by the SGC and ProNE models, we instead focus on enhancing any existing or learned graph representations by further smoothing them via graph filters. In this paper, we introduce an automated framework AutoProNE to achieve this. Specifically, AutoProNE automatically searches for a unique optimal set of graph filters for any input dataset, and its existing representations are then smoothed via the selected filters. To make AutoProNE more general, we adopt self-supervised loss functions to guide the optimization of the automated search process. Extensive experiments on eight commonly used datasets demonstrate that the AutoProNE framework can consistently improve the expressive power of graph representations learned by existing network embedding and GNN methods by up to 44%. AutoProNE is also implemented in CogDL, an open source graph learning library, to help boost more algorithms.
Published: 2023
Full Text: View/download PDF
