Hard Sample Aware Network for Contrastive Deep Graph Clustering

Authors

  • Yue Liu National University of Defense Technology
  • Xihong Yang National University of Defense Technology
  • Sihang Zhou National University of Defense Technology
  • Xinwang Liu National University of Defense Technology
  • Zhen Wang Northwestern Polytechnical University
  • Ke Liang National University of Defense Technology
  • Wenxuan Tu National University of Defense Technology
  • Liang Li National University of Defense Technology
  • Jingcan Duan National University of Defense Technology
  • Cancan Chen Beijing Information Science and Technology University

DOI:

https://doi.org/10.1609/aaai.v37i7.26071

Keywords:

ML: Clustering, ML: Graph-based Machine Learning, ML: Multi-Instance/Multi-View Learning

Abstract

Contrastive deep graph clustering, which aims to divide nodes into disjoint groups via contrastive mechanisms, is a challenging research spot. Among the recent works, hard sample mining-based algorithms have achieved great attention for their promising performance. However, we find that the existing hard sample mining methods have two problems as follows. 1) In the hardness measurement, the important structural information is overlooked for similarity calculation, degrading the representativeness of the selected hard negative samples. 2) Previous works merely focus on the hard negative sample pairs while neglecting the hard positive sample pairs. Nevertheless, samples within the same cluster but with low similarity should also be carefully learned. To solve the problems, we propose a novel contrastive deep graph clustering method dubbed Hard Sample Aware Network (HSAN) by introducing a comprehensive similarity measure criterion and a general dynamic sample weighing strategy. Concretely, in our algorithm, the similarities between samples are calculated by considering both the attribute embeddings and the structure embeddings, better revealing sample relationships and assisting hardness measurement. Moreover, under the guidance of the carefully collected high-confidence clustering information, our proposed weight modulating function will first recognize the positive and negative samples and then dynamically up-weight the hard sample pairs while down-weighting the easy ones. In this way, our method can mine not only the hard negative samples but also the hard positive sample, thus improving the discriminative capability of the samples further. Extensive experiments and analyses demonstrate the superiority and effectiveness of our proposed method. The source code of HSAN is shared at https://github.com/yueliu1999/HSAN and a collection (papers, codes and, datasets) of deep graph clustering is shared at https://github.com/yueliu1999/Awesome-Deep-Graph-Clustering on Github.

Downloads

Published

2023-06-26

How to Cite

Liu, Y., Yang, X., Zhou, S., Liu, X., Wang, Z., Liang, K., Tu, W., Li, L., Duan, J., & Chen, C. (2023). Hard Sample Aware Network for Contrastive Deep Graph Clustering. Proceedings of the AAAI Conference on Artificial Intelligence, 37(7), 8914-8922. https://doi.org/10.1609/aaai.v37i7.26071

Issue

Section

AAAI Technical Track on Machine Learning II