KGCN

.

Background

CF Questions

  1. sparsity
  2. cold start

KG Benefits

  1. The rich semantic relatedness among items in a KG can help explore their latent connections and improve the precision of results;
  2. The various types of relations in a KG are helpful for extending a user’s interests reasonably and increasing the diversity of recommended items;
  3. KG connects a user’s historically-liked and recommended items, thereby bringing explainability to recommender systems.

Previous KG Method

Knowledge graph embedding

Example: TransE [1] and TransR [12] assume head + relation = tail, which focus on modeling rigorous semantic relatedness

Problem: KGE methods are more suitable for in-graph applications such as KG completion and link prediction rather than the recommendation system.

Path-base Method

Example: PER, FMG

problem: Labor sensitivity

Ripple Net

problem:

  1. the importance of relations is weakly characterized in RippleNet, because the relation R can hardly be trained to capture the sense of importance in the quadratic form vRh (v and h are embedding vectors of two entities).
  2. The size of ripple set may go unpredictably with the increase of the size of KG, which incurs heavy computation and storage overhead.

Solution: KGCN

  1. Propagation and aggregation mechanism.
  2. Attention mechanism.
  3. sample a fixed-size neighborhood to control compute cost.

Model

Single layer

Consider a pair(u,v)

Overall of single layer

!!!Propagation only use for updating of item's vector

image-20230302114017500

Propagation

image-20230302111400335

\(N(v)\) is the neighbor set of v

e$ is the embedding of entity(parameter to train)

\(\pi^u_{r_{v,e}}\) is attention weight

\(r_{v,e}\) represent the relation of v and e

Attention

image-20230302112401231
image-20230302112315690

\(g : R^d ×R^d → R\) (e.g., inner product:内积能计算相似度)to compute the score between a user and a relation

\(u\in R\), \(r\in R\) : embedding of user and relation(parameter to train)

\(\pi^u_{r_{v,e}}\)characterizes the importance of relation r to user u. E.g. example, a user may have more interests in the movies that share the same “star" with his historically liked ones, while another user may be more concerned about the “genre" of movies.

!!!!!!!!!!个性化!!!用户对不同关系重视程度不同!!

所以KGCN不用propagation更新用户的原因是否是因为希望user的embedding能专注于提取个性化信息(提高用户和重要relation的相似度),但是这样是否会让user和item没那么好聚类?

Sample the number of neighbors

limit the neighbor number in K(can be config)

image-20230302113734945

S(v) is also called the (single layer) receptive field of entity v

example K=2:

image-20230302113907523

aggregation

image-20230302114047762
image-20230302114057869
image-20230302114106975

\(\sigma\) is ReLU

Multi layer

9C6A6E4346910DDDD43EBB55E1A2FE6C

First we consider the Receptive-Field:

1739B546615C1AEDC69F7A3514E2E458

We first update eneities in M[0] by using propagation and aggregation to get the high-hop neighbor information.

And then gradually narrow it down, and finally focus on v.

Note that we have only one user in one pair, every relations will calculate the score with this user embedding

Predict

image-20230302114437063

Loss function

image-20230302120154981

\(J\) is cross-entropy loss

P is a negative sampling distribution, and \(T_u\) is the number of negative samples for user u. In this paper,

Experiment