Multi-Relational Data Characterization by Tensors: Perturbation Analysis

Publication Date

1-1-2023

Document Type

Article

Publication Title

IEEE Transactions on Knowledge and Data Engineering

Volume

35

Issue

1

DOI

10.1109/TKDE.2021.3087671

First Page

756

Last Page

769

Abstract

Data perturbation is deemed a common problem in data processing. It is often inevitable to avoid noisy or misleading data which may arise from real-world collection or model imprecision. Besides, when data privacy is concerned, data perturbation is used as a prevalent data-protection approach, which alters individual data in a way such that the summary statistics still remain more or less the same. Since many data-mining problems can be formulated as tensor equations for characterizing multi-relational data, the main focus of this work is to perform a new perturbation analysis of tensor equations. From our recent study on tensor inversion, we propose a new mathematical framework to invert an arbitrary tensor but the existing iterative algorithms cannot always do so. In this work, we will establish the theoretical tensor-perturbation analysis to quantify the crucial query performance in terms of normalized error-norm with respect to perturbation degree and condition number. The condition number can be taken as a new measure to determine how the solution of a tensor equation varies as the entries are perturbed. Information-retrieval experiments for conducting the perturbation analysis of the solutions to tensor equations over both artificial and real data are undertaken and studied finally.

Keywords

data perturbation, multi-relational data, perturbation analysis, Tensor equations, tensor inverse

Department

Applied Data Science

Share

COinS