A Joint Matrix Factorization and Clustering Scheme for Irregular Time Series Data

19 Pages Posted: 11 Jul 2022

See all articles by shiming he

shiming he

Changsha University of Science and Technology

Meng Guo

Changsha University of Science and Technology

Zhuozhou Li

Changsha University of Science and Technology

Siyuan Zhou

Changsha University of Science and Technology

Kun Xie

Hunan University

Naixue Xiong

Northeastern State University

Abstract

Key Performance Indicator (KPI) clustering plays an important role in Artificial Intelligence for IT Operations when facing large-scale KPIs. It can effectively reduce the overhead by dividing KPIs into several classes, then applying the same anomaly detection or prediction model to all KPIs in a class. However, KPI sampling strategies vary according to the various environments, which produces irregular KPIs. Few existing works have considered the clustering of KPIs with irregular sampling. Matrix factorization (MF) is widely applied in low-rank data recovery and can be used to align and fill the irregular KPIs. However, the clustering performance after recovering and filling by MF remains unknown. These two problems interact with each other and should be solved together. Therefore, we formulate the joint MF and clustering problem for irregular KPIs and design an iterative clustering scheme based on MF. The iterative clustering scheme comprises alignment and pre-filling, the loop of clustering and subclass filling by MF, which can work with two pre-filling methods and any type of clustering algorithms. Extensive experiments on two real-world datasets show that the iterative clustering scheme can obtain higher normalized mutual information (NMI) than non-iterative clustering, and moreover consumes less computational time than Dynamic Time Warping.

Keywords: Time series clusteringIrregular samplingMatrix factorizationIteration

Suggested Citation

he, shiming and Guo, Meng and Li, Zhuozhou and Zhou, Siyuan and Xie, Kun and Xiong, Naixue, A Joint Matrix Factorization and Clustering Scheme for Irregular Time Series Data. Available at SSRN: https://ssrn.com/abstract=4160013

Shiming He

Changsha University of Science and Technology ( email )

Wangxin Rd
Changsha, 410004
China

Meng Guo

Changsha University of Science and Technology ( email )

Wangxin Rd
Changsha, 410004
China

Zhuozhou Li

Changsha University of Science and Technology ( email )

Wangxin Rd
Changsha, 410004
China

Siyuan Zhou

Changsha University of Science and Technology ( email )

Wangxin Rd
Changsha, 410004
China

Kun Xie

Hunan University ( email )

2 Lushan South Rd
Changsha, CA 410082
China

Naixue Xiong (Contact Author)

Northeastern State University ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
85
Abstract Views
262
Rank
640,419
PlumX Metrics