TY - GEN
T1 - Step-by-step regression
T2 - 7th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2003
AU - Liu, Chao
AU - Zhang, Ming
AU - Zheng, Minrui
AU - Chen, Yixin
N1 - Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 2003.
PY - 2003
Y1 - 2003
N2 - Facing tremendous and potentially infinite stream data, it is impossible to record them entirely. Thus synopses are required to be generated timely to capture the underlying model for stream management systems. Traditionally, curve fitting through Multiple Linear Regression (MLR) is a powerful and efficient modeling tool. In order to further accelerate its processing efficiency, we propose Step-by-step Regression (SR) as a more efficient alternative. As revealed in experiments, it speeds up for more than 40 times. In addition, inspired by previous work, we integrated SR into cube environment through similar compression technique to perform online analytical processing and mining over data stream. Finally, experiments show that SR not only significantly alleviates the computation pressure on the front ends of data stream management systems, but also results in a much smaller stream cube for on line analysis and real-time surveillance.
AB - Facing tremendous and potentially infinite stream data, it is impossible to record them entirely. Thus synopses are required to be generated timely to capture the underlying model for stream management systems. Traditionally, curve fitting through Multiple Linear Regression (MLR) is a powerful and efficient modeling tool. In order to further accelerate its processing efficiency, we propose Step-by-step Regression (SR) as a more efficient alternative. As revealed in experiments, it speeds up for more than 40 times. In addition, inspired by previous work, we integrated SR into cube environment through similar compression technique to perform online analytical processing and mining over data stream. Finally, experiments show that SR not only significantly alleviates the computation pressure on the front ends of data stream management systems, but also results in a much smaller stream cube for on line analysis and real-time surveillance.
UR - http://www.scopus.com/inward/record.url?scp=7444226341&partnerID=8YFLogxK
U2 - 10.1007/3-540-36175-8_44
DO - 10.1007/3-540-36175-8_44
M3 - Conference contribution
AN - SCOPUS:7444226341
T3 - Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)
SP - 437
EP - 448
BT - Advances in Knowledge Discovery and Data Mining
A2 - Wang, Kyu-Young
A2 - Jeon, Jongwoo
A2 - Shim, Kyuseok
A2 - Srivastava, Jaideep
PB - Springer Verlag
Y2 - 30 April 2003 through 2 May 2003
ER -