NETS: Extremely Fast Outlier Detection from a Data Stream via Set-Based Processing

Cited 30 time in webofscience Cited 21 time in scopus
  • Hit : 325
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorYoon, Susikko
dc.contributor.authorLee, Jae-Gilko
dc.contributor.authorLee, Byung Sukko
dc.date.accessioned2019-12-13T08:21:55Z-
dc.date.available2019-12-13T08:21:55Z-
dc.date.created2019-12-09-
dc.date.created2019-12-09-
dc.date.issued2019-07-
dc.identifier.citationPROCEEDINGS OF THE VLDB ENDOWMENT, v.12, no.11, pp.1303 - 1315-
dc.identifier.issn2150-8097-
dc.identifier.urihttp://hdl.handle.net/10203/269074-
dc.description.abstractThis paper addresses the problem of efficiently detecting outliers from a data stream as old data points expire from and new data points enter the window incrementally. The proposed method is based on a newly discovered characteristic of a data stream that the change in the locations of data points in the data space is typically very insignificant. This observation has led to the finding that the existing distance-based outlier detection algorithms perform excessive unnecessary computations that are repetitive and/or canceling out the effects. Thus, in this paper, we propose a novel set-based approach to detecting outliers, whereby data points at similar locations are grouped and the detection of outliers or inliers is handled at the group level. Specifically, a new algorithm NETS is proposed to achieve a remarkable performance improvement by realizing set-based early identification of outliers or inners and taking advantage of the "net effect" between expired and new data points. Additionally, NETS is capable of achieving the same efficiency even for a high-dimensional data stream through two-level dimensional filtering. Comprehensive experiments using six real-world data streams show 5 to 25 times faster processing time than state-of-the-art algorithms with comparable memory consumption. We assert that NETS opens a new possibility to real-time data stream outlier detection.-
dc.languageEnglish-
dc.publisherASSOC COMPUTING MACHINERY-
dc.titleNETS: Extremely Fast Outlier Detection from a Data Stream via Set-Based Processing-
dc.typeArticle-
dc.identifier.wosid000497645900006-
dc.identifier.scopusid2-s2.0-85077815763-
dc.type.rimsART-
dc.citation.volume12-
dc.citation.issue11-
dc.citation.beginningpage1303-
dc.citation.endingpage1315-
dc.citation.publicationnamePROCEEDINGS OF THE VLDB ENDOWMENT-
dc.identifier.doi10.14778/3342263.3342269-
dc.contributor.localauthorLee, Jae-Gil-
dc.contributor.nonIdAuthorLee, Byung Suk-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
Appears in Collection
CS-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 30 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0