Untangling the Braid: Finding Outliers in a Set of Streams

Buragohain, Chiranjeeb; Foschini, Luca; Suri, Subhash


arXiv:0907.2951 (cs)

[Submitted on 16 Jul 2009]


Abstract: Monitoring the performance of large shared computing systems such as the cloud computing infrastructure raises many challenging algorithmic problems. One common problem is to track users with the largest deviation from the norm (outliers), for some measure of performance. Taking a stream-computing perspective, we can think of each user's performance profile as a stream of numbers (such as response times), and the aggregate performance profile of the shared infrastructure as a "braid" of these intermixed streams. The monitoring system's goal then is to untangle this braid sufficiently to track the top k outliers. This paper investigates the space complexity of one-pass algorithms for approximating outliers of this kind, proves lower bounds using multi-party communication complexity, and proposes small-memory heuristic algorithms. On one hand, stream outliers are easily tracked for simple measures, such as max or min, but our theoretical results rule out even good approximations for most of the natural measures such as average, median, or the quantiles. On the other hand, we show through simulation that our proposed heuristics perform quite well for a variety of synthetic data.
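The abstract notes that outliers are easy to track for simple measures such as max. The following is a minimal sketch of why that is plausible, not the authors' algorithm: it assumes the braid arrives as (user, value) pairs and keeps at most k entries in memory. The function name top_k_by_max and the sample data are hypothetical, chosen only for illustration.

```python
def top_k_by_max(braid, k):
    """Return the k users with the largest per-user maximum, in one pass.

    Illustrative sketch only: `braid` is assumed to be an iterable of
    (user, value) pairs from the intermixed streams. At most k users
    are held in memory at any time.
    """
    if k <= 0:
        return []
    best = {}  # user -> largest value seen for that user (size <= k)
    for user, value in braid:
        if user in best:
            # Already tracked: just raise the recorded maximum if needed.
            if value > best[user]:
                best[user] = value
        elif len(best) < k:
            best[user] = value
        else:
            # Untracked user: admit only by beating the weakest tracked max.
            weakest = min(best, key=best.get)
            if value > best[weakest]:
                del best[weakest]
                best[user] = value
    # Largest maximum first.
    return sorted(best.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical braid of response times from four users.
braid = [("a", 5), ("b", 3), ("c", 9), ("a", 7), ("b", 12), ("d", 1)]
result = top_k_by_max(braid, 2)
```

This works for max because the k-th largest maximum never decreases over time, so a user that was evicted or never admitted can re-enter only with a value that is itself its new maximum. No comparably simple bookkeeping applies to the average or the median, which is consistent with the lower bounds the abstract describes for those measures.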

Subjects: Databases (cs.DB); Data Structures and Algorithms (cs.DS)
Cite as: arXiv:0907.2951 [cs.DB] (or arXiv:0907.2951v1 [cs.DB] for this version)
DOI: https://doi.org/10.48550/arXiv.0907.2951

Submission history

From: Luca Foschini
[v1] Thu, 16 Jul 2009 22:57:53 UTC (76 KB)
