Scalable Bayesian Rule Lists

Yang, Hongyu; Rudin, Cynthia; Seltzer, Margo

arXiv:1602.08610 (cs)

[Submitted on 27 Feb 2016 (v1), last revised 3 Apr 2017 (this version, v2)]

Abstract: We present an algorithm for building probabilistic rule lists that is two orders of magnitude faster than previous work. Rule list algorithms are competitors for decision tree algorithms. They are associative classifiers, in that they are built from pre-mined association rules. They have a logical structure that is a sequence of IF-THEN rules, identical to a decision list or one-sided decision tree. Instead of using greedy splitting and pruning like decision tree algorithms, we fully optimize over rule lists, striking a practical balance between accuracy, interpretability, and computational speed. The algorithm presented here uses a mixture of theoretical bounds (tight enough to have practical implications as a screening or bounding procedure), computational reuse, and highly tuned language libraries to achieve computational efficiency. Currently, for many practical problems, this method achieves better accuracy and sparsity than decision trees; further, in many cases, the computational time is practical and often less than that of decision trees. The result is a probabilistic classifier (which estimates P(y = 1|x) for each x) that optimizes the posterior of a Bayesian hierarchical model over rule lists.
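
To make the structure described in the abstract concrete, here is a minimal Python sketch of how a probabilistic rule list produces an estimate of P(y = 1|x): the rules are checked in order, and the first one whose IF-condition holds supplies the probability. This illustrates only the prediction step, not the paper's learning algorithm or its bounds. The class names (`Rule`, `RuleList`), the features (`humidity`, `cloudy`), and all probabilities are hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Example = Dict[str, object]  # one observation x, as feature name -> value


@dataclass
class Rule:
    description: str                        # human-readable IF-condition
    antecedent: Callable[[Example], bool]   # returns True when the rule applies to x
    prob_positive: float                    # estimate of P(y = 1 | x) for captured examples


@dataclass
class RuleList:
    rules: List[Rule]
    default_prob: float  # estimate used when no rule applies (the final ELSE)

    def predict_proba(self, x: Example) -> float:
        # Rules are checked in order; the first matching rule decides.
        for rule in self.rules:
            if rule.antecedent(x):
                return rule.prob_positive
        return self.default_prob


# Hypothetical rule list with made-up probabilities (not taken from the paper).
rain_rules = RuleList(
    rules=[
        Rule("humidity > 80 AND cloudy",
             lambda x: x["humidity"] > 80 and x["cloudy"], 0.9),
        Rule("cloudy", lambda x: bool(x["cloudy"]), 0.4),
    ],
    default_prob=0.05,
)

print(rain_rules.predict_proba({"humidity": 90, "cloudy": True}))
print(rain_rules.predict_proba({"humidity": 50, "cloudy": True}))
print(rain_rules.predict_proba({"humidity": 50, "cloudy": False}))
```

Because only the first matching rule is used, the order of the rules is part of the model: swapping the two rules above would make the more specific one unreachable.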

Comments:	31 pages, 19 figures
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:1602.08610 [cs.AI]
	(or arXiv:1602.08610v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1602.08610

Submission history

From: Hongyu Yang
[v1] Sat, 27 Feb 2016 16:29:24 UTC (1,106 KB)
[v2] Mon, 3 Apr 2017 07:01:26 UTC (943 KB)
