作成者 |
|
|
|
本文言語 |
|
出版者 |
|
発行日 |
|
収録物名 |
|
出版タイプ |
|
アクセス権 |
|
関連DOI |
|
概要 |
We propose a new approach to large-scale machine learning, learning over compressed data: First compress the training data somehow and then em-ploy various machine learning algorithms on the compresse...d data, with the hope that the computation time is signi_cantly reduced when the training data is well compressed. As a _rst step toward this approach, we consider a variant of the Zero-Suppressed Binary Decision Diagram (ZDD) as the data structure for representing the training data, which is a generalization of the ZDD by incorporating non-determinism. For the learning algorithm to be employed, we consider a boosting algorithm called AdaBoost_ and its precursor AdaBoost. In this paper, we give efficient implementations of the boosting algorithms whose running times (per iteration) are linear in the size of the given ZDD.続きを見る
|