TIST 2011
PLDA+
Abstract
Previous methods of distributed Gibbs sampling for LDA run into either memory or communication bottlenecks. To improve scalability, we propose four strategies: data placement, pipeline processing, word bundling, and priority-based scheduling. Experiments show that our strategies significantly reduce the unparallelizable communication bottleneck and achieve good load balancing, and hence improve the scalability of LDA.
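The abstract refers to Gibbs sampling for LDA. For orientation, here is a minimal single-machine collapsed Gibbs sampler sketch — not the paper's distributed PLDA+ implementation; the function name, parameters, and toy corpus below are illustrative assumptions:

```python
import random

def gibbs_lda(docs, n_topics, n_words, n_iters=200, alpha=0.1, beta=0.01, seed=0):
    """Minimal collapsed Gibbs sampler for LDA (single machine, illustrative).

    docs: list of documents, each a list of word ids in [0, n_words).
    Returns (doc-topic counts, topic-word counts).
    """
    rng = random.Random(seed)
    # Count matrices: document-topic, topic-word, and per-topic totals.
    nd = [[0] * n_topics for _ in docs]
    nw = [[0] * n_words for _ in range(n_topics)]
    nt = [0] * n_topics
    # Randomly initialize a topic assignment for every token.
    z = []
    for d, doc in enumerate(docs):
        zd = []
        for w in doc:
            t = rng.randrange(n_topics)
            zd.append(t)
            nd[d][t] += 1
            nw[t][w] += 1
            nt[t] += 1
        z.append(zd)
    for _ in range(n_iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                t = z[d][i]
                # Remove the token's current assignment from all counts.
                nd[d][t] -= 1; nw[t][w] -= 1; nt[t] -= 1
                # Full conditional: p(k) proportional to
                # (nd[d][k] + alpha) * (nw[k][w] + beta) / (nt[k] + V * beta)
                weights = [
                    (nd[d][k] + alpha) * (nw[k][w] + beta) / (nt[k] + n_words * beta)
                    for k in range(n_topics)
                ]
                t = rng.choices(range(n_topics), weights=weights)[0]
                z[d][i] = t
                nd[d][t] += 1; nw[t][w] += 1; nt[t] += 1
    return nd, nw
```

In distributed variants, the topic-word counts `nw` must be kept consistent across workers; reducing the cost of that synchronization is the communication bottleneck the paper's four strategies target.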
Authors
Keywords
No keywords are indexed for this paper.
Context
- Venue: ACM Transactions on Intelligent Systems and Technology
- Archive span: 2010-2026
- Indexed papers: 1415
- Paper id: 1084234275243846381