adding some initial skeleton of the report.
parent dcea64c838
commit 2c166d7a3a
report/.gitignore vendored Normal file (+8)
@@ -0,0 +1,8 @@
*.pdf
*.bbl
*.blg
*.fls
*.aux
*.gz
*.log
Output
report/rabit.bib Normal file (+69)
@@ -0,0 +1,69 @@
@inproceedings{paramServer,
  author    = {Mu Li and David G. Andersen and Jun Woo Park and Alexander J. Smola and Amr Ahmed and Vanja Josifovski and James Long and Eugene J. Shekita and Bor-Yiing Su},
  title     = {Scaling Distributed Machine Learning with the Parameter Server},
  booktitle = {11th USENIX Symposium on Operating Systems Design and Implementation (OSDI 14)},
  year      = {2014},
  month     = oct,
  isbn      = {978-1-931971-16-4},
  address   = {Broomfield, CO},
  pages     = {583--598},
  url       = {https://www.usenix.org/conference/osdi14/technical-sessions/presentation/li_mu},
  publisher = {USENIX Association},
}

@article{DuchiAW12,
  author  = {Duchi, John C. and Agarwal, Alekh and Wainwright, Martin J.},
  title   = {Dual Averaging for Distributed Optimization: Convergence Analysis and Network Scaling},
  journal = {IEEE Transactions on Automatic Control},
  volume  = {57},
  number  = {3},
  pages   = {592--606},
  year    = {2012},
  url     = {http://dx.doi.org/10.1109/TAC.2011.2161027},
}

@inproceedings{Zinkevich,
  author    = {Martin A. Zinkevich and Alex Smola and Markus Weimer and Lihong Li},
  title     = {Parallelized Stochastic Gradient Descent},
  booktitle = {Advances in Neural Information Processing Systems 23},
  year      = {2010},
  pages     = {2595--2603},
}

@article{Dekel,
  author  = {Dekel, Ofer and Gilad-Bachrach, Ran and Shamir, Ohad and Xiao, Lin},
  title   = {Optimal Distributed Online Prediction Using Mini-Batches},
  journal = {CoRR},
  volume  = {abs/1012.1367},
  year    = {2010},
  url     = {http://arxiv.org/abs/1012.1367},
}

@inproceedings{Low,
  author    = {Yucheng Low and Joseph Gonzalez and Aapo Kyrola and Danny Bickson and Carlos Guestrin and Joseph M. Hellerstein},
  title     = {GraphLab: A New Parallel Framework for Machine Learning},
  booktitle = {Conference on Uncertainty in Artificial Intelligence (UAI)},
  month     = jul,
  year      = {2010},
  address   = {Catalina Island, California},
}

@article{Agarwal,
  author  = {Agarwal, Alekh and Chapelle, Olivier and Dudík, Miroslav and Langford, John},
  title   = {A Reliable Effective Terascale Linear Learning System},
  journal = {CoRR},
  volume  = {abs/1110.4198},
  year    = {2011},
  url     = {http://arxiv.org/abs/1110.4198},
}
report/rabit.tex Normal file (+65)
@@ -0,0 +1,65 @@
\documentclass[10pt,twocolumn]{article}

\usepackage{times}
\usepackage{fullpage}
\usepackage{color}
\usepackage{natbib}

\newcommand{\todo}[1]{\noindent{\textcolor{red}{\{{\bf TODO:} #1\}}}}

\begin{document}

\title{\bf RABIT: A Robust AllReduce and Broadcast Interface}
\author{Tianqi Chen\hspace{0.5in}Ignacio Cano\hspace{0.5in}Tianyi Zhou \\\\
Department of Computer Science \& Engineering \\
University of Washington\\
}
\date{}
\maketitle
\thispagestyle{empty}

\begin{abstract}

AllReduce is an abstraction commonly used in distributed machine learning. It is an operation in which every node starts with a local value and ends up with the same aggregate global result.
The MPI package provides an AllReduce implementation. Though widely adopted, it is somewhat limited: it lacks fault tolerance and cannot easily run on top of existing systems such as Spark or Hadoop.

In this work, we propose RABIT, an AllReduce library suitable for distributed machine learning algorithms that overcomes the aforementioned drawbacks: it is fault-tolerant and runs easily on top of existing systems.

\end{abstract}

\section{Introduction}
Distributed machine learning is an active research area that has seen incredible growth in recent years. Several approaches have been proposed, including parameter server frameworks and graph-based systems, among others \cite{paramServer,DuchiAW12,Zinkevich,Dekel,Low}. The work closest to ours is that of Agarwal et al. \cite{Agarwal}, which uses a communication infrastructure that efficiently accumulates and broadcasts values to every node involved in a computation.
\todo{add more stuff}

\section{AllReduce}

In the AllReduce setting, nodes are organized in a tree structure. Each node holds a portion of the data and computes some values over it. These values are passed up the tree and aggregated until a global aggregate is computed at the root node (reduce). The global value is then passed back down to all other nodes (broadcast). Figure \todo{add image} shows an example of an AllReduce operation; the short sketch below illustrates the same semantics in code.
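To make these semantics concrete, the following is a minimal, single-process C++ sketch (illustrative only, not the RABIT implementation): a reduce pass folds each node's partial sum into its parent along a binary tree, and a broadcast pass copies the root's total back down, so every node ends with the same global sum.

\begin{verbatim}
// Single-process simulation of tree-based
// AllReduce (sum); illustrative only.
#include <cstdio>
#include <vector>

int main() {
  // value[i] is node i's local value; node 0
  // is the root, children of i are 2i+1, 2i+2.
  std::vector<double> value =
      {1, 2, 3, 4, 5, 6, 7};
  int n = static_cast<int>(value.size());
  // Reduce: fold each node's partial sum
  // into its parent, bottom-up.
  for (int i = n - 1; i > 0; --i)
    value[(i - 1) / 2] += value[i];
  // Broadcast: push the root's global sum
  // back down the tree.
  for (int i = 1; i < n; ++i)
    value[i] = value[(i - 1) / 2];
  for (int i = 0; i < n; ++i)
    // every node prints 28
    std::printf("node %d -> %g\n",
                i, value[i]);
  return 0;
}
\end{verbatim}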

\section{Design}

\todo{add key design decisions}

\subsection{Interface}

\todo{add sync module interface, example of how to use the library}
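Until the sync-module interface is pinned down, here is a hypothetical usage sketch; the header path, the \texttt{rabit} namespace, and the \texttt{Init}/\texttt{Allreduce}/\texttt{Finalize} calls are assumptions about the eventual library interface, not its finalized API. Each worker fills a local buffer, and the call aggregates it in place so that every worker ends up with the global sum.

\begin{verbatim}
// Hypothetical usage sketch; the interface
// names below are assumptions, not the
// finalized sync-module API.
#include <vector>
#include <rabit/rabit.h>  // assumed header

int main(int argc, char *argv[]) {
  rabit::Init(argc, argv);  // join the job
  // This worker's local values, e.g. a
  // partial gradient.
  std::vector<float> grad(1024, 1.0f);
  // In-place sum across all workers; every
  // worker then holds the same aggregate.
  rabit::Allreduce<rabit::op::Sum>(
      grad.data(), grad.size());
  rabit::Finalize();
  return 0;
}
\end{verbatim}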

\section{Evaluation}

\todo{add benchmarks and our results}

\section{Conclusion \& Future Work}

With the exponential growth of data on the web, it has become critical to build systems that process information efficiently in order to extract value from it. Several abstractions have been proposed to address these requirements. In this project, we focus on the AllReduce abstraction. We propose an efficient and fault-tolerant version that can be used together with existing big data analytics systems such as Spark and Hadoop.
We compare our solution with MPI's AllReduce implementation and show that the performance difference between the two is negligible, especially considering that our version is fault-tolerant.
\todo{improve this}

\subsection*{Acknowledgments}
Thanks to Arvind Krishnamurthy and the CSE550 teaching staff for their guidance and support during the quarter.

\bibliography{rabit}
\bibliographystyle{abbrv}

\end{document}