# XGBoost Plugin for Federated Learning
This folder contains the plugin for federated learning. Follow these steps to build and test it.
## Install gRPC
```shell
sudo apt-get install build-essential autoconf libtool pkg-config cmake ninja-build
git clone -b v1.47.0 https://github.com/grpc/grpc
cd grpc
git submodule update --init
cmake -S . -B build -GNinja -DABSL_PROPAGATE_CXX_STD=ON
# Installing to the default /usr/local prefix may require sudo.
cmake --build build --target install
```
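Before building the plugin, it is worth checking that the install landed where CMake will look. A minimal sanity check, assuming the default `/usr/local` prefix (the `pkg-config` query relies on the `.pc` files gRPC's CMake build installs):

```shell
which grpc_cpp_plugin           # expect /usr/local/bin/grpc_cpp_plugin
pkg-config --modversion grpc++  # expect 1.47.0 if the .pc files were installed
```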
## Build the Plugin
```shell
# Under the xgboost source tree.
mkdir build
cd build
# For now NCCL needs to be turned off.
cmake .. -GNinja \
  -DPLUGIN_FEDERATED=ON \
  -DUSE_CUDA=ON \
  -DBUILD_WITH_CUDA_CUB=ON \
  -DUSE_NCCL=OFF
ninja
cd ../python-package
pip install -e .  # or equivalently python setup.py develop
```
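A quick smoke test to confirm the editable install is picked up (not part of the official steps):

```shell
# Whether federated support was compiled in is only exercised at runtime.
python -c "import xgboost; print(xgboost.__version__)"
```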
## Test Federated XGBoost
```shell
# Under the xgboost source tree.
cd tests/distributed
# This tests both CPU training (`hist`) and GPU training (`gpu_hist`).
./runtests-federated.sh
```
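The script launches a federated gRPC server plus several local workers and runs the same training job across them. If you only need the server half, for example to point externally managed workers at it, here is a rough sketch using the plugin's Python wrapper (the function name and argument order are assumptions; consult the `xgboost.federated` module and the test script for the actual signature):

```shell
# Assumed API: run_federated_server(port, world_size); verify against the
# installed xgboost.federated module before relying on this.
python -c "import xgboost.federated as fed; fed.run_federated_server(9091, 2)"
```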