- Add a grow-only virtual memory allocator.
- Define a driver API wrapper. Split up the runtime API wrapper.
- Support resource view in ellpack.
- Define the CUDA version of MMAP resource.
- Define the CUDA version of malloc resource.
- Refactor the CUDA runtime API wrappers and add memory-access wrappers.
- Gather Windows macros into a single header.
- Implement `CommGroup` for double dispatching.
- Small cleanup to tracker for handling abort.