Hendrik Groove
8c15f3b665
revert
2024-10-21 11:43:29 +02:00
Hendrik Groove
bb2feab0b2
try
2024-10-21 01:55:41 +02:00
Hendrik Groove
1b6c6baf76
restore stream logic
2024-10-21 01:39:43 +02:00
Hendrik Groove
b3ee7a59c7
try other stream
2024-10-21 01:33:00 +02:00
Hendrik Groove
ea3e7adcdc
hipStreamPerThread
2024-10-21 01:16:40 +02:00
Hendrik Groove
807ee5da88
fix
2024-10-21 00:39:06 +02:00
Hendrik Groove
a135895f3a
fix
2024-10-21 00:35:28 +02:00
Hendrik Groove
ed1636b9c0
fix
2024-10-21 00:32:33 +02:00
Hendrik Groove
1f4154d756
fix
2024-10-21 00:29:38 +02:00
Hendrik Groove
86fcbaf0e5
fix
2024-10-21 00:26:25 +02:00
Hendrik Groove
0d600b4535
try to change stream
2024-10-21 00:25:18 +02:00
Hendrik Groove
6fcffef7dc
fix
2024-10-21 00:18:25 +02:00
Hendrik Groove
ca6fcd361e
fix
2024-10-21 00:13:27 +02:00
Hendrik Groove
c39ad981ce
fix
2024-10-21 00:11:08 +02:00
Hendrik Groove
1de5734d4c
more logging
2024-10-21 00:08:50 +02:00
Hendrik Groove
e2e6b6e71f
more logging
2024-10-21 00:06:21 +02:00
Hendrik Groove
db66fad9e9
SumReduction logging
2024-10-20 23:27:50 +02:00
Hendrik Groove
bf2ef6c586
log reduce function
2024-10-20 23:26:21 +02:00
Hendrik Groove
58a27ba968
more logging
2024-10-20 20:59:23 +02:00
Hendrik Groove
c964dd62b4
more logging
2024-10-20 20:53:50 +02:00
Hendrik Groove
4a10135006
validate label debug
2024-10-20 18:11:03 +02:00
Hendrik Groove
f54355f470
fix path
2024-10-20 17:56:27 +02:00
Hendrik Groove
08f3936bc9
fix path
2024-10-20 17:51:05 +02:00
Hendrik Groove
f50d5344f3
get gradient error logging
2024-10-20 17:40:52 +02:00
Hendrik Groove
ab41cd26a6
add gpu error check
2024-10-20 17:34:51 +02:00
Hendrik Groove
fd95be5f20
validate label logging
2024-10-20 17:32:22 +02:00
Hendrik Groove
60a3bea7c6
add logging
2024-10-20 17:30:17 +02:00
Hendrik Groove
7301022fed
logging
2024-10-20 17:05:34 +02:00
Hendrik Groove
288193cf82
try
2024-10-20 02:41:57 +02:00
Hendrik Groove
e142b52540
use new func
2024-10-20 02:18:41 +02:00
Hendrik Groove
e8fceb8198
add logging
2024-10-20 02:03:55 +02:00
Hendrik Groove
971d3ca8cd
array interface
2024-10-20 01:46:48 +02:00
Hendrik Groove
206f305b65
array interface
2024-10-20 01:28:40 +02:00
Hendrik Groove
8e703f3a5a
try hipHostMalloc
2024-10-20 01:13:44 +02:00
Hendrik Groove
0790bf7f8f
change back
2024-10-17 17:47:10 +02:00
Hendrik Groove
d8a92fe783
test
2024-10-17 17:42:37 +02:00
Hui Liu
bce48bffc6
Merge pull request #2 from hliuca/master-rocm
...
Merge latest upstream changes
2024-04-23 09:50:11 -07:00
Hui Liu
45dc134151
merge changes from upstream
2024-04-22 14:22:16 -07:00
Hui Liu
b27f35e270
rm hip from src
2024-04-22 12:31:14 -07:00
Hui Liu
8b75204fed
merge latest change from upstream
2024-04-22 09:35:31 -07:00
Jiaming Yuan
3fbb221fec
[coll] Implement shutdown for tracker and comm. ( #10208 )
...
- Force shutdown the tracker.
- Implement shutdown notice for error handling thread in comm.
2024-04-20 04:08:17 +08:00
Bobby Wang
8fb05c8c95
[pyspark] support stage-level for yarn/k8s ( #10209 )
2024-04-20 00:24:40 +08:00
dependabot[bot]
bb212bf33c
Bump org.apache.flink:flink-clients in /jvm-packages ( #10197 )
...
Bumps [org.apache.flink:flink-clients](https://github.com/apache/flink ) from 1.18.0 to 1.19.0.
- [Commits](https://github.com/apache/flink/compare/release-1.18.0...release-1.19.0 )
---
updated-dependencies:
- dependency-name: org.apache.flink:flink-clients
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-04-18 15:20:31 -07:00
Jiaming Yuan
3f64b4fde3
[coll] Add global functions. ( #10203 )
2024-04-19 03:17:23 +08:00
dependabot[bot]
551fa6e25e
Bump scalatest.version from 3.2.17 to 3.2.18 in /jvm-packages/xgboost4j ( #10196 )
...
Bumps `scalatest.version` from 3.2.17 to 3.2.18.
Updates `org.scalatest:scalatest_2.12` from 3.2.17 to 3.2.18
- [Release notes](https://github.com/scalatest/scalatest/releases )
- [Commits](https://github.com/scalatest/scalatest/compare/release-3.2.17...release-3.2.18 )
Updates `org.scalactic:scalactic_2.12` from 3.2.17 to 3.2.18
- [Release notes](https://github.com/scalatest/scalatest/releases )
- [Commits](https://github.com/scalatest/scalatest/compare/release-3.2.17...release-3.2.18 )
---
updated-dependencies:
- dependency-name: org.scalatest:scalatest_2.12
dependency-type: direct:development
update-type: version-update:semver-patch
- dependency-name: org.scalactic:scalactic_2.12
dependency-type: direct:development
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-04-18 11:46:28 -07:00
dependabot[bot]
531ff21b20
Bump org.scala-lang.modules:scala-collection-compat_2.12 ( #10193 )
...
Bumps [org.scala-lang.modules:scala-collection-compat_2.12](https://github.com/scala/scala-collection-compat ) from 2.11.0 to 2.12.0.
- [Release notes](https://github.com/scala/scala-collection-compat/releases )
- [Commits](https://github.com/scala/scala-collection-compat/compare/v2.11.0...v2.12.0 )
---
updated-dependencies:
- dependency-name: org.scala-lang.modules:scala-collection-compat_2.12
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-04-18 07:18:18 -07:00
Jiaming Yuan
303c603c7d
[pyspark] Reuse the collective communicator. ( #10198 )
2024-04-18 19:09:30 +08:00
dependabot[bot]
0aa2600399
Bump org.apache.maven.plugins:maven-jar-plugin ( #10202 )
...
Bumps [org.apache.maven.plugins:maven-jar-plugin](https://github.com/apache/maven-jar-plugin ) from 3.3.0 to 3.4.0.
- [Release notes](https://github.com/apache/maven-jar-plugin/releases )
- [Commits](https://github.com/apache/maven-jar-plugin/compare/maven-jar-plugin-3.3.0...maven-jar-plugin-3.4.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-jar-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-04-17 23:04:41 -07:00
Philip Hyunsu Cho
f53f5ca359
[CI] Update machine images ( #10201 )
2024-04-17 19:15:06 -07:00
Jiaming Yuan
4b10200456
[coll] Improve event loop. ( #10199 )
...
- Add a test for blocking calls.
- Do not require the queue to be empty after waking up; this frees up the thread to answer blocking calls.
- Handle EOF in read.
- Improve the error message in the result. Allow concatenation of multiple results.
2024-04-18 03:29:52 +08:00