Last Update: 2025-10-01T18:34:04.878Z

kokkos/kokkos

PR 8164 - Adds support for loop unrolling within range loops
Updated: 2025-10-01T18:31:22.000Z open
Comments
Rohit-Kakodkar 2025-10-01T17:08:13.000Z

I’m OK with only implementing the feature for Cuda right (since it’s in Experimental) now but would be prefer an implementation for all bcakends at once. I would approve after resolving the discussion in https://github.com/kokkos/kokkos/pull/8164/files#r2263249878 regardimg potetial overflow (I would also just use a ternary).

I was planning to submit separate PRs for different backends to keep each PR small. Regarding your second comment, what would be the workaround for the overflow error? @romintomasetti @masterleinad

masterleinad 2025-10-01T17:37:06.000Z

Regarding your second comment, what would be the workaround for the overflow error?

Something like

for (Member i = 0; ((i < static_cast<Member>(work_stride * batch_size)) &&
                          (i < work_end - iwork));
           i = (i < (work_end - work_stride - iwork)) ? i + work_stride) : work_end - iwork) {

should do the trick.

Commits
Rohit-Kakodkar - 2025-10-01T18:29:15.000Z - 934c69d9 - Fixed overflow error Signed-off-by: Rohit Kakodkar <kakodkar.rohit@gmail.com>
Events
2025-10-01T13:07:17.000Z - mentioned
2025-10-01T13:07:18.000Z - subscribed
2025-10-01T17:08:15.000Z - mentioned
2025-10-01T17:08:15.000Z - subscribed
2025-10-01T17:08:15.000Z - mentioned
2025-10-01T17:08:15.000Z - subscribed
PR 7675 - Check that parallel constructs invoked before initialize or after finalize will error out
Updated: 2025-10-01T18:04:37.000Z open
Commits
science-enthusiast - 2025-10-01T15:48:17.000Z - 909ce662 - Made tests more readable. Refactored CheckUsage into source and header files. Label parallel constructs using string literals. etc. Signed-off-by: Hariprasad Kannan <hkannan@gmail.com> Co-authored-by: Jan Ciesko <jan.ciesko@gmail.com>
PR 8387 - core(policy): delegating constructor for binding iteration domain with new execution space instance [impl]
Updated: 2025-10-01T18:03:43.000Z open
Comments
Commits
romintomasetti - 2025-10-01T18:02:06.000Z - 62652ad8 - core(policy): allow execution policy space to be updated [impl] Signed-off-by: romintomasetti <romin.tomasetti@gmail.com>
Events
2025-09-30T07:50:08.000Z - head_ref_force_pushed
2025-09-30T12:02:12.000Z - review_requested
2025-09-30T12:22:54.000Z - head_ref_force_pushed
2025-09-30T15:02:16.000Z - head_ref_force_pushed
2025-09-30T15:27:26.000Z - head_ref_force_pushed
2025-09-30T16:13:49.000Z - labeled
2025-10-01T11:09:10.000Z - unlabeled
2025-10-01T11:09:15.000Z - labeled
2025-10-01T11:58:47.000Z - head_ref_force_pushed
2025-10-01T18:03:43.000Z - head_ref_force_pushed
PR 8512 - HIP: improve non-team block size deduction during limited parallelism
Updated: 2025-10-01T18:00:01.000Z open
Comments
joewpeterson 2025-10-01T16:43:41.000Z

Confirmed, these updates have improved the perforce in my little test to what I get from the raw HIP kernel test

Commits
cwpearson - 2025-10-01T15:27:28.000Z - 830e0639 - Add Gemv performance test Signed-off-by: Carl Pearson <cwpears@sandia.gov>
cwpearson - 2025-10-01T17:41:08.000Z - 5dab1d58 - HIP: improve non-team block size heuristic for low parallelism Signed-off-by: Carl Pearson <cwpears@sandia.gov>
Events
2025-10-01T16:09:41.000Z - review_requested
2025-10-01T16:11:17.000Z - head_ref_force_pushed
2025-10-01T16:11:22.000Z - review_requested
2025-10-01T16:11:31.000Z - ready_for_review
2025-10-01T17:31:11.000Z - convert_to_draft
2025-10-01T17:39:31.000Z - head_ref_force_pushed
2025-10-01T17:41:17.000Z - head_ref_force_pushed
2025-10-01T17:59:45.000Z - ready_for_review
PR 8476 - Introduce implicit integer precision access operator for View
Updated: 2025-10-01T17:58:15.000Z open
Comments
crtrott 2025-10-01T17:41:59.000Z

What about View::operator[] ?

Not needed it anyway does 64bit for 1D, and in C++23 I don’t want to introduce the shitty overflow semantics we have right now. I want to get rid of that behavior in the medium term anyway.

Commits
crtrott - 2025-10-01T17:49:27.000Z - f763f6a9 - Introduce implicit integer precision access operator for View Signed-off-by: Christian Trott <crtrott@sandia.gov>
Events
2025-10-01T17:58:15.000Z - head_ref_force_pushed
Issue 8510 - TeamPolicy should store its exec space instance
Updated: 2025-10-01T17:08:44.000Z open
Comments
dalg24 2025-10-01T11:48:00.000Z

cc @rgayatri23 @seyonglee

seyonglee 2025-10-01T17:08:44.000Z

For the OpenACC backend, this issue is addressed in a pending OpenACC PR (#7920).

Events
2025-10-01T11:46:19.000Z - labeled
2025-10-01T11:46:19.000Z - labeled
2025-10-01T11:46:19.000Z - labeled
2025-10-01T11:48:01.000Z - mentioned
2025-10-01T11:48:01.000Z - subscribed
2025-10-01T11:48:01.000Z - mentioned
2025-10-01T11:48:01.000Z - subscribed
PR 8117 - Allow building a C++20 module for Kokkos
Updated: 2025-10-01T16:55:27.000Z open draft
Commits
masterleinad - 2025-09-30T18:02:08.000Z - b84d9250 - Merge remote-tracking branch 'upstream/develop' into kokkoscore_module_only
masterleinad - 2025-09-30T18:03:44.000Z - 5ad25155 - Fix copyright Signed-off-by: Daniel Arndt <arndtd@ornl.gov>
masterleinad - 2025-09-30T18:12:35.000Z - 9acc3c27 - Update example
masterleinad - 2025-10-01T04:00:29.000Z - c41a15c2 - All unit tests compiling
masterleinad - 2025-10-01T04:05:32.000Z - 887a417a - Examples and benchmarks compiling
masterleinad - 2025-10-01T12:19:03.000Z - 386c774b - indentation
masterleinad - 2025-10-01T13:39:08.000Z - 7e0c5bad - Remove duplications Signed-off-by: Daniel Arndt <arndtd@ornl.gov>
masterleinad - 2025-10-01T16:55:04.000Z - a5e7596e - Fix up
Events
2025-10-01T12:31:55.000Z - head_ref_force_pushed
Issue 8497 - Should `Random_XorShift{64,1024}_Pool::init()` be deprecated code 5 instead of 4
Updated: 2025-10-01T16:31:06.000Z open
Comments
crtrott 2025-09-30T14:40:50.000Z

I don’t quite get the argument that it forces to guard against the version. Doesn’t it always if I deprecate/remove/introduce-something new?

dalg24 2025-09-30T14:44:57.000Z

I don’t quite get the argument that it forces to guard against the version. Doesn’t it always if I deprecate/remove/introduce-something new?

Because there is no overlap of versions with the old syntax to avoid and the new thing to use instead, the guards are necessary on this occasion. Because that deprecation is introduced so close to 5.0, users do not have as much time to migrate that we typically like to give them.

crtrott 2025-09-30T14:54:36.000Z

oh you mean we didn’t deprecate in 4.7 so you didn’t get the warning in the same thing. I mean whatever we can do deprecated 5.

terwin 2025-10-01T16:26:26.000Z

We currently support Kokkos 4.5, but also test against develop. I decided to write a simple helper to create the state pool as appropriate:

template<typename ExecSpace>
auto randomPool(const std::uint64_t seed, const int num_states)
{
    using Pool = Kokkos::Random_XorShift64_Pool<ExecSpace>;

    if constexpr (std::is_constructible_v<Pool, std::uint64_t, std::uint64_t>)
    {
        return Pool(seed, num_states);
    }
    else
    {
        Pool pool{};
        pool.init(seed, num_states);
        return pool;
    }
}
dalg24 2025-10-01T16:31:06.000Z

Thanks for sharing your solution.

There is currently no consensus to change the deprecation vehicle. I will keep this issue open a bit longer to give an opportunity to others to weigh in.

PR 8488 - Ensure OpenMP's partition_space doesn't fail for small concurrency
Updated: 2025-10-01T16:25:14.000Z open
Comments
masterleinad 2025-10-01T15:58:19.000Z

@crtrott I updated the pull request as disucssed in the developer meeting. Namely

  • disallow creating non-default Kokkos::OpenMP instances within OpenMP parallel regions
  • return default Kokkos::OpenMP when OMP_NESTED isn’t set or inside OpenMP parallel regions
Commits
masterleinad - 2025-10-01T15:56:03.000Z - c508fe14 - Disallow creating non-default OpenMP instances inside parallel regions Signed-off-by: Daniel Arndt <arndtd@ornl.gov>
masterleinad - 2025-10-01T15:56:03.000Z - f3eef4f4 - OpenMP: Let partition_space return default executiopn space instances in parallel regions and if OMP_NESTED isn't set Signed-off-by: Daniel Arndt <arndtd@ornl.gov>
Events
2025-10-01T15:58:39.000Z - mentioned
2025-10-01T15:58:39.000Z - subscribed
Issue 8506 - HIP: block size heuristic can be bad for low parallelism
Updated: 2025-10-01T16:11:57.000Z open
Comments
Events
2025-09-30T16:48:17.000Z - labeled
2025-09-30T16:48:17.000Z - labeled
2025-09-30T16:48:17.000Z - mentioned
2025-09-30T16:48:17.000Z - subscribed
2025-09-30T22:51:31.000Z - unsubscribed
2025-09-30T22:51:35.000Z - subscribed
PR 8508 - Add Gemv performance test
Updated: 2025-10-01T15:27:36.000Z open
Comments
janciesko 2025-10-01T00:21:33.000Z

Mentioning https://github.com/kokkos/kokkos/pull/7924 to get both benchmarks in while we’re at it.

AdRi1t 2025-10-01T09:21:07.000Z

Using launch bounds to control the block size with HIP may affect compiler optimizations, though I guess in your case it didn’t make a difference. Using chunk size would be preferred because you could reuse the same compiled kernel and the same policy.

cwpearson 2025-10-01T12:15:00.000Z

I don’t see any interaction with ChunkSize / chunk_size in e.g. https://github.com/kokkos/kokkos/blob/b3ec45c33bc8151b92d63a764f01281911c83cf9/core/src/HIP/Kokkos_HIP_BlockSize_Deduction.hpp#L23-L35 so I think for HIP we only have the LaunchBounds mechanism.

Rombur 2025-10-01T12:20:45.000Z

ChunkSize controls the block size only for SYCL not for the other backends.

AdRi1t 2025-10-01T12:45:01.000Z

ChunkSize controls the block size only for SYCL not for the other backends.

Sorry, I misread the documentation

Commits
cwpearson - 2025-10-01T15:27:28.000Z - 830e0639 - Add Gemv performance test Signed-off-by: Carl Pearson <cwpears@sandia.gov>
Events
2025-09-30T19:47:45.000Z - assigned
2025-09-30T19:49:59.000Z - head_ref_force_pushed
2025-09-30T19:55:20.000Z - assigned
2025-09-30T19:55:25.000Z - ready_for_review
2025-10-01T15:27:36.000Z - head_ref_force_pushed
PR 8509 - Use concepts to constrain bit manip overloads and fix `bit_width` return type
Updated: 2025-10-01T15:25:56.000Z open
Commits
dalg24 - 2025-10-01T10:37:32.000Z - 697f3c90 - Use concepts to constrain bit manip overloads and fix bit_width return type Reference [here](https://eel.is/c++draft/bit) Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-10-01T10:48:00.000Z - 36949887 - Use requires clause for bit_cast Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-10-01T11:52:36.000Z - 36900efa - Fixup missed a static assertion in the tests Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-10-01T13:46:31.000Z - 90ab916e - Remove stray enable_if Signed-off-by: Damien L-G <dalg24@gmail.com>
Events
2025-10-01T10:41:57.000Z - labeled
2025-10-01T10:41:57.000Z - labeled
2025-10-01T10:41:57.000Z - labeled
PR 7368 - Fix using OpenMP TeamPolicy from OpenMP parallel region
Updated: 2025-10-01T15:19:04.000Z closed draft
Events
2025-10-01T15:19:05.000Z - closed
PR 8479 - Deprecate `Experimental::ErrorReporter`
Updated: 2025-10-01T15:09:17.000Z closed
Comments
dalg24 2025-10-01T15:08:59.000Z

No consensus to deprecate, we are proceeding with #8486

Events
2025-10-01T15:08:59.000Z - closed
2025-10-01T15:09:17.000Z - head_ref_deleted
PR 8511 - Desul atomics: Revisit always lock-free
Updated: 2025-10-01T15:02:38.000Z open
Comments
JBludau 2025-10-01T13:38:09.000Z

on GH200 with cuda 12.8.1 I see the same speedup as in https://github.com/kokkos/kokkos/pull/8495

Test_Atomic<__int128>/100000/iterations:10                     7.29 s          7.27 s            10 Passed=1 Size of type=16 Time atomic=11.8644 Time non-atomic=806.09u Time serial=626.88u Value atomic=4.99995G Value non-atomic=160.648k Value serial=4.99995G
dalg24 2025-10-01T13:50:12.000Z

Please check the performance on the host side on a system that has 126-bit atomics support

JBludau 2025-10-01T14:58:15.000Z

Please check the performance on the host side on a system that has 126-bit atomics support

On X86-64 with OPENMP I see a similar speedup with this (and linking -latomic and excluding the macro generation for the device trait to make it compile)

NEW
Test_Atomic<__int128>/100000/iterations:10                    0.330 s         0.305 s            10 Passed=1 Size of type=16 Time atomic=0.593478 Time non-atomic=0.408238 Time serial=971.4u Value atomic=4.99995G Value non-atomic=188.618M Value serial=4.99995G
OLD
Test_Atomic<__int128>/100000/iterations:10                     3.20 s          2.94 s            10 Passed=1 Size of type=16 Time atomic=4.10126 Time non-atomic=0.0639739 Time serial=642u Value atomic=4.99995G Value non-atomic=121.247M Value serial=4.99995G
Commits
dalg24 - 2025-10-01T12:28:09.000Z - 2b2f1764 - Desul atomics: Revisit always lock-free We need to distinguish host and device overload sets. Signed-off-by: Damien L-G <dalg24@gmail.com>
2025-10-01T13:36:43.000Z - 9ee51f11 - host_device annotations on 16byte dummy
Events
2025-10-01T12:28:58.000Z - review_requested
PR 8167 - Make Kokkos be usable in multiple CMake languages
Updated: 2025-10-01T14:52:11.000Z open
PR 8491 - SYCL: Check for device global support on Nvidia and AMD gpus
Updated: 2025-10-01T14:06:04.000Z open
Events
2025-09-29T19:56:43.000Z - labeled
2025-10-01T13:54:11.000Z - review_requested
2025-10-01T14:06:04.000Z - review_dismissed
PR 8495 - Cas based 128bit cuda atomics
Updated: 2025-10-01T12:56:38.000Z open
Comments
masterleinad 2025-10-01T12:52:04.000Z

Since pretty much all the changes are in desul it should rather be discussed there.

Commits
2025-09-30T16:20:09.000Z - 65384cb6 - unsorted WIP Signed-off-by: Jakob Bludau <bludauj@ornl.gov>
2025-10-01T00:54:03.000Z - 9939c266 - move compile line definition to kokkos_tpls Signed-off-by: Jakob Bludau <bludauj@ornl.gov>
2025-10-01T00:54:46.000Z - 45849b50 - cmake format Signed-off-by: Jakob Bludau <bludauj@ornl.gov>
Events
2025-09-30T16:20:53.000Z - head_ref_force_pushed
2025-09-30T20:46:59.000Z - ready_for_review
2025-09-30T20:47:12.000Z - review_requested
2025-10-01T00:54:52.000Z - head_ref_force_pushed
2025-10-01T00:55:58.000Z - review_requested
PR 7893 - Enable large memory tests on some GPU builds in HPSF CI
Updated: 2025-10-01T12:49:32.000Z open
Comments
masterleinad 2025-10-01T12:30:25.000Z
serial.view_64bit (60744 ms)
cuda.view_64bit (733 ms)
sycl.view_64bit (1872 ms)
hip.view_64bit (5081 ms)
masterleinad 2025-10-01T12:49:32.000Z

Since most of the changes are for SYCL, I’m fine with just changing that one build if we think it’s too expensive otherwise.

Commits
masterleinad - 2025-09-30T13:02:06.000Z - a9f81863 - Merge remote-tracking branch 'upstream/develop' into hpsf_enable_large_memory_tests
masterleinad - 2025-09-30T14:18:38.000Z - 3b51489a - Make end const drop one check
Events
2025-09-29T21:32:41.000Z - review_requested
PR 8505 - [skip ci] Update jenkins nightly configurations to ROCm 7.0
Updated: 2025-10-01T12:23:31.000Z open
Comments
Rombur 2025-09-30T15:34:51.000Z

I spoke too soon when I said that our tests are passing with ROCm 7.0. Everything passes when core dump is enabled but the death tests fail when it is disabled. This is because of

Exceptions occurring during a kernel execution will not abort the process anymore but will return an error unless core dump is enabled.
janciesko 2025-09-30T23:11:48.000Z

This does not show up in our CI?

I spoke too soon when I said that our tests are passing with ROCm 7.0. Everything passes when core dump is enabled but the death tests fail when it is disabled. This is because of

Exceptions occurring during a kernel execution will not abort the process anymore but will return an error unless core dump is enabled.
Rombur 2025-10-01T12:23:31.000Z

This does not show up in our CI?

No because the container has core dump enable same as Frontier

Commits
Rombur - 2025-09-30T14:46:05.000Z - 666763d4 - [skip ci] Update jenkins nightly configurations to ROCm 7.0 Signed-off-by: Bruno Turcksin <bruno.turcksin@gmail.com>
Rombur - 2025-09-30T15:05:05.000Z - 7a16ff2d - [skip-ci]: use /opt/rocm instead of /opt/rocm/lib/cmake Signed-off-by: Bruno Turcksin <bruno.turcksin@gmail.com>
Events
2025-09-30T14:46:15.000Z - head_ref_force_pushed
2025-09-30T15:31:02.000Z - labeled
Issue 7060 - Sycl unit failures with intel/2024.1.0 on PVC arch
Updated: 2025-10-01T11:03:06.000Z closed
Comments
dalg24 2025-10-01T11:03:06.000Z

(2024.2.1 is the new minimum required in Kokkos 5.0)

Events
2025-10-01T10:50:43.000Z - closed
PR 8492 - Modernize bit manipulation function templates
Updated: 2025-10-01T01:49:24.000Z closed
Comments
dalg24 2025-09-30T01:47:41.000Z

@seyonglee would you be a able to look at the OpenACC failure?

seyonglee 2025-09-30T14:42:11.000Z

@seyonglee would you be a able to look at the OpenACC failure?

I will check the issue.

seyonglee 2025-09-30T17:59:19.000Z

@seyonglee would you be a able to look at the OpenACC failure?

I will check the issue.

Temporarily disabled failing OpenACC tests in TestBitManipulationBuiltins unit test.

dalg24 2025-09-30T20:19:13.000Z

Temporarily disabled failing OpenACC tests in TestBitManipulationBuiltins unit test.

Thank you

Commits
dalg24 - 2025-09-30T14:07:32.000Z - 4766c3a2 - Distinguish host and device in the if consteval branch Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-09-30T14:12:12.000Z - d298664e - Per review rollback using C++23 extensions Signed-off-by: Damien L-G <dalg24@gmail.com>
seyonglee - 2025-09-30T17:44:17.000Z - f5320909 - Disable failing OpenACC tests Signed-off-by: Seyong Lee <lees2@ornl.gov>
Events
2025-09-30T01:47:42.000Z - mentioned
2025-09-30T01:47:42.000Z - subscribed
2025-09-30T14:42:13.000Z - mentioned
2025-09-30T14:42:13.000Z - subscribed
2025-09-30T17:59:20.000Z - mentioned
2025-09-30T17:59:20.000Z - subscribed
2025-09-30T19:09:18.000Z - labeled
2025-10-01T01:48:57.000Z - merged
2025-10-01T01:48:57.000Z - closed
2025-10-01T01:49:24.000Z - head_ref_deleted
PR 8502 - Add COPYRIGHT notice file with contributors
Updated: 2025-10-01T00:16:56.000Z open
Commits
crtrott - 2025-10-01T00:16:43.000Z - b5fbe401 - Add COPYRIGHT notice file with contributors Signed-off-by: Christian Trott <crtrott@sandia.gov>
Events
2025-09-29T18:58:27.000Z - head_ref_force_pushed
2025-10-01T00:16:51.000Z - head_ref_force_pushed
PR 8367 - [WIP] Self similar parallel range interface using partial specialization
Updated: 2025-09-30T21:09:48.000Z open draft
Commits
tcclevenger - 2025-09-30T20:57:12.000Z - 56e7cf10 - Merge TeamVectorRange into RangePolicy using partial specialization Both of these concepts represent "iterate over a range using all available resources", now they are merged into RangePolicy, where RangePolicy<ExecSpace> is callable on host, and RangePolicy<TeamHandle> is callable within a team. Signed-off-by: Conrad Clevenger <tccleve@sandia.gov>
crtrott - 2025-09-30T20:57:19.000Z - 89d5f34f - Fix host device warnings Signed-off-by: Christian Trott <crtrott@sandia.gov>
Events
2025-09-30T19:47:16.000Z - head_ref_force_pushed
2025-09-30T20:42:25.000Z - head_ref_force_pushed
2025-09-30T20:57:35.000Z - head_ref_force_pushed
PR 8263 - Provide a C++20 module for Kokkos Core
Updated: 2025-09-30T18:04:04.000Z open
Commits
masterleinad - 2025-09-30T18:02:08.000Z - b84d9250 - Merge remote-tracking branch 'upstream/develop' into kokkoscore_module_only
masterleinad - 2025-09-30T18:03:44.000Z - 5ad25155 - Fix copyright Signed-off-by: Daniel Arndt <arndtd@ornl.gov>
PR 8507 - Desul atomics: Use Clang atomic min/max GCC-style builtins
Updated: 2025-09-30T17:43:15.000Z open
Commits
dalg24 - 2025-09-30T17:40:18.000Z - 9127ec1d - Desul atomics: Use Clang atomic min/max GCC-style builtins Signed-off-by: Damien L-G <dalg24@gmail.com>
PR 8503 - simd: update simd type names
Updated: 2025-09-30T15:00:04.000Z open
Comments
ldh4 2025-09-29T21:15:02.000Z

Should this be guarded with KOKKOS_ENABLE_DEPRECATED_CODE_4 or KOKKOS_ENABLE_DEPRECATED_CODE_5?

nmm0 2025-09-30T15:00:04.000Z

To be honest I’m not sure it’s worth deprecating the old simd names or even changing them in our code. Our stuff is spelled differently anyway with being in the Kokkos namespace. But if @crtrott had a different opinion I’m fine following that

Commits
ldh4 - 2025-09-29T21:14:06.000Z - 7e0ac27e - clang-formatted Signed-off-by: Dong Hun Lee <donlee@sandia.gov>
Events
2025-09-30T15:00:05.000Z - mentioned
2025-09-30T15:00:05.000Z - subscribed
PR 8504 - simd: replace sfinae with concepts
Updated: 2025-09-30T14:47:53.000Z open
Commits
ldh4 - 2025-09-29T20:21:20.000Z - 07bb4b14 - Converted to use concept over sfinae Signed-off-by: Dong Hun Lee <donlee@sandia.gov>
ldh4 - 2025-09-29T21:26:31.000Z - 2d0f90ec - Remove unused concepts Signed-off-by: Dong Hun Lee <donlee@sandia.gov>
Events
2025-09-29T21:11:45.000Z - labeled
2025-09-30T14:47:53.000Z - review_requested
Issue 8265 - CHANGELOG for 5.0
Updated: 2025-09-30T14:21:35.000Z open
PR 8501 - HIP: add support for `navi4` architecture
Updated: 2025-09-30T14:19:36.000Z closed
Comments
dalg24 2025-09-30T14:19:35.000Z

Please add to the 5.0 changelog

Events
2025-09-29T23:39:41.000Z - labeled
2025-09-30T14:19:24.000Z - merged
2025-09-30T14:19:24.000Z - closed
PR 6636 - Revisit view move semantics
Updated: 2025-09-30T12:29:28.000Z open
Comments
masterleinad 2025-09-30T12:29:28.000Z

What behavior do we get when/o the changes in the source files?

The data pointer doesn’t change (movnig a pointer doesn’t change it’s value).

Commits
masterleinad - 2025-09-29T22:03:19.000Z - e43c695c - Merge remote-tracking branch 'upstream/develop' into view_move_semantics Signed-off-by: Daniel Arndt <arndtd@ornl.gov>
PR 8458 - Add an MDRange stencil perf test
Updated: 2025-09-30T12:18:01.000Z open
Commits
tretre91 - 2025-09-30T07:35:04.000Z - 5009eeb2 - Use c++17 styel traits and fix clang-tidy warnings in the perf test Signed-off-by: Trévis Morvany <trevis.morvany@cea.fr>
tretre91 - 2025-09-30T07:35:04.000Z - 75a49b6b - Replace the init functors by calls to deep_copy Signed-off-by: Trévis Morvany <trevis.morvany@cea.fr>
tretre91 - 2025-09-30T07:35:05.000Z - 9cd824fa - Factorize the correctness check function Signed-off-by: Trévis Morvany <trevis.morvany@cea.fr>
tretre91 - 2025-09-30T07:35:05.000Z - 0dd41e92 - Factorize the test functions and add google benchmark Signed-off-by: Trévis Morvany <trevis.morvany@cea.fr>
tretre91 - 2025-09-30T07:35:05.000Z - 858cb01f - Add bench for 2 dim collapsed and all dim collapsed case Signed-off-by: Trévis Morvany <trevismorvany@gmail.com>
tretre91 - 2025-09-30T07:35:05.000Z - d48636c0 - Add multiple layouts and use int everywhere Signed-off-by: Trévis Morvany <trevis.morvany@cea.fr>
tretre91 - 2025-09-30T07:35:05.000Z - 6123e8ca - Add the 2D case - start making the functor structs more generic and a 2D stencil benchmarks Signed-off-by: Trévis Morvany <trevis.morvany@cea.fr>
tretre91 - 2025-09-30T07:35:05.000Z - 66cfbae0 - Fix 2D index computation Signed-off-by: Trévis Morvany <trevis.morvany@cea.fr>
tretre91 - 2025-09-30T07:35:05.000Z - e9e08f43 - Add 4D tests Signed-off-by: Trévis Morvany <trevismorvany@gmail.com>
tretre91 - 2025-09-30T07:36:04.000Z - 740f47cf - Rename the perf test and remove unused file Signed-off-by: Trévis Morvany <trevis.morvany@cea.fr>
tretre91 - 2025-09-30T07:36:29.000Z - 824a88e9 - Add a macro for declaring the benchmarks Signed-off-by: Trévis Morvany <trevis.morvany@cea.fr>
tretre91 - 2025-09-30T07:36:29.000Z - d2dd7e2d - Correct an index calculation Signed-off-by: Trévis Morvany <trevis.morvany@cea.fr>
tretre91 - 2025-09-30T07:36:29.000Z - f332dd3b - Don't fix the number of iterations Signed-off-by: Trévis Morvany <trevis.morvany@cea.fr>
tretre91 - 2025-09-30T07:36:29.000Z - c8cffdec - Replace the `get_pointer` struct with a more generic one Signed-off-by: Trévis Morvany <trevis.morvany@cea.fr>
tretre91 - 2025-09-30T07:36:29.000Z - be824b9e - Add missing typename Signed-off-by: Trévis Morvany <trevis.morvany@cea.fr>
Events
2025-09-30T07:39:44.000Z - head_ref_force_pushed
2025-09-30T12:18:01.000Z - labeled
PR 8459 - enable view hooks for new mdspan-based view implementation
Updated: 2025-09-29T23:39:09.000Z open
PR 8496 - Update source code header
Updated: 2025-09-29T22:37:44.000Z closed
Commits
dalg24 - 2025-09-29T19:00:10.000Z - 3f730dbd - Update all source files header Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-09-29T19:00:11.000Z - 0e81126b - Update template header used in CI checks Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-09-29T19:00:11.000Z - 14d5b291 - Move header template to scripts/ Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-09-29T19:00:11.000Z - 5ee71c3b - Update test sensitive to the length of the header Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-09-29T19:15:28.000Z - cc3b1b8a - Update copyright script to also handle .cppm module interface files
dalg24 - 2025-09-29T19:17:05.000Z - c50fdea7 - Update headers in .cppm files Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-09-29T19:27:04.000Z - 079a9863 - Update more files that are not tracked byt the copyright scripts Signed-off-by: Damien L-G <dalg24@gmail.com>
Events
2025-09-29T19:27:53.000Z - head_ref_force_pushed
2025-09-29T22:37:44.000Z - merged
2025-09-29T22:37:44.000Z - closed
PR 8025 - Desul atomics: Add 128bit CAS for >CC90 on NVIDIA
Updated: 2025-09-29T21:21:36.000Z closed
Events
2025-09-29T21:21:36.000Z - merged
2025-09-29T21:21:36.000Z - closed

kokkos/kokkos-core-wiki

PR 731 - Adding documentation for ErrorReporter
Updated: 2025-10-01T00:37:12.000Z open
PR 732 - Bump crate-ci/typos from 1.36.2 to 1.36.3
Updated: 2025-09-29T22:34:43.000Z closed
Commits
dependabot[bot] - 2025-09-29T22:31:49.000Z - 4b253a64 - Bump crate-ci/typos from 1.36.2 to 1.36.3 Bumps [crate-ci/typos](https://github.com/crate-ci/typos) from 1.36.2 to 1.36.3. - [Release notes](https://github.com/crate-ci/typos/releases) - [Changelog](https://github.com/crate-ci/typos/blob/master/CHANGELOG.md) - [Commits](https://github.com/crate-ci/typos/compare/v1.36.2...v1.36.3) --- updated-dependencies: - dependency-name: crate-ci/typos dependency-version: 1.36.3 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>
Events
2025-09-29T22:31:50.000Z - labeled
2025-09-29T22:31:50.000Z - labeled
2025-09-29T22:31:52.000Z - labeled
2025-09-29T22:31:52.000Z - labeled
2025-09-29T22:34:36.000Z - merged
2025-09-29T22:34:36.000Z - closed
2025-09-29T22:34:43.000Z - head_ref_deleted

kokkos/kokkos-fft

PR 278 - Address oneAPI 2025 deprecations
Updated: 2025-10-01T18:00:37.000Z open
Commits
2025-10-01T17:58:31.000Z - a8677092 - Address oneAPI 2025 deprecations
2025-10-01T17:58:31.000Z - f053b9f1 - remove duplicated comments
2025-10-01T17:58:31.000Z - 4d10914a - Upass vectors rather than pointers to set_value function for newer oneMKL
2025-10-01T18:00:06.000Z - 642696ae - fix: FWD_STRIDES and BWD_STRIDES in SYCL plan creation
Events
2025-10-01T18:00:37.000Z - head_ref_force_pushed
PR 327 - Build(deps): Bump docker/login-action from 3.5.0 to 3.6.0
Updated: 2025-09-30T07:30:18.000Z closed
Commits
dependabot[bot] - 2025-09-29T23:20:19.000Z - b8f6da19 - Build(deps): Bump docker/login-action from 3.5.0 to 3.6.0 Bumps [docker/login-action](https://github.com/docker/login-action) from 3.5.0 to 3.6.0. - [Release notes](https://github.com/docker/login-action/releases) - [Commits](https://github.com/docker/login-action/compare/184bdaa0721073962dff0199f1fb9940f07167d1...5e57cd118135c172c3672efd75eb46360885c0ef) --- updated-dependencies: - dependency-name: docker/login-action dependency-version: 3.6.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>
Events
2025-09-29T23:20:20.000Z - labeled
2025-09-29T23:20:20.000Z - labeled
2025-09-29T23:20:21.000Z - labeled
2025-09-29T23:20:21.000Z - labeled
2025-09-30T07:30:10.000Z - merged
2025-09-30T07:30:10.000Z - closed
2025-09-30T07:30:18.000Z - head_ref_deleted
PR 326 - Build(deps): Bump crate-ci/typos from 1.36.2 to 1.36.3
Updated: 2025-09-30T07:30:10.000Z closed
Commits
dependabot[bot] - 2025-09-29T23:11:28.000Z - 10aa6516 - Build(deps): Bump crate-ci/typos from 1.36.2 to 1.36.3 Bumps [crate-ci/typos](https://github.com/crate-ci/typos) from 1.36.2 to 1.36.3. - [Release notes](https://github.com/crate-ci/typos/releases) - [Changelog](https://github.com/crate-ci/typos/blob/master/CHANGELOG.md) - [Commits](https://github.com/crate-ci/typos/compare/85f62a8a84f939ae994ab3763f01a0296d61a7ee...0c17dabcee8b8f1957fa917d17393a23e02e1583) --- updated-dependencies: - dependency-name: crate-ci/typos dependency-version: 1.36.3 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>
Events
2025-09-29T23:11:30.000Z - labeled
2025-09-29T23:11:30.000Z - labeled
2025-09-29T23:11:31.000Z - labeled
2025-09-29T23:11:31.000Z - labeled
2025-09-30T07:30:02.000Z - merged
2025-09-30T07:30:02.000Z - closed
2025-09-30T07:30:10.000Z - head_ref_deleted

kokkos/kokkos-kernels

PR 2796 - [Docs] batched serial tbsv
Updated: 2025-10-01T16:56:53.000Z open
Commits
yasahi-hpc - 2025-09-30T09:48:55.000Z - f5527744 - fix syr routine name Signed-off-by: yasahi-hpc <y.asahi6412@gmail.com>
yasahi-hpc - 2025-09-30T09:48:55.000Z - a6d653b2 - Add batched serial tbsv docs Signed-off-by: yasahi-hpc <y.asahi6412@gmail.com>
Events
2025-09-30T09:51:05.000Z - assigned
2025-09-30T09:51:12.000Z - labeled
PR 2800 - Avoid opening the Kokkos namespace
Updated: 2025-10-01T16:39:40.000Z open
Commits
dalg24 - 2025-09-30T22:55:20.000Z - 0c7d81f3 - Avoid opening the Kokkos namespace This is a nitpick. Qualify the reduction_identity specialization instead of opening the Kokkos namespace. With that change, one can grep for "namespace Kokkos {" to find all naughty code. Signed-off-by: Damien L-G <dalg24@gmail.com>
Events
2025-09-30T22:56:32.000Z - labeled
PR 2795 - Update headers in preparation for the 5.0 release
Updated: 2025-10-01T17:19:01.000Z open
Comments
dalg24 2025-10-01T17:07:08.000Z

Note that https://github.com/kokkos/kokkos-kernels/pull/2771 introduced a new header with the Kokkos 4 header and that is not handled in the current version of this PR

lucbv 2025-10-01T17:19:00.000Z

Good point, I’ll fix the header for KokkosKernels_ArithTraits

Commits
dalg24 - 2025-09-30T02:13:24.000Z - d62e1691 - Update header Kokkos 4 -> 5 (misformatted) Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-09-30T02:14:39.000Z - 36f6cca4 - Update header Kokkos 4 -> 5 Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-09-30T02:17:52.000Z - 634b2d4b - Update header Kokkos 3 -> 5 Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-09-30T02:20:41.000Z - 174f2f5a - Last ones Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-09-30T02:22:13.000Z - d9bc7f09 - Remove unused files Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-09-30T02:24:35.000Z - 7b1f0a3b - Update header in Python file Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-09-30T02:25:05.000Z - 094a3a1a - Improve script to fix the headers Signed-off-by: Damien L-G <dalg24@gmail.com>
dalg24 - 2025-09-30T02:25:34.000Z - e52fe380 - Update copyright notice in the doc Signed-off-by: Damien L-G <dalg24@gmail.com>
PR 2771 - Common - ArithTraits: moving from Kokkos to KokkosKernels
Updated: 2025-10-01T14:55:32.000Z closed
Comments
dalg24 2025-09-30T12:08:35.000Z

Any chance to deprecate the header as well and redirect to a new header with appropriate prefix?

Would you be willing to move the traits definition to a new <KokkosKernels_ArithTraits.hpp>header and have < Kokkos_ArithTraits.hpp > just include it along with a using directive in the Kokkos:: namespace.

ndellingwood 2025-10-01T02:04:42.000Z

We’ll need to fix Trilinos for these changes, e.g.

In file included from /root/Trilinos/packages/stokhos/src/sacado/kokkos/vector/tpetra/Stokhos_Tpetra_MP_Vector.hpp:29:
/root/Trilinos/packages/stokhos/src/sacado/kokkos/vector/linalg/Kokkos_ArithTraits_MP_Vector.hpp:24:7: error: cannot specialize a dependent template
   24 | class ArithTraits< Sacado::MP::Vector<S> > {
ndellingwood 2025-10-01T02:05:47.000Z

@lucbv just to check, these changes only apply to versions >= 5.0 ? (So I know how to guard in Trilinos)

dalg24 2025-10-01T02:24:09.000Z

I suppose you could do “`C++ #if __has_include(KokkosKernels_ArithTraits.hpp) // since 5.0

lucbv 2025-10-01T14:55:32.000Z

Yes this would only apply to versions of Kokkos >= 5.0 so for a 4.7.2 release we would not want this change.

Commits
lucbv - 2025-09-29T19:18:05.000Z - f4866831 - Common - ArithTraits: moving from Kokkos to KokkosKernels This simply moves the class to a new namespace to avoid having it in the Kokkos namespace. Signed-off-by: Luc Berger-Vergiat <lberge@sandia.gov>
lucbv - 2025-09-29T20:52:21.000Z - bdef930c - Applying change of namespace to library and tests Signed-off-by: Luc Berger-Vergiat <lberge@sandia.gov>
lucbv - 2025-09-29T21:29:46.000Z - d90eff83 - Trying to fix the issue with sycl... Signed-off-by: Luc Berger-Vergiat <lberge@sandia.gov>
lucbv - 2025-09-29T21:48:40.000Z - f0c21cad - More fixes for std::complex<T> Signed-off-by: Luc Berger-Vergiat <lberge@sandia.gov>
lucbv - 2025-09-30T15:38:34.000Z - b615b278 - common: moving ArithTraits to KokkosKernels_ArithTraits.hpp This avoid having us use the naming convention of Kokkos Core for something that lives in Kokkos Kernels. Next, I will modify header inclusions in the library to reflect this change. Signed-off-by: Luc Berger-Vergiat <lberge@sandia.gov>
lucbv - 2025-09-30T15:55:21.000Z - 782b2297 - Updating header inclusion to use new KokkosKernels_ArithTraits.hpp Signed-off-by: Luc Berger-Vergiat <lberge@sandia.gov>
lucbv - 2025-09-30T16:00:20.000Z - 560e9d4d - Applying clang-format Signed-off-by: Luc Berger-Vergiat <lberge@sandia.gov>
Events
2025-09-29T20:52:56.000Z - head_ref_force_pushed
2025-09-30T22:22:58.000Z - merged
2025-09-30T22:22:59.000Z - closed
2025-09-30T22:23:03.000Z - head_ref_deleted
2025-10-01T02:05:48.000Z - mentioned
2025-10-01T02:05:48.000Z - subscribed
2025-10-01T02:27:22.000Z - referenced
2025-10-01T05:39:04.000Z - referenced
2025-10-01T05:39:04.000Z - referenced
2025-10-01T05:39:04.000Z - referenced
2025-10-01T05:39:04.000Z - referenced
2025-10-01T05:39:04.000Z - referenced
2025-10-01T05:39:04.000Z - referenced
2025-10-01T05:39:04.000Z - referenced
2025-10-01T05:39:04.000Z - referenced
2025-10-01T05:39:05.000Z - referenced
2025-10-01T05:39:05.000Z - referenced
2025-10-01T05:39:05.000Z - referenced
2025-10-01T05:39:05.000Z - referenced
2025-10-01T05:39:05.000Z - referenced
2025-10-01T05:39:05.000Z - referenced
2025-10-01T14:12:54.000Z - referenced
2025-10-01T14:12:54.000Z - referenced
2025-10-01T14:12:55.000Z - referenced
2025-10-01T14:12:55.000Z - referenced
2025-10-01T14:12:55.000Z - referenced
2025-10-01T14:12:55.000Z - referenced
2025-10-01T14:12:55.000Z - referenced
2025-10-01T14:12:55.000Z - referenced
2025-10-01T14:12:55.000Z - referenced
2025-10-01T14:12:56.000Z - referenced
2025-10-01T14:12:56.000Z - referenced
2025-10-01T14:12:56.000Z - referenced
2025-10-01T14:12:56.000Z - referenced
2025-10-01T14:12:56.000Z - referenced
2025-10-01T14:12:56.000Z - referenced
Issue 1485 - Test failures in clang >= 10 + cuda builds
Updated: 2025-10-01T13:22:08.000Z closed
Comments
ndellingwood 2025-10-01T13:21:58.000Z

Outdated (unsupported compilers)

Events
2025-10-01T13:22:08.000Z - closed
Issue 1445 - Nightly build failure: ICE with intel/19 + HSW,KNL arch, in perf_test/blas/blas3/KokkosBlas3_perf_test.cpp
Updated: 2025-10-01T10:49:19.000Z closed
Events
2025-10-01T10:49:19.000Z - closed
Issue 2799 - Kokkos compatibility: Move `InnerProductSpaceTraits` to KokkosKernels
Updated: 2025-09-30T22:43:53.000Z open
PR 2798 - Require `KOKKOS_ENABLE_DEPRECATED_CODE_5` to be defined for including `<Kokkos_ArithTraits.hpp>`
Updated: 2025-09-30T22:37:06.000Z open
Commits
dalg24 - 2025-09-30T22:34:43.000Z - 64077d23 - Require KOKKOS_ENABLE_DEPRECATED_CODE_5 to be defined for including <Kokkos_ArithTraits.hpp> Signed-off-by: Damien L-G <dalg24@gmail.com>
Events
2025-09-30T22:37:07.000Z - labeled
PR 2797 - [Docs] Add batched serial laswp example for documentation
Updated: 2025-09-30T20:25:29.000Z closed
Commits
2025-09-30T11:29:21.000Z - 4b666bcf - Add batched serial laswp example Signed-off-by: Yuuichi Asahi <y.asahi@nr.titech.ac.jp>
2025-09-30T12:08:30.000Z - 81304fda - use new header in laswp example Signed-off-by: Yuuichi Asahi <y.asahi@nr.titech.ac.jp>
Events
2025-09-30T11:31:14.000Z - assigned
2025-09-30T11:31:21.000Z - labeled
2025-09-30T11:31:21.000Z - labeled
2025-09-30T13:25:46.000Z - labeled
2025-09-30T20:25:29.000Z - merged
2025-09-30T20:25:29.000Z - closed
PR 2786 - Prefer bit manipulation functions from Kokkos Core
Updated: 2025-09-29T22:10:35.000Z closed
Events
2025-09-29T21:51:30.000Z - merged
2025-09-29T21:51:30.000Z - closed
2025-09-29T22:10:35.000Z - head_ref_deleted
PR 2793 - Bump pyyaml from 6.0.2 to 6.0.3 in /docs
Updated: 2025-09-29T19:14:20.000Z closed
Events
2025-09-29T19:14:13.000Z - merged
2025-09-29T19:14:13.000Z - closed
2025-09-29T19:14:20.000Z - head_ref_deleted

kokkos/kokkos.github.io

PR 200 - Fix tab reference to the developer meetings page
Updated: 2025-09-30T17:44:11.000Z closed
Commits
dalg24 - 2025-09-30T16:54:51.000Z - 154b46c1 - Fix tab reference to the developer meetings page Signed-off-by: Damien L-G <dalg24@gmail.com>
Events
2025-09-30T16:56:13.000Z - review_requested
2025-09-30T17:43:41.000Z - merged
2025-09-30T17:43:41.000Z - closed
2025-09-30T17:44:11.000Z - head_ref_deleted
PR 199 - Add Adrien from CEA to the team roster
Updated: 2025-09-30T16:48:01.000Z closed
Commits
dalg24 - 2025-09-30T16:45:16.000Z - 03de2bd2 - Add Adrien from CEA to the team roster
Events
2025-09-30T16:47:50.000Z - merged
2025-09-30T16:47:50.000Z - closed
2025-09-30T16:48:01.000Z - head_ref_deleted
PR 198 - add uliege contributors
Updated: 2025-09-30T16:44:09.000Z closed
Commits
romintomasetti - 2025-09-30T15:33:28.000Z - e3caf7a9 - add uliege contributors Signed-off-by: romintomasetti <romin.tomasetti@gmail.com>
Events
2025-09-30T14:42:07.000Z - mentioned
2025-09-30T14:42:07.000Z - subscribed
2025-09-30T14:42:07.000Z - mentioned
2025-09-30T14:42:07.000Z - subscribed
2025-09-30T14:50:06.000Z - ready_for_review
2025-09-30T14:56:35.000Z - head_ref_force_pushed
2025-09-30T15:35:09.000Z - head_ref_force_pushed
2025-09-30T16:43:15.000Z - merged
2025-09-30T16:43:15.000Z - closed
2025-09-30T16:44:09.000Z - head_ref_deleted

kokkos/mdspan

PR 429 - Fix return type of mdspan members
Updated: 2025-10-01T13:17:49.000Z closed
Commits
numEricL - 2025-10-01T07:40:41.000Z - 12f6ce98 - size_t -> rank_type
Events
2025-10-01T13:17:49.000Z - merged
2025-10-01T13:17:49.000Z - closed