Changes
#6 (Jan 29, 2024, 2:07:56 PM)
- Introduce constructor for multi-GPU support. — Daniel Arndt / githubweb
- Fix typo — noreply / githubweb
- Add test — Daniel Arndt / githubweb
- Fix typo. — noreply / githubweb
- Explicitly check for valid device id — Daniel Arndt / githubweb
- Set the device id in cuda_kernel_arch — Daniel Arndt / githubweb
- Check for default device — Daniel Arndt / githubweb
- Check that device associated with stream matches requested device — Daniel Arndt / githubweb
- Remove extra constructor — Daniel Arndt / githubweb
- Remove logical memory spaces — Damien L-G / githubweb
- Address reviewer comments — Daniel Arndt / githubweb
- m_cudaDev isn't static anymore — Daniel Arndt / githubweb
- Set the device id explicitly for CUDA API calls in impl_initialize — Daniel Arndt / githubweb
- Add warp sync for Cuda parallel reduce — tccleve / githubweb
- update comment to include final() mention — tccleve / githubweb
- This PR fixes the too-much-OpenACC-warning issue, mentioned in PR #6639. — lees2 / githubweb
- add missing header fix #6644 — fnrizzi / githubweb
- Fixup test math functions ulp should double -> int — Damien L-G / githubweb
- Drop DualView converting copy assignment operator — Damien L-G / githubweb
- Don't use rocm-docker for clang-format — Daniel Arndt / githubweb
- Diable HIP CI — Daniel Arndt / githubweb
- Remove deprecation warning for AllocationMechanism for gcc <11.0 — Daniel Arndt / githubweb
- OpenMPTarget: clang extensions for dynamic shared memory. (#6380) — noreply / githubweb
- Fix builtin_unreachable use for MSVC/CUDA — crtrott / githubweb
- Fix missing include on msvc/cuda — crtrott / githubweb
- Avoid lambdas in constexpr branch for msvc/cuda — crtrott / githubweb
- Sidestep lacking CTAD support msvc/cuda — crtrott / githubweb
- Fix formatting — crtrott / githubweb
- Move header for Damien because he is right — crtrott / githubweb
- Unit test for issue 3371 (negative vector length should not yield a negative max_team_size) (#6076) — noreply / githubweb
- Add CMakeLists.txt for stream benchmark — cwpears / githubweb
- Do not negate the dependent true traits helper — Damien L-G / githubweb
- Drop guards to accommodate external code defining KOKKOS_ASSERT — Damien L-G / githubweb
- Use omp_get_max_active_levels() when supported — Daniel Arndt / githubweb
- Add missing gfx940 — rberger / githubweb
- Add Impl::always_false type-dendent false trait — Damien L-G / githubweb
- Per review prefer always_false<Arg>::value to is_void_v<Arg> — Damien L-G / githubweb
- Improve "no copy mechanism" exception message — bmkelle / githubweb
- Add a unit test for new deep_copy exception msg — bmkelle / githubweb
- Add missing include sstream — bmkelle / githubweb
- src->source, dst->destination — bmkelle / githubweb
- Workaround for ROCm 6.0 failing to compile with AVX2 SIMD support — Bruno Turcksin / githubweb
- SYCL: Force inlining of Kokkos::printf (#6650) — noreply / githubweb
- Improve handling of printf in OMPT on Intel GPUs — Daniel Arndt / githubweb
- OpenMP: Use `omp_get_nested` for older gcc versions (#6685) — noreply / githubweb
- Disable more Bessel tests for SYCL on INtel GPUs — Daniel Arndt / githubweb
- fill_random without exceution space instance should fence — Daniel Arndt / githubweb
- Drop unnecessary guarding for a tool library being loaded in ProfilingSection — Damien L-G / githubweb
- Drop unnecessary header include in Kokkos_Profiling_ProfileSection.hpp — Damien L-G / githubweb
- #5333: CUDA: Use scratch space appropriate to small reduction elements in Team reductions (#5334) — noreply / githubweb
- Cuda: Allocate using the correct device (#6392) — noreply / githubweb
- Let `Profiling::ProfilingSection(std::string)` constructor be explicit and nodiscard (#6690) — noreply / githubweb
- Cosmetic changes to ProfilingSection — Damien L-G / githubweb
- GitHub CI: Test with AddressSanitizer (#6676) — noreply / githubweb
- Kokkos::Array deduction guide (#6373) — noreply / githubweb
- Add CI for MSVC+Cuda (#6661) — noreply / githubweb
- SYCL: Address deprecations after oneAPI 2023.2.0 (#6577) — noreply / githubweb
- Fixup cast tolerance to double before printing — Damien L-G / githubweb
- Try linking against CUDA libararies even with KOKKOS_ENABLE_COMPILE_AS_CMAKE_LANGUAGE — Daniel Arndt / githubweb
- test_array_ctad: disable test for intel versions < 2021 — ndellin / githubweb
- Enable `{transform_}exclusive_scan` in place (#6667) — noreply / githubweb
- Add `ATOMICS_BYPASS` configuration option to disable atomics (#6692) — noreply / githubweb
- Check matching static extents in View constructor (#5190) — noreply / githubweb
- Remove Kokkos::[b]half_t volatile overloads (#6579) — noreply / githubweb
- add tests — fnrizzi / githubweb
- Provide `kokkos_swap` as part of Core and deprecate `Experimental::swap` in Algorithms (#6697) — noreply / githubweb
- Provide new public headers `<Kokkos_Clamp.hpp>` and `<Kokkos_MinMax.hpp>` (#6687) — noreply / githubweb
- Fix TeamThreadMDRange parallel_reduce (#6511) — noreply / githubweb
- add tests for in-place `inclusive_scan` (#6682) — noreply / githubweb
- Drop pointless Kokkos::Impl::CudaExec forward declaration — Damien L-G / githubweb
- Don't use the compiler launcher script if the compile language is CUDA. (#6704) — noreply / githubweb
- Deprecate `{Cuda,HIP}::detect_device_count()` and `Cuda::[detect_]device_arch()` (#6710) — noreply / githubweb
- Get rid of CudaInternal::cuda_get_error_{name,string}_wrapper — Damien L-G / githubweb
- No need to jump through so many hoops to print the error message — Damien L-G / githubweb
- HIP: Forgot to delete matching brace closing the namespace — Damien L-G / githubweb
- Make initialize and finalize of the Cuda/HIP singleton less special (#6714) — noreply / githubweb
- Kokkos_HIP.cpp: include Kokkos_Core.hpp to resolve errors — ndellin / githubweb
- Add bound checks in RangePolicy and MDRangePolicy (#6617) — noreply / githubweb
- Temporary fix to reenable HIP CI — Bruno Turcksin / githubweb
- Let the smart pointer manage the CUDA/HIP stream (#6721) — noreply / githubweb
- Fix Docker env variables — Bruno Turcksin / githubweb
- Ensure view_allocation_error does not silently ignore that no exception was thrown — Damien L-G / githubweb
- Add RawMemoryAllocationFailure::AllocationMechanism::OpenACCMalloc enumerator — Damien L-G / githubweb
- [OpenACC] throw if acc_malloc returned nullptr — Damien L-G / githubweb
- Fixup using declaration — Damien L-G / githubweb
- Disable openacc.view_allocation_error test — Damien L-G / githubweb
- Guard `[MD]RangePolicy` precondition check for deprecated code 4 (#6726) — noreply / githubweb
- Add C++26 standard to CMake Setup — dev / githubweb
- Add support for C++26 in generated makefiles — Damien L-G / githubweb
- Add KOKKOS_ENABLE_CXX26 to the configuration metadata — Damien L-G / githubweb
- Reenable HIP testing — Bruno Turcksin / githubweb
- Disabling failing HIP test in the CI — Bruno Turcksin / githubweb
- Use team_size_max to fix "Team size too large" error in reducer test (#6725) — noreply / githubweb
- Don't touch my records! (refactor Cuda/HIP/SYCL/Threads to not directly mess with `SharedAllocationRecord`) (#6732) — noreply / githubweb
- Untangle SharedAllocationRecord spaghetti code — Damien L-G / githubweb
- Trilinos: Don't let Kokkos set CMAKE_CXX_FLAGS — Daniel Arndt / githubweb
- Fix TestThreadVectorMDRangeParallelReduce (#6734) — noreply / githubweb
- Cuda multi-GPU support: Allow execution space instance constructor to run (#6706) — noreply / githubweb
- add more warp sync for cuda reductions — tccleve / githubweb
- Drop support for deprecated command-line arguments and environment variables (#6744) — noreply / githubweb
- Avoid unnecessary zero-memset of the scratch flags in SYCL (#6739) — noreply / githubweb
- std(remove-if): fixing tmp view alloc + avoid evaluating twice the predicate during final pass — romin.tomasetti / githubweb
#5 (Dec 4, 2023, 11:47:15 AM)
- UnorderedMap: promote `invalid_index` to `public` and use `Experimental::finite_max_v` for its value — romin.tomasetti / githubweb
#3 (Dec 1, 2023, 11:50:50 AM)
- Add jenkins multibranch pipeline options — Bruno Turcksin / githubweb
- Replace ubuntu:18.04 with ubuntu:20.04 as base image for clang-format — Bruno Turcksin / githubweb
#3 (Dec 1, 2023, 11:50:50 AM)
- Add jenkins multibranch pipeline options — Bruno Turcksin / githubweb
- Replace ubuntu:18.04 with ubuntu:20.04 as base image for clang-format — Bruno Turcksin / githubweb
#3 (Dec 1, 2023, 11:50:50 AM)
- Add jenkins multibranch pipeline options — Bruno Turcksin / githubweb
- Replace ubuntu:18.04 with ubuntu:20.04 as base image for clang-format — Bruno Turcksin / githubweb
#3 (Dec 1, 2023, 11:50:50 AM)
- Add jenkins multibranch pipeline options — Bruno Turcksin / githubweb
- Replace ubuntu:18.04 with ubuntu:20.04 as base image for clang-format — Bruno Turcksin / githubweb
#3 (Dec 1, 2023, 11:50:50 AM)
- Add jenkins multibranch pipeline options — Bruno Turcksin / githubweb
- Replace ubuntu:18.04 with ubuntu:20.04 as base image for clang-format — Bruno Turcksin / githubweb
#3 (Dec 1, 2023, 11:50:50 AM)
- Add jenkins multibranch pipeline options — Bruno Turcksin / githubweb
- Replace ubuntu:18.04 with ubuntu:20.04 as base image for clang-format — Bruno Turcksin / githubweb
#2 (Dec 1, 2023, 8:08:22 AM)
- core(macros): adding `KOKKOS_IMPL_INVALID_INDEX_TYPE` — romin.tomasetti / githubweb