1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98
|
/*
* Copyright 2008-2013 NVIDIA Corporation
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
* You may obtain a copy of the License at
*
* http://www.apache.org/licenses/LICENSE-2.0
*
* Unless required by applicable law or agreed to in writing, software
* distributed under the License is distributed on an "AS IS" BASIS,
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
* See the License for the specific language governing permissions and
* limitations under the License.
*/
#pragma once
#include <thrust/detail/config.h>
#if THRUST_DEVICE_COMPILER == THRUST_DEVICE_COMPILER_NVCC
#include <thrust/system/cuda/config.h>
#include <thrust/system/cuda/detail/cross_system.h>
#include <thrust/detail/raw_pointer_cast.h>
#include <thrust/iterator/iterator_traits.h>
THRUST_NAMESPACE_BEGIN
namespace cuda_cub {
namespace
{
template<typename DerivedPolicy, typename Pointer>
inline __host__ __device__
typename thrust::iterator_value<Pointer>::type
get_value_msvc2005_war(execution_policy<DerivedPolicy> &exec, Pointer ptr)
{
typedef typename thrust::iterator_value<Pointer>::type result_type;
// XXX war nvbugs/881631
struct war_nvbugs_881631
{
__host__ inline static result_type host_path(execution_policy<DerivedPolicy> &exec, Pointer ptr)
{
// when called from host code, implement with assign_value
// note that this requires a type with default constructor
result_type result;
thrust::host_system_tag host_tag;
cross_system<thrust::host_system_tag, DerivedPolicy> systems(host_tag, exec);
assign_value(systems, &result, ptr);
return result;
}
__device__ inline static result_type device_path(execution_policy<DerivedPolicy> &, Pointer ptr)
{
// when called from device code, just do simple deref
return *thrust::raw_pointer_cast(ptr);
}
};
// The usual pattern for separating host and device code doesn't work here
// because it would result in a compiler warning, either about falling off
// the end of a non-void function, or about result_type's default constructor
// being a host-only function.
#ifdef _NVHPC_CUDA
if (THRUST_IS_HOST_CODE) {
return war_nvbugs_881631::host_path(exec, ptr);
} else {
return war_nvbugs_881631::device_path(exec, ptr);
}
#else
#ifndef __CUDA_ARCH__
return war_nvbugs_881631::host_path(exec, ptr);
#else
return war_nvbugs_881631::device_path(exec, ptr);
#endif // __CUDA_ARCH__
#endif
} // end get_value_msvc2005_war()
} // end anon namespace
template<typename DerivedPolicy, typename Pointer>
inline __host__ __device__
typename thrust::iterator_value<Pointer>::type
get_value(execution_policy<DerivedPolicy> &exec, Pointer ptr)
{
return get_value_msvc2005_war(exec,ptr);
} // end get_value()
} // end cuda_cub
THRUST_NAMESPACE_END
#endif
|